HEADSC 2ghs
COMMNT S2C correlation file created: Mon May 17 05:23:38 EDT 2010
COMMNT
COMMNT If you use this database, please cite:
COMMNT
COMMNT Guoli Wang, Jonathan W. Arthur, and Roland L. Dunbrack, Jr.
COMMNT "S2C: A database correlating sequence and atomic
COMMNT coordinate numbering in the Protein Data Bank"
COMMNT dunbrack.fccc.edu/Guoli/s2c
COMMNT Copyright (c) February 2000, April 2002.
COMMNT
COMMNT SEQCRD columns are as follows:
COMMNT
COMMNT Column Positions Item
COMMNT   1      0-6     Record identifier
COMMNT   2      8       Chain
COMMNT   3      10      One letter residue code
COMMNT   4      12-14   SEQRES three letter residue code
COMMNT   5      16-18   ATOM three letter residue code
COMMNT   6      20-24   SEQRES residue number
COMMNT   7      26-31   ATOM residue number
COMMNT   8      33      PDB secondary structure
COMMNT   9      35      STRIDE secondary structure
COMMNT   10     37-43   Error flags
COMMNT
COMMNT Secondary structure annotation:
COMMNT H: Helix E: Strand T: Turn
COMMNT B: Bridge G: 310Helix C: Coil
COMMNT
SEQCRD A M MSE --- 1 - - - 367
SEQCRD A G GLY --- 2 - - - 367
SEQCRD A S SER --- 3 - - - 367
SEQCRD A D ASP --- 4 - - - 367
SEQCRD A K LYS --- 5 - - - 367
SEQCRD A I ILE --- 6 - - - 367
SEQCRD A H HIS --- 7 - - - 367
SEQCRD A H HIS --- 8 - - - 367
SEQCRD A H HIS --- 9 - - - 367
SEQCRD A H HIS --- 10 - - - 367
SEQCRD A H HIS --- 11 - - - 367
SEQCRD A H HIS --- 12 - - - 367
SEQCRD A M MSE --- 13 - - - 367
SEQCRD A N ASN --- 14 - - - 367
SEQCRD A A ALA --- 15 - - - 367
SEQCRD A P PRO --- 16 - - - 367
SEQCRD A L LEU --- 17 - - - 367
SEQCRD A S SER --- 18 - - - 367
SEQCRD A H HIS --- 19 - - - 367
SEQCRD A S SER --- 20 - - - 367
SEQCRD A R ARG --- 21 - - - 367
SEQCRD A P PRO --- 22 - - - 367
SEQCRD A M MSE --- 23 - - - 367
SEQCRD A M MSE --- 24 - - - 367
SEQCRD A Q GLN --- 25 - - - 367
SEQCRD A P PRO --- 26 - - - 367
SEQCRD A S SER --- 27 - - - 367
SEQCRD A E GLU --- 28 - - - 367
SEQCRD A D ASP --- 29 - - - 367
SEQCRD A K LYS --- 30 - - - 367
SEQCRD A S SER --- 31 - - - 367
SEQCRD A L LEU LEU 32 20 C C 4
SEQCRD A A ALA ALA 33 21 C C 4
SEQCRD A T THR THR 34 22 E E 4
SEQCRD A V VAL VAL 35 23 E E 4
SEQCRD A F PHE PHE 36 24 E E 4
SEQCRD A P PRO PRO 37 25 C C 4
SEQCRD A F PHE PHE 38 26 C C 4
SEQCRD A A ALA ALA 39 27 C C 4
SEQCRD A G GLY GLY 40 28 C C 4
SEQCRD A R ARG ARG 41 29 E E 4
SEQCRD A V VAL VAL 42 30 E E 4
SEQCRD A L LEU LEU 43 31 E E 4
SEQCRD A D ASP ASP 44 32 E E 4
SEQCRD A E GLU GLU 45 33 C T 45
SEQCRD A T THR THR 46 34 C C 4
SEQCRD A P PRO PRO 47 35 C C 4
SEQCRD A M MSE MSE 48 36 C T 45
SEQCRD A L LEU LEU 49 37 C T 45
SEQCRD A L LEU LEU 50 38 C B 45
SEQCRD A G GLY GLY 51 39 E E 4
SEQCRD A E GLU GLU 52 40 E E 4
SEQCRD A G GLY GLY 53 41 E E 4
SEQCRD A P PRO PRO 54 42 E E 4
SEQCRD A T THR THR 55 43 E E 4
SEQCRD A F PHE PHE 56 44 E E 4
SEQCRD A D ASP ASP 57 45 E E 4
SEQCRD A P PRO PRO 58 46 C T 45
SEQCRD A A ALA ALA 59 47 C T 45
SEQCRD A S SER SER 60 48 C T 45
SEQCRD A G GLY GLY 61 49 C T 45
SEQCRD A T THR THR 62 50 E E 4
SEQCRD A A ALA ALA 63 51 E E 4
SEQCRD A W TRP TRP 64 52 E E 4
SEQCRD A W TRP TRP 65 53 E E 4
SEQCRD A F PHE PHE 66 54 E E 4
SEQCRD A N ASN ASN 67 55 E E 4
SEQCRD A I ILE ILE 68 56 C G 45
SEQCRD A L LEU LEU 69 57 C G 45
SEQCRD A E GLU GLU 70 58 C G 45
SEQCRD A R ARG ARG 71 59 C C 4
SEQCRD A E GLU GLU 72 60 E E 4
SEQCRD A L LEU LEU 73 61 E E 4
SEQCRD A H HIS HIS 74 62 E E 4
SEQCRD A E GLU GLU 75 63 E E 4
SEQCRD A L LEU LEU 76 64 E E 4
SEQCRD A H HIS HIS 77 65 E E 4
SEQCRD A L LEU LEU 78 66 C T 45
SEQCRD A A ALA ALA 79 67 C T 45
SEQCRD A S SER SER 80 68 C T 45
SEQCRD A G GLY GLY 81 69 C T 45
SEQCRD A R ARG ARG 82 70 E E 4
SEQCRD A K LYS LYS 83 71 E E 4
SEQCRD A T THR THR 84 72 E E 4
SEQCRD A V VAL VAL 85 73 E E 4
SEQCRD A H HIS HIS 86 74 E E 4
SEQCRD A A ALA ALA 87 75 E E 4
SEQCRD A L LEU LEU 88 76 C C 4
SEQCRD A P PRO PRO 89 77 C C 4
SEQCRD A F PHE PHE 90 78 C C 4
SEQCRD A M MSE MSE 91 79 C C 4
SEQCRD A G GLY GLY 92 80 E E 4
SEQCRD A S SER SER 93 81 E E 4
SEQCRD A A ALA ALA 94 82 E E 4
SEQCRD A L LEU LEU 95 83 E E 4
SEQCRD A A ALA ALA 96 84 E E 4
SEQCRD A K LYS LYS 97 85 E E 4
SEQCRD A I ILE ILE 98 86 E E 4
SEQCRD A S SER SER 99 87 E E 4
SEQCRD A D ASP ASP 100 88 C T 45
SEQCRD A S SER SER 101 89 C T 45
SEQCRD A K LYS LYS 102 90 E E 4
SEQCRD A Q GLN GLN 103 91 E E 4
SEQCRD A L LEU LEU 104 92 E E 4
SEQCRD A I ILE ILE 105 93 E E 4
SEQCRD A A ALA ALA 106 94 E E 4
SEQCRD A S SER SER 107 95 E E 4
SEQCRD A D ASP ASP 108 96 C T 45
SEQCRD A D ASP ASP 109 97 C T 45
SEQCRD A G GLY GLY 110 98 E E 4
SEQCRD A L LEU LEU 111 99 E E 4
SEQCRD A F PHE PHE 112 100 E E 4
SEQCRD A L LEU LEU 113 101 E E 4
SEQCRD A R ARG ARG 114 102 E E 4
SEQCRD A D ASP ASP 115 103 E E 4
SEQCRD A T THR THR 116 104 C T 45
SEQCRD A A ALA ALA 117 105 C T 45
SEQCRD A T THR THR 118 106 C T 45
SEQCRD A G GLY GLY 119 107 C T 45
SEQCRD A V VAL VAL 120 108 C C 4
SEQCRD A L LEU LEU 121 109 E E 4
SEQCRD A T THR THR 122 110 E E 4
SEQCRD A L LEU LEU 123 111 E E 4
SEQCRD A H HIS HIS 124 112 E E 4
SEQCRD A A ALA ALA 125 113 E E 4
SEQCRD A E GLU GLU 126 114 C E 45
SEQCRD A L LEU LEU 127 115 C T 45
SEQCRD A E GLU GLU 128 116 C T 45
SEQCRD A S SER SER 129 117 C T 45
SEQCRD A D ASP ASP 130 118 C T 45
SEQCRD A L LEU LEU 131 119 C T 45
SEQCRD A P PRO PRO 132 120 C T 45
SEQCRD A G GLY GLY 133 121 C T 45
SEQCRD A N ASN ASN 134 122 E E 4
SEQCRD A R ARG ARG 135 123 E E 4
SEQCRD A S SER SER 136 124 E E 4
SEQCRD A N ASN ASN 137 125 E E 4
SEQCRD A D ASP ASP 138 126 E E 4
SEQCRD A G GLY GLY 139 127 E E 4
SEQCRD A R ARG ARG 140 128 E E 4
SEQCRD A M MSE MSE 141 129 E E 4
SEQCRD A H HIS HIS 142 130 C T 45
SEQCRD A P PRO PRO 143 131 C T 45
SEQCRD A S SER SER 144 132 C T 45
SEQCRD A G GLY GLY 145 133 C T 45
SEQCRD A A ALA ALA 146 134 C C 4
SEQCRD A L LEU LEU 147 135 E E 4
SEQCRD A W TRP TRP 148 136 E E 4
SEQCRD A I ILE ILE 149 137 E E 4
SEQCRD A G GLY GLY 150 138 E E 4
SEQCRD A T THR THR 151 139 E E 4
SEQCRD A M MSE MSE 152 140 E E 4
SEQCRD A G GLY GLY 153 141 E E 4
SEQCRD A R ARG ARG 154 142 C T 45
SEQCRD A K LYS LYS 155 143 C T 45
SEQCRD A A ALA ALA 156 144 C T 45
SEQCRD A E GLU GLU 157 145 C T 45
SEQCRD A T THR THR 158 146 C T 45
SEQCRD A G GLY GLY 159 147 C T 45
SEQCRD A A ALA ALA 160 148 C T 45
SEQCRD A G GLY GLY 161 149 E E 4
SEQCRD A S SER SER 162 150 E E 4
SEQCRD A I ILE ILE 163 151 E E 4
SEQCRD A Y TYR TYR 164 152 E E 4
SEQCRD A H HIS HIS 165 153 E E 4
SEQCRD A V VAL VAL 166 154 E E 4
SEQCRD A A ALA ALA 167 155 E E 4
SEQCRD A K LYS LYS 168 156 C T 45
SEQCRD A G GLY GLY 169 157 C T 45
SEQCRD A K LYS LYS 170 158 E E 4
SEQCRD A V VAL VAL 171 159 E E 4
SEQCRD A T THR THR 172 160 E E 4
SEQCRD A K LYS LYS 173 161 E E 4
SEQCRD A L LEU LEU 174 162 E E 4
SEQCRD A F PHE PHE 175 163 E E 4
SEQCRD A A ALA ALA 176 164 E E 4
SEQCRD A D ASP ASP 177 165 E E 4
SEQCRD A I ILE ILE 178 166 E E 4
SEQCRD A S SER SER 179 167 C T 45
SEQCRD A I ILE ILE 180 168 C T 45
SEQCRD A P PRO PRO 181 169 E E 4
SEQCRD A N ASN ASN 182 170 E E 4
SEQCRD A S SER SER 183 171 E E 4
SEQCRD A I ILE ILE 184 172 E E 4
SEQCRD A C CYS CYS 185 173 E E 4
SEQCRD A F PHE PHE 186 174 E E 4
SEQCRD A S SER SER 187 175 C T 45
SEQCRD A P PRO PRO 188 176 C T 45
SEQCRD A D ASP ASP 189 177 C T 45
SEQCRD A G GLY GLY 190 178 C T 45
SEQCRD A T THR THR 191 179 C C 4
SEQCRD A T THR THR 192 180 E E 4
SEQCRD A G GLY GLY 193 181 E E 4
SEQCRD A Y TYR TYR 194 182 E E 4
SEQCRD A F PHE PHE 195 183 E E 4
SEQCRD A V VAL VAL 196 184 E E 4
SEQCRD A D ASP ASP 197 185 E E 4
SEQCRD A T THR THR 198 186 C T 45
SEQCRD A K LYS LYS 199 187 C T 45
SEQCRD A V VAL VAL 200 188 C T 45
SEQCRD A N ASN ASN 201 189 C T 45
SEQCRD A R ARG ARG 202 190 E E 4
SEQCRD A L LEU LEU 203 191 E E 4
SEQCRD A M MSE MSE 204 192 E E 4
SEQCRD A R ARG ARG 205 193 E E 4
SEQCRD A V VAL VAL 206 194 E E 4
SEQCRD A P PRO PRO 207 195 E E 4
SEQCRD A L LEU LEU 208 196 C E 45
SEQCRD A D ASP ASP 209 197 C E 45
SEQCRD A A ALA ALA 210 198 C T 45
SEQCRD A R ARG ARG 211 199 C T 45
SEQCRD A T THR THR 212 200 C T 45
SEQCRD A G GLY GLY 213 201 C T 45
SEQCRD A L LEU LEU 214 202 C E 45
SEQCRD A P PRO PRO 215 203 C E 45
SEQCRD A T THR THR 216 204 C C 4
SEQCRD A G GLY GLY 217 205 C C 4
SEQCRD A K LYS LYS 218 206 C C 4
SEQCRD A A ALA ALA 219 207 C C 4
SEQCRD A E GLU GLU 220 208 E E 4
SEQCRD A V VAL VAL 221 209 E E 4
SEQCRD A F PHE PHE 222 210 E E 4
SEQCRD A I ILE ILE 223 211 E E 4
SEQCRD A D ASP ASP 224 212 E E 4
SEQCRD A S SER SER 225 213 C T 45
SEQCRD A T THR THR 226 214 C T 45
SEQCRD A G GLY GLY 227 215 C T 45
SEQCRD A I ILE ILE 228 216 C T 45
SEQCRD A K LYS LYS 229 217 C C 4
SEQCRD A G GLY GLY 230 218 C C 4
SEQCRD A G GLY GLY 231 219 E E 4
SEQCRD A M MSE MSE 232 220 E E 4
SEQCRD A D ASP ASP 233 221 E E 4
SEQCRD A G GLY GLY 234 222 E E 4
SEQCRD A S SER SER 235 223 E E 4
SEQCRD A V VAL VAL 236 224 E E 4
SEQCRD A C CYS CYS 237 225 E E 4
SEQCRD A D ASP ASP 238 226 C T 45
SEQCRD A A ALA ALA 239 227 C T 45
SEQCRD A E GLU GLU 240 228 C T 45
SEQCRD A G GLY GLY 241 229 C T 45
SEQCRD A H HIS HIS 242 230 C C 4
SEQCRD A I ILE ILE 243 231 E E 4
SEQCRD A W TRP TRP 244 232 E E 4
SEQCRD A N ASN ASN 245 233 E E 4
SEQCRD A A ALA ALA 246 234 E E 4
SEQCRD A R ARG ARG 247 235 E E 4
SEQCRD A W TRP TRP 248 236 E E 4
SEQCRD A G GLY GLY 249 237 C T 45
SEQCRD A E GLU GLU 250 238 C T 45
SEQCRD A G GLY GLY 251 239 C T 45
SEQCRD A A ALA ALA 252 240 E E 4
SEQCRD A V VAL VAL 253 241 E E 4
SEQCRD A D ASP ASP 254 242 E E 4
SEQCRD A R ARG ARG 255 243 E E 4
SEQCRD A Y TYR TYR 256 244 E E 4
SEQCRD A D ASP ASP 257 245 C T 45
SEQCRD A T THR THR 258 246 C T 45
SEQCRD A D ASP ASP 259 247 C T 45
SEQCRD A G GLY GLY 260 248 C T 45
SEQCRD A N ASN ASN 261 249 C C 4
SEQCRD A H HIS HIS 262 250 E E 4
SEQCRD A I ILE ILE 263 251 E E 4
SEQCRD A A ALA ALA 264 252 E E 4
SEQCRD A R ARG ARG 265 253 E E 4
SEQCRD A Y TYR TYR 266 254 E E 4
SEQCRD A E GLU GLU 267 255 E E 4
SEQCRD A V VAL VAL 268 256 C C 4
SEQCRD A P PRO PRO 269 257 C C 4
SEQCRD A G GLY GLY 270 258 C T 45
SEQCRD A K LYS LYS 271 259 C T 45
SEQCRD A Q GLN GLN 272 260 C B 45
SEQCRD A T THR THR 273 261 E E 4
SEQCRD A T THR THR 274 262 E E 4
SEQCRD A C CYS CYS 275 263 E E 4
SEQCRD A P PRO PRO 276 264 E E 4
SEQCRD A A ALA ALA 277 265 E E 4
SEQCRD A F PHE PHE 278 266 E E 4
SEQCRD A I ILE ILE 279 267 E E 4
SEQCRD A G GLY GLY 280 268 C T 45
SEQCRD A P PRO PRO 281 269 C T 45
SEQCRD A D ASP ASP 282 270 C T 45
SEQCRD A A ALA ALA 283 271 C T 45
SEQCRD A S SER SER 284 272 C C 4
SEQCRD A R ARG ARG 285 273 E E 4
SEQCRD A L LEU LEU 286 274 E E 4
SEQCRD A L LEU LEU 287 275 E E 4
SEQCRD A V VAL VAL 288 276 E E 4
SEQCRD A T THR THR 289 277 E E 4
SEQCRD A S SER SER 290 278 E E 4
SEQCRD A A ALA ALA 291 279 C B 45
SEQCRD A R ARG ARG 292 280 C T 45
SEQCRD A E GLU GLU 293 281 C T 45
SEQCRD A H HIS HIS 294 282 C T 45
SEQCRD A L LEU LEU 295 283 C T 45
SEQCRD A D ASP ASP 296 284 H C 45
SEQCRD A D ASP ASP 297 285 H H 4
SEQCRD A D ASP ASP 298 286 H H 4
SEQCRD A A ALA ALA 299 287 H H 4
SEQCRD A I ILE ILE 300 288 H H 4
SEQCRD A T THR THR 301 289 H H 4
SEQCRD A A ALA ALA 302 290 H H 4
SEQCRD A N ASN ASN 303 291 H T 45
SEQCRD A P PRO PRO 304 292 C T 45
SEQCRD A Q GLN GLN 305 293 C T 45
SEQCRD A H HIS HIS 306 294 C T 45
SEQCRD A G GLY GLY 307 295 C T 45
SEQCRD A L LEU LEU 308 296 C T 45
SEQCRD A T THR THR 309 297 E E 4
SEQCRD A F PHE PHE 310 298 E E 4
SEQCRD A E GLU GLU 311 299 E E 4
SEQCRD A L LEU LEU 312 300 C E 45
SEQCRD A G GLY GLY 313 301 C C 4
SEQCRD A I ILE ILE 314 302 C C 4
SEQCRD A E GLU GLU 315 303 C C 4
SEQCRD A V VAL VAL 316 304 C C 4
SEQCRD A K LYS LYS 317 305 C C 4
SEQCRD A G GLY GLY 318 306 C C 4
SEQCRD A R ARG ARG 319 307 C C 4
SEQCRD A F PHE PHE 320 308 C C 4
SEQCRD A E GLU GLU 321 309 C C 4
SEQCRD A P PRO PRO 322 310 C C 4
SEQCRD A L LEU LEU 323 311 C C 4
SEQCRD A Y TYR TYR 324 312 C C 4
SEQCRD A R ARG ARG 325 313 C C 4
SEQCRD A L LEU LEU 326 314 C C 4
COMMNT
S2CERR 1 0 No standard amino acid code
S2CERR 2 0 SEQRES and ATOM residue names differ
S2CERR 3 31 No ATOM record
S2CERR 4 295 SEQRES and ATOM residue numbers differ
S2CERR 5 97 PDB and STRIDE secondary structures differ
S2CERR 6 31 PDB secondary structure is absent
S2CERR 7 31 STRIDE secondary structure is absent
COMMNT
COMMNT Crystallographic technical parameters:
PARAME method 'X-RAY DIFFRACTION'
PARAME resolution 1.550
PARAME R-factor 0.13911
PARAME B-factor 11.803
COMMNT
COMMNT Reference database information:
DATABA source:
DATABA UNP: Q7D0W3_AGRT5 (Q7D0W3)
COMMNT
DATABA mutation:
DATABA MSE A 1 --> . ? 'LEADER SEQUENCE'
DATABA HIS A 9 --> . ? 'LEADER SEQUENCE'
DATABA LYS A 5 --> . ? 'LEADER SEQUENCE'
DATABA MSE A 91 --> MET 79 'MODIFIED RESIDUE'
DATABA MSE A 232 --> MET 220 'MODIFIED RESIDUE'
DATABA HIS A 10 --> . ? 'LEADER SEQUENCE'
DATABA HIS A 7 --> . ? 'LEADER SEQUENCE'
DATABA MSE A 13 --> MET 1 'MODIFIED RESIDUE'
DATABA MSE A 23 --> MET 11 'MODIFIED RESIDUE'
DATABA MSE A 24 --> MET 12 'MODIFIED RESIDUE'
DATABA GLY A 2 --> . ? 'LEADER SEQUENCE'
DATABA HIS A 8 --> . ? 'LEADER SEQUENCE'
DATABA MSE A 48 --> MET 36 'MODIFIED RESIDUE'
DATABA SER A 3 --> . ? 'LEADER SEQUENCE'
DATABA ASP A 4 --> . ? 'LEADER SEQUENCE'
DATABA MSE A 204 --> MET 192 'MODIFIED RESIDUE'
DATABA MSE A 152 --> MET 140 'MODIFIED RESIDUE'
DATABA ILE A 6 --> . ? 'LEADER SEQUENCE'
DATABA HIS A 12 --> . ? 'LEADER SEQUENCE'
DATABA MSE A 141 --> MET 129 'MODIFIED RESIDUE'
DATABA HIS A 11 --> . ? 'LEADER SEQUENCE'
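
The SEQCRD column list in the header can be turned into a small reader. The following is a minimal sketch, not part of the S2C distribution: the names `SeqCrd` and `parse_seqcrd` are hypothetical, and it assumes the ten fields are separated by whitespace with a non-blank chain identifier, as in the records of this file, rather than relying on the fixed column positions. Fields made up only of dashes (no ATOM record, no secondary structure) are returned as `None`, and the error-flag field is read as one digit per S2CERR code, so `367` means codes 3, 6 and 7.

```python
from typing import FrozenSet, NamedTuple, Optional


class SeqCrd(NamedTuple):
    """One SEQCRD record (hypothetical container, not an S2C API)."""
    chain: str
    one_letter: str
    seqres_name: str
    atom_name: Optional[str]
    seqres_num: int
    atom_num: Optional[str]   # kept as text: PDB numbering may carry insertion codes
    pdb_ss: Optional[str]
    stride_ss: Optional[str]
    error_flags: FrozenSet[int]


def _none_if_dash(token: str) -> Optional[str]:
    # "-" and "---" mark an absent value; "-5" (a negative number) is kept.
    return None if set(token) == {"-"} else token


def parse_seqcrd(line: str) -> SeqCrd:
    """Parse one whitespace-separated SEQCRD record.

    Assumes exactly ten non-blank fields, which holds for this file
    but may not hold for entries with a blank chain identifier.
    """
    fields = line.split()
    if len(fields) != 10 or fields[0] != "SEQCRD":
        raise ValueError(f"not a 10-field SEQCRD record: {line!r}")
    (_, chain, one, seqres_name, atom_name,
     seqres_num, atom_num, pdb_ss, stride_ss, flags) = fields
    return SeqCrd(
        chain=chain,
        one_letter=one,
        seqres_name=seqres_name,
        atom_name=_none_if_dash(atom_name),
        seqres_num=int(seqres_num),
        atom_num=_none_if_dash(atom_num),
        pdb_ss=_none_if_dash(pdb_ss),
        stride_ss=_none_if_dash(stride_ss),
        error_flags=frozenset(int(c) for c in flags if c.isdigit()),
    )
```

Filtering the parsed records on `atom_num is not None` gives the SEQRES-to-ATOM numbering map for residues that have coordinates; in this file that is SEQRES 32 through 326 mapped to ATOM 20 through 314.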