HEADSC 2ggm
COMMNT S2C correlation file created: Sat Apr 29 08:15:26 EDT 2006
COMMNT
COMMNT If you use this database, please cite:
COMMNT
COMMNT Guoli Wang, Jonathan W. Arthur, and Roland L. Dunbrack, Jr.
COMMNT "S2C: A database correlating sequence and atomic
COMMNT coordinate numbering in the Protein Data Bank"
COMMNT dunbrack.fccc.edu/Guoli/s2c
COMMNT Copyright (c) February 2000, April 2002.
COMMNT
COMMNT SEQCRD columns are as follows:
COMMNT
COMMNT Column Positions Item
COMMNT 1 0-6 Record identifier
COMMNT 2 8 Chain
COMMNT 3 10 One letter residue code
COMMNT 4 12-14 SEQRES three letter residue code
COMMNT 5 16-18 ATOM three letter residue code
COMMNT 6 20-24 SEQRES residue number
COMMNT 7 26-31 ATOM residue number
COMMNT 8 33 PDB secondary structure
COMMNT 9 35 STRIDE secondary structure
COMMNT 10 37-43 Error flags
COMMNT
COMMNT Secondary structrue annotation:
COMMNT H: Helix E: Strand T: Turn
COMMNT B: Bridge G: 310Helix C: Coil
COMMNT
SEQCRD A M MET --- 1 - - - 367
SEQCRD A A ALA --- 2 - - - 367
SEQCRD A S SER --- 3 - - - 367
SEQCRD A N ASN --- 4 - - - 367
SEQCRD A F PHE --- 5 - - - 367
SEQCRD A K LYS --- 6 - - - 367
SEQCRD A K LYS --- 7 - - - 367
SEQCRD A A ALA --- 8 - - - 367
SEQCRD A N ASN --- 9 - - - 367
SEQCRD A M MET --- 10 - - - 367
SEQCRD A A ALA --- 11 - - - 367
SEQCRD A S SER --- 12 - - - 367
SEQCRD A S SER --- 13 - - - 367
SEQCRD A S SER --- 14 - - - 367
SEQCRD A Q GLN --- 15 - - - 367
SEQCRD A R ARG --- 16 - - - 367
SEQCRD A K LYS --- 17 - - - 367
SEQCRD A R ARG --- 18 - - - 367
SEQCRD A M MET --- 19 - - - 367
SEQCRD A S SER --- 20 - - - 367
SEQCRD A P PRO --- 21 - - - 367
SEQCRD A K LYS --- 22 - - - 367
SEQCRD A P PRO --- 23 - - - 367
SEQCRD A E GLU GLU 24 24 C C -
SEQCRD A L LEU LEU 25 25 C C -
SEQCRD A T THR THR 26 26 C C -
SEQCRD A E GLU GLU 27 27 H H -
SEQCRD A E GLU GLU 28 28 H H -
SEQCRD A Q GLN GLN 29 29 H H -
SEQCRD A K LYS LYS 30 30 H H -
SEQCRD A Q GLN GLN 31 31 H H -
SEQCRD A E GLU GLU 32 32 H H -
SEQCRD A I ILE ILE 33 33 H H -
SEQCRD A R ARG ARG 34 34 H H -
SEQCRD A E GLU GLU 35 35 H H -
SEQCRD A A ALA ALA 36 36 H H -
SEQCRD A F PHE PHE 37 37 H H -
SEQCRD A D ASP ASP 38 38 H H -
SEQCRD A L LEU LEU 39 39 H H -
SEQCRD A F PHE PHE 40 40 H H -
SEQCRD A D ASP ASP 41 41 C T 5
SEQCRD A A ALA ALA 42 42 C T 5
SEQCRD A D ASP ASP 43 43 C T 5
SEQCRD A G GLY GLY 44 44 C T 5
SEQCRD A T THR THR 45 45 C C -
SEQCRD A G GLY GLY 46 46 C C -
SEQCRD A T THR THR 47 47 E E -
SEQCRD A I ILE ILE 48 48 E E -
SEQCRD A D ASP ASP 49 49 E E -
SEQCRD A V VAL VAL 50 50 C G 5
SEQCRD A K LYS LYS 51 51 C G 5
SEQCRD A E GLU GLU 52 52 H G 5
SEQCRD A L LEU LEU 53 53 H H -
SEQCRD A K LYS LYS 54 54 H H -
SEQCRD A V VAL VAL 55 55 H H -
SEQCRD A A ALA ALA 56 56 H H -
SEQCRD A M MSE MSE 57 57 H H -
SEQCRD A R ARG ARG 58 58 H H -
SEQCRD A A ALA ALA 59 59 H H -
SEQCRD A L LEU LEU 60 60 H H -
SEQCRD A G GLY GLY 61 61 C C -
SEQCRD A F PHE PHE 62 62 C C -
SEQCRD A E GLU GLU 63 63 C C -
SEQCRD A P PRO PRO 64 64 C C -
SEQCRD A K LYS LYS 65 65 H C 5
SEQCRD A K LYS LYS 66 66 H H -
SEQCRD A E GLU GLU 67 67 H H -
SEQCRD A E GLU GLU 68 68 H H -
SEQCRD A I ILE ILE 69 69 H H -
SEQCRD A K LYS LYS 70 70 H H -
SEQCRD A K LYS LYS 71 71 H H -
SEQCRD A M MSE MSE 72 72 H H -
SEQCRD A I ILE ILE 73 73 H H -
SEQCRD A S SER SER 74 74 H H -
SEQCRD A E GLU GLU 75 75 H H -
SEQCRD A I ILE ILE 76 76 H H -
SEQCRD A D ASP ASP 77 77 H H -
SEQCRD A K LYS LYS 78 78 C T 5
SEQCRD A E GLU GLU 79 79 C T 5
SEQCRD A G GLY GLY 80 80 C T 5
SEQCRD A T THR THR 81 81 C C -
SEQCRD A G GLY GLY 82 82 C C -
SEQCRD A K LYS LYS 83 83 E E -
SEQCRD A M MSE MSE 84 84 E E -
SEQCRD A N ASN ASN 85 85 E E -
SEQCRD A F PHE PHE 86 86 H H -
SEQCRD A G GLY GLY 87 87 H H -
SEQCRD A D ASP ASP 88 88 H H -
SEQCRD A F PHE PHE 89 89 H H -
SEQCRD A L LEU LEU 90 90 H H -
SEQCRD A T THR THR 91 91 H H -
SEQCRD A V VAL VAL 92 92 H H -
SEQCRD A M MSE MSE 93 93 H H -
SEQCRD A T THR THR 94 94 H H -
SEQCRD A Q GLN GLN 95 95 H H -
SEQCRD A K LYS LYS 96 96 H H -
SEQCRD A M MSE MSE 97 97 H H -
SEQCRD A S SER SER 98 98 H H -
SEQCRD A E GLU GLU 99 99 H H -
SEQCRD A K LYS LYS 100 100 H H -
SEQCRD A D ASP ASP 101 101 H H -
SEQCRD A T THR THR 102 102 H H -
SEQCRD A K LYS LYS 103 103 H H -
SEQCRD A E GLU GLU 104 104 H H -
SEQCRD A E GLU GLU 105 105 H H -
SEQCRD A I ILE ILE 106 106 H H -
SEQCRD A L LEU LEU 107 107 H H -
SEQCRD A K LYS LYS 108 108 H H -
SEQCRD A A ALA ALA 109 109 H H -
SEQCRD A F PHE PHE 110 110 H H -
SEQCRD A K LYS LYS 111 111 H H -
SEQCRD A L LEU LEU 112 112 H H -
SEQCRD A F PHE PHE 113 113 H H -
SEQCRD A D ASP ASP 114 114 H T 5
SEQCRD A D ASP ASP 115 115 C T 5
SEQCRD A D ASP ASP 116 116 C T 5
SEQCRD A E GLU GLU 117 117 C T 5
SEQCRD A T THR THR 118 118 C C -
SEQCRD A G GLY GLY 119 119 C C -
SEQCRD A K LYS LYS 120 120 C C -
SEQCRD A I ILE ILE 121 121 C B 5
SEQCRD A S SER SER 122 122 H C 5
SEQCRD A F PHE PHE 123 123 H H -
SEQCRD A K LYS LYS 124 124 H H -
SEQCRD A N ASN ASN 125 125 H H -
SEQCRD A L LEU LEU 126 126 H H -
SEQCRD A K LYS LYS 127 127 H H -
SEQCRD A R ARG ARG 128 128 H H -
SEQCRD A V VAL VAL 129 129 H H -
SEQCRD A A ALA ALA 130 130 H H -
SEQCRD A K LYS LYS 131 131 H H -
SEQCRD A E GLU GLU 132 132 H H -
SEQCRD A L LEU LEU 133 133 H H -
SEQCRD A G GLY GLY 134 134 C C -
SEQCRD A E GLU GLU 135 135 C C -
SEQCRD A N ASN ASN 136 136 C C -
SEQCRD A L LEU LEU 137 137 C C -
SEQCRD A T THR THR 138 138 H C 5
SEQCRD A D ASP ASP 139 139 H H -
SEQCRD A E GLU GLU 140 140 H H -
SEQCRD A E GLU GLU 141 141 H H -
SEQCRD A L LEU LEU 142 142 H H -
SEQCRD A Q GLN GLN 143 143 H H -
SEQCRD A E GLU GLU 144 144 H H -
SEQCRD A M MSE MSE 145 145 H H -
SEQCRD A I ILE ILE 146 146 H H -
SEQCRD A D ASP ASP 147 147 H H -
SEQCRD A E GLU GLU 148 148 H H -
SEQCRD A A ALA ALA 149 149 H H -
SEQCRD A D ASP ASP 150 150 H T 5
SEQCRD A R ARG ARG 151 151 C T 5
SEQCRD A D ASP ASP 152 152 C T 5
SEQCRD A G GLY GLY 153 153 C T 5
SEQCRD A D ASP ASP 154 154 C C -
SEQCRD A G GLY GLY 155 155 C C -
SEQCRD A E GLU GLU 156 156 C C -
SEQCRD A V VAL VAL 157 157 C B 5
SEQCRD A S SER SER 158 158 H C 5
SEQCRD A E GLU GLU 159 159 H H -
SEQCRD A Q GLN GLN 160 160 H H -
SEQCRD A E GLU GLU 161 161 H H -
SEQCRD A F PHE PHE 162 162 H H -
SEQCRD A L LEU LEU 163 163 H H -
SEQCRD A R ARG ARG 164 164 H H -
SEQCRD A I ILE ILE 165 165 H H -
SEQCRD A M MSE MSE 166 166 H H -
SEQCRD A K LYS LYS 167 167 H H -
SEQCRD A K LYS LYS 168 168 H H -
SEQCRD A T THR THR 169 169 C C -
SEQCRD A S SER SER 170 170 C C -
SEQCRD A L LEU LEU 171 171 C C -
SEQCRD A Y TYR TYR 172 172 C C -
SEQCRD B M MET --- 1 - - - 367
SEQCRD B A ALA --- 2 - - - 367
SEQCRD B S SER --- 3 - - - 367
SEQCRD B N ASN --- 4 - - - 367
SEQCRD B F PHE --- 5 - - - 367
SEQCRD B K LYS --- 6 - - - 367
SEQCRD B K LYS --- 7 - - - 367
SEQCRD B A ALA --- 8 - - - 367
SEQCRD B N ASN --- 9 - - - 367
SEQCRD B M MET --- 10 - - - 367
SEQCRD B A ALA --- 11 - - - 367
SEQCRD B S SER --- 12 - - - 367
SEQCRD B S SER --- 13 - - - 367
SEQCRD B S SER --- 14 - - - 367
SEQCRD B Q GLN --- 15 - - - 367
SEQCRD B R ARG --- 16 - - - 367
SEQCRD B K LYS --- 17 - - - 367
SEQCRD B R ARG --- 18 - - - 367
SEQCRD B M MET --- 19 - - - 367
SEQCRD B S SER --- 20 - - - 367
SEQCRD B P PRO --- 21 - - - 367
SEQCRD B K LYS --- 22 - - - 367
SEQCRD B P PRO --- 23 - - - 367
SEQCRD B E GLU --- 24 - - - 367
SEQCRD B L LEU LEU 25 25 C C -
SEQCRD B T THR THR 26 26 C C -
SEQCRD B E GLU GLU 27 27 C C -
SEQCRD B E GLU GLU 28 28 H H -
SEQCRD B Q GLN GLN 29 29 H H -
SEQCRD B K LYS LYS 30 30 H H -
SEQCRD B Q GLN GLN 31 31 H H -
SEQCRD B E GLU GLU 32 32 H H -
SEQCRD B I ILE ILE 33 33 H H -
SEQCRD B R ARG ARG 34 34 H H -
SEQCRD B E GLU GLU 35 35 H H -
SEQCRD B A ALA ALA 36 36 H H -
SEQCRD B F PHE PHE 37 37 H H -
SEQCRD B D ASP ASP 38 38 H G 5
SEQCRD B L LEU LEU 39 39 H G 5
SEQCRD B F PHE PHE 40 40 H G 5
SEQCRD B D ASP ASP 41 41 H T 5
SEQCRD B A ALA ALA 42 42 C T 5
SEQCRD B D ASP ASP 43 43 C T 5
SEQCRD B G GLY GLY 44 44 C T 5
SEQCRD B T THR THR 45 45 C C -
SEQCRD B G GLY GLY 46 46 C C -
SEQCRD B T THR THR 47 47 E E -
SEQCRD B I ILE ILE 48 48 E E -
SEQCRD B D ASP ASP 49 49 E E -
SEQCRD B V VAL VAL 50 50 C G 5
SEQCRD B K LYS LYS 51 51 C G 5
SEQCRD B E GLU GLU 52 52 H G 5
SEQCRD B L LEU LEU 53 53 H H -
SEQCRD B K LYS LYS 54 54 H H -
SEQCRD B V VAL VAL 55 55 H H -
SEQCRD B A ALA ALA 56 56 H H -
SEQCRD B M MSE MSE 57 57 H H -
SEQCRD B R ARG ARG 58 58 H H -
SEQCRD B A ALA ALA 59 59 H H -
SEQCRD B L LEU LEU 60 60 H H -
SEQCRD B G GLY GLY 61 61 C C -
SEQCRD B F PHE PHE 62 62 C C -
SEQCRD B E GLU GLU 63 63 C C -
SEQCRD B P PRO PRO 64 64 C C -
SEQCRD B K LYS LYS 65 65 H C 5
SEQCRD B K LYS LYS 66 66 H H -
SEQCRD B E GLU GLU 67 67 H H -
SEQCRD B E GLU GLU 68 68 H H -
SEQCRD B I ILE ILE 69 69 H H -
SEQCRD B K LYS LYS 70 70 H H -
SEQCRD B K LYS LYS 71 71 H H -
SEQCRD B M MSE MSE 72 72 H H -
SEQCRD B I ILE ILE 73 73 H H -
SEQCRD B S SER SER 74 74 H H -
SEQCRD B E GLU GLU 75 75 H H -
SEQCRD B I ILE ILE 76 76 H H -
SEQCRD B D ASP ASP 77 77 H H -
SEQCRD B K LYS LYS 78 78 C T 5
SEQCRD B E GLU GLU 79 79 C T 5
SEQCRD B G GLY GLY 80 80 C T 5
SEQCRD B T THR THR 81 81 C C -
SEQCRD B G GLY GLY 82 82 C C -
SEQCRD B K LYS LYS 83 83 E E -
SEQCRD B M MSE MSE 84 84 E E -
SEQCRD B N ASN ASN 85 85 E E -
SEQCRD B F PHE PHE 86 86 H H -
SEQCRD B G GLY GLY 87 87 H H -
SEQCRD B D ASP ASP 88 88 H H -
SEQCRD B F PHE PHE 89 89 H H -
SEQCRD B L LEU LEU 90 90 H H -
SEQCRD B T THR THR 91 91 H H -
SEQCRD B V VAL VAL 92 92 H H -
SEQCRD B M MSE MSE 93 93 H H -
SEQCRD B T THR THR 94 94 H H -
SEQCRD B Q GLN GLN 95 95 H H -
SEQCRD B K LYS LYS 96 96 H H -
SEQCRD B M MSE MSE 97 97 H H -
SEQCRD B S SER SER 98 98 H H -
SEQCRD B E GLU GLU 99 99 H H -
SEQCRD B K LYS LYS 100 100 H H -
SEQCRD B D ASP ASP 101 101 H H -
SEQCRD B T THR THR 102 102 H H -
SEQCRD B K LYS LYS 103 103 H H -
SEQCRD B E GLU GLU 104 104 H H -
SEQCRD B E GLU GLU 105 105 H H -
SEQCRD B I ILE ILE 106 106 H H -
SEQCRD B L LEU LEU 107 107 H H -
SEQCRD B K LYS LYS 108 108 H H -
SEQCRD B A ALA ALA 109 109 H H -
SEQCRD B F PHE PHE 110 110 H H -
SEQCRD B K LYS LYS 111 111 H H -
SEQCRD B L LEU LEU 112 112 H H -
SEQCRD B F PHE PHE 113 113 H H -
SEQCRD B D ASP ASP 114 114 H T 5
SEQCRD B D ASP ASP 115 115 C T 5
SEQCRD B D ASP ASP 116 116 C T 5
SEQCRD B E GLU GLU 117 117 C T 5
SEQCRD B T THR THR 118 118 C C -
SEQCRD B G GLY GLY 119 119 C C -
SEQCRD B K LYS LYS 120 120 C C -
SEQCRD B I ILE ILE 121 121 C E 5
SEQCRD B S SER SER 122 122 H E 5
SEQCRD B F PHE PHE 123 123 H H -
SEQCRD B K LYS LYS 124 124 H H -
SEQCRD B N ASN ASN 125 125 H H -
SEQCRD B L LEU LEU 126 126 H H -
SEQCRD B K LYS LYS 127 127 H H -
SEQCRD B R ARG ARG 128 128 H H -
SEQCRD B V VAL VAL 129 129 H H -
SEQCRD B A ALA ALA 130 130 H H -
SEQCRD B K LYS LYS 131 131 H H -
SEQCRD B E GLU GLU 132 132 H H -
SEQCRD B L LEU LEU 133 133 H H -
SEQCRD B G GLY GLY 134 134 C C -
SEQCRD B E GLU GLU 135 135 C C -
SEQCRD B N ASN ASN 136 136 C C -
SEQCRD B L LEU LEU 137 137 C C -
SEQCRD B T THR THR 138 138 H C 5
SEQCRD B D ASP ASP 139 139 H H -
SEQCRD B E GLU GLU 140 140 H H -
SEQCRD B E GLU GLU 141 141 H H -
SEQCRD B L LEU LEU 142 142 H H -
SEQCRD B Q GLN GLN 143 143 H H -
SEQCRD B E GLU GLU 144 144 H H -
SEQCRD B M MSE MSE 145 145 H H -
SEQCRD B I ILE ILE 146 146 H H -
SEQCRD B D ASP ASP 147 147 H H -
SEQCRD B E GLU GLU 148 148 H H -
SEQCRD B A ALA ALA 149 149 H H -
SEQCRD B D ASP ASP 150 150 H T 5
SEQCRD B R ARG ARG 151 151 C T 5
SEQCRD B D ASP ASP 152 152 C T 5
SEQCRD B G GLY GLY 153 153 C T 5
SEQCRD B D ASP ASP 154 154 C C -
SEQCRD B G GLY GLY 155 155 C C -
SEQCRD B E GLU GLU 156 156 C E 5
SEQCRD B V VAL VAL 157 157 C E 5
SEQCRD B S SER SER 158 158 H C 5
SEQCRD B E GLU GLU 159 159 H H -
SEQCRD B Q GLN GLN 160 160 H H -
SEQCRD B E GLU GLU 161 161 H H -
SEQCRD B F PHE PHE 162 162 H H -
SEQCRD B L LEU LEU 163 163 H H -
SEQCRD B R ARG ARG 164 164 H H -
SEQCRD B I ILE ILE 165 165 H H -
SEQCRD B M MSE MSE 166 166 H H -
SEQCRD B K LYS LYS 167 167 H H -
SEQCRD B K LYS LYS 168 168 H C 5
SEQCRD B T THR --- 169 - - - 367
SEQCRD B S SER --- 170 - - - 367
SEQCRD B L LEU --- 171 - - - 367
SEQCRD B Y TYR --- 172 - - - 367
SEQCRD D N ASN ASN 1 847 H C 45
SEQCRD D W TRP TRP 2 848 H H 4
SEQCRD D K LYS LYS 3 849 H H 4
SEQCRD D L LEU LEU 4 850 H H 4
SEQCRD D L LEU LEU 5 851 H H 4
SEQCRD D A ALA ALA 6 852 H H 4
SEQCRD D K LYS LYS 7 853 H H 4
SEQCRD D G GLY GLY 8 854 H H 4
SEQCRD D L LEU LEU 9 855 H H 4
SEQCRD D L LEU LEU 10 856 H H 4
SEQCRD D I ILE ILE 11 857 H H 4
SEQCRD D R ARG ARG 12 858 H H 4
SEQCRD D E GLU GLU 13 859 H T 45
SEQCRD D R ARG ARG 14 860 H T 45
SEQCRD D L LEU LEU 15 861 H T 45
SEQCRD D K LYS LYS 16 862 H T 45
SEQCRD D R ARG ARG 17 863 H C 45
SEQCRD C N ASN ASN 1 847 H C 45
SEQCRD C W TRP TRP 2 848 H H 4
SEQCRD C K LYS LYS 3 849 H H 4
SEQCRD C L LEU LEU 4 850 H H 4
SEQCRD C L LEU LEU 5 851 H H 4
SEQCRD C A ALA ALA 6 852 H H 4
SEQCRD C K LYS LYS 7 853 H H 4
SEQCRD C G GLY GLY 8 854 H H 4
SEQCRD C L LEU LEU 9 855 H H 4
SEQCRD C L LEU LEU 10 856 H H 4
SEQCRD C I ILE ILE 11 857 H H 4
SEQCRD C R ARG ARG 12 858 H H 4
SEQCRD C E GLU GLU 13 859 H H 4
SEQCRD C R ARG ARG 14 860 H H 4
SEQCRD C L LEU LEU 15 861 H H 4
SEQCRD C K LYS LYS 16 862 H H 4
SEQCRD C R ARG ARG 17 863 H C 45
COMMNT
S2CERR 1 0 No standard amino acid code
S2CERR 2 0 SEQRES and ATOM residue names differ
S2CERR 3 51 No ATOM record
S2CERR 4 34 SEQRES and ATOM residue numbers differ
S2CERR 5 61 PDB and STRIDE secondary structures differ
S2CERR 6 51 PDB secondary structure is absent
S2CERR 7 51 STRIDE secondary structure is absent
COMMNT
COMMNT Crystallographic technical parameters:
PARAME method 'X-RAY DIFFRACTION'
PARAME resolution 2.35
PARAME R-factor 0.19439
PARAME B-factor 50.142
COMMNT
COMMNT Reference database information:
DATABA source:
DATABA UNP: CETN2_HUMAN (P41208)
DATABA UNP: XPC_HUMAN (Q01831)
COMMNT
DATABA mutation:
DATABA MSE A 57 --> MET 57 'MODIFIED RESIDUE'
DATABA MSE B 84 --> MET 84 'MODIFIED RESIDUE'
DATABA MSE B 166 --> MET 166 'MODIFIED RESIDUE'
DATABA MSE A 97 --> MET 97 'MODIFIED RESIDUE'
DATABA MSE A 145 --> MET 145 'MODIFIED RESIDUE'
DATABA MSE B 57 --> MET 57 'MODIFIED RESIDUE'
DATABA MSE B 72 --> MET 72 'MODIFIED RESIDUE'
DATABA MSE A 93 --> MET 93 'MODIFIED RESIDUE'
DATABA MSE B 145 --> MET 145 'MODIFIED RESIDUE'
DATABA MSE B 97 --> MET 97 'MODIFIED RESIDUE'
DATABA MSE A 84 --> MET 84 'MODIFIED RESIDUE'
DATABA MSE A 72 --> MET 72 'MODIFIED RESIDUE'
DATABA MSE B 93 --> MET 93 'MODIFIED RESIDUE'
DATABA MSE A 166 --> MET 166 'MODIFIED RESIDUE'