HEADSC 2ggm COMMNT S2C correlation file created: Sat Apr 29 08:15:26 EDT 2006 COMMNT COMMNT If you use this database, please cite: COMMNT COMMNT Guoli Wang, Jonathan W. Arthur, and Roland L. Dunbrack, Jr. COMMNT "S2C: A database correlating sequence and atomic COMMNT coordinate numbering in the Protein Data Bank" COMMNT dunbrack.fccc.edu/Guoli/s2c COMMNT Copyright (c) February 2000, April 2002. COMMNT COMMNT SEQCRD columns are as follows: COMMNT COMMNT Column Positions Item COMMNT 1 0-6 Record identifier COMMNT 2 8 Chain COMMNT 3 10 One letter residue code COMMNT 4 12-14 SEQRES three letter residue code COMMNT 5 16-18 ATOM three letter residue code COMMNT 6 20-24 SEQRES residue number COMMNT 7 26-31 ATOM residue number COMMNT 8 33 PDB secondary structure COMMNT 9 35 STRIDE secondary structure COMMNT 10 37-43 Error flags COMMNT COMMNT Secondary structrue annotation: COMMNT H: Helix E: Strand T: Turn COMMNT B: Bridge G: 310Helix C: Coil COMMNT SEQCRD A M MET --- 1 - - - 367 SEQCRD A A ALA --- 2 - - - 367 SEQCRD A S SER --- 3 - - - 367 SEQCRD A N ASN --- 4 - - - 367 SEQCRD A F PHE --- 5 - - - 367 SEQCRD A K LYS --- 6 - - - 367 SEQCRD A K LYS --- 7 - - - 367 SEQCRD A A ALA --- 8 - - - 367 SEQCRD A N ASN --- 9 - - - 367 SEQCRD A M MET --- 10 - - - 367 SEQCRD A A ALA --- 11 - - - 367 SEQCRD A S SER --- 12 - - - 367 SEQCRD A S SER --- 13 - - - 367 SEQCRD A S SER --- 14 - - - 367 SEQCRD A Q GLN --- 15 - - - 367 SEQCRD A R ARG --- 16 - - - 367 SEQCRD A K LYS --- 17 - - - 367 SEQCRD A R ARG --- 18 - - - 367 SEQCRD A M MET --- 19 - - - 367 SEQCRD A S SER --- 20 - - - 367 SEQCRD A P PRO --- 21 - - - 367 SEQCRD A K LYS --- 22 - - - 367 SEQCRD A P PRO --- 23 - - - 367 SEQCRD A E GLU GLU 24 24 C C - SEQCRD A L LEU LEU 25 25 C C - SEQCRD A T THR THR 26 26 C C - SEQCRD A E GLU GLU 27 27 H H - SEQCRD A E GLU GLU 28 28 H H - SEQCRD A Q GLN GLN 29 29 H H - SEQCRD A K LYS LYS 30 30 H H - SEQCRD A Q GLN GLN 31 31 H H - SEQCRD A E GLU GLU 32 32 H H - SEQCRD A I ILE ILE 33 33 H H - SEQCRD A R ARG ARG 34 34 H H - SEQCRD A E GLU GLU 35 35 H H - SEQCRD A A ALA ALA 36 36 H H - SEQCRD A F PHE PHE 37 37 H H - SEQCRD A D ASP ASP 38 38 H H - SEQCRD A L LEU LEU 39 39 H H - SEQCRD A F PHE PHE 40 40 H H - SEQCRD A D ASP ASP 41 41 C T 5 SEQCRD A A ALA ALA 42 42 C T 5 SEQCRD A D ASP ASP 43 43 C T 5 SEQCRD A G GLY GLY 44 44 C T 5 SEQCRD A T THR THR 45 45 C C - SEQCRD A G GLY GLY 46 46 C C - SEQCRD A T THR THR 47 47 E E - SEQCRD A I ILE ILE 48 48 E E - SEQCRD A D ASP ASP 49 49 E E - SEQCRD A V VAL VAL 50 50 C G 5 SEQCRD A K LYS LYS 51 51 C G 5 SEQCRD A E GLU GLU 52 52 H G 5 SEQCRD A L LEU LEU 53 53 H H - SEQCRD A K LYS LYS 54 54 H H - SEQCRD A V VAL VAL 55 55 H H - SEQCRD A A ALA ALA 56 56 H H - SEQCRD A M MSE MSE 57 57 H H - SEQCRD A R ARG ARG 58 58 H H - SEQCRD A A ALA ALA 59 59 H H - SEQCRD A L LEU LEU 60 60 H H - SEQCRD A G GLY GLY 61 61 C C - SEQCRD A F PHE PHE 62 62 C C - SEQCRD A E GLU GLU 63 63 C C - SEQCRD A P PRO PRO 64 64 C C - SEQCRD A K LYS LYS 65 65 H C 5 SEQCRD A K LYS LYS 66 66 H H - SEQCRD A E GLU GLU 67 67 H H - SEQCRD A E GLU GLU 68 68 H H - SEQCRD A I ILE ILE 69 69 H H - SEQCRD A K LYS LYS 70 70 H H - SEQCRD A K LYS LYS 71 71 H H - SEQCRD A M MSE MSE 72 72 H H - SEQCRD A I ILE ILE 73 73 H H - SEQCRD A S SER SER 74 74 H H - SEQCRD A E GLU GLU 75 75 H H - SEQCRD A I ILE ILE 76 76 H H - SEQCRD A D ASP ASP 77 77 H H - SEQCRD A K LYS LYS 78 78 C T 5 SEQCRD A E GLU GLU 79 79 C T 5 SEQCRD A G GLY GLY 80 80 C T 5 SEQCRD A T THR THR 81 81 C C - SEQCRD A G GLY GLY 82 82 C C - SEQCRD A K LYS LYS 83 83 E E - SEQCRD A M MSE MSE 84 84 E E - SEQCRD A N ASN ASN 85 85 E E - SEQCRD A F PHE PHE 86 86 H H - SEQCRD A G GLY GLY 87 87 H H - SEQCRD A D ASP ASP 88 88 H H - SEQCRD A F PHE PHE 89 89 H H - SEQCRD A L LEU LEU 90 90 H H - SEQCRD A T THR THR 91 91 H H - SEQCRD A V VAL VAL 92 92 H H - SEQCRD A M MSE MSE 93 93 H H - SEQCRD A T THR THR 94 94 H H - SEQCRD A Q GLN GLN 95 95 H H - SEQCRD A K LYS LYS 96 96 H H - SEQCRD A M MSE MSE 97 97 H H - SEQCRD A S SER SER 98 98 H H - SEQCRD A E GLU GLU 99 99 H H - SEQCRD A K LYS LYS 100 100 H H - SEQCRD A D ASP ASP 101 101 H H - SEQCRD A T THR THR 102 102 H H - SEQCRD A K LYS LYS 103 103 H H - SEQCRD A E GLU GLU 104 104 H H - SEQCRD A E GLU GLU 105 105 H H - SEQCRD A I ILE ILE 106 106 H H - SEQCRD A L LEU LEU 107 107 H H - SEQCRD A K LYS LYS 108 108 H H - SEQCRD A A ALA ALA 109 109 H H - SEQCRD A F PHE PHE 110 110 H H - SEQCRD A K LYS LYS 111 111 H H - SEQCRD A L LEU LEU 112 112 H H - SEQCRD A F PHE PHE 113 113 H H - SEQCRD A D ASP ASP 114 114 H T 5 SEQCRD A D ASP ASP 115 115 C T 5 SEQCRD A D ASP ASP 116 116 C T 5 SEQCRD A E GLU GLU 117 117 C T 5 SEQCRD A T THR THR 118 118 C C - SEQCRD A G GLY GLY 119 119 C C - SEQCRD A K LYS LYS 120 120 C C - SEQCRD A I ILE ILE 121 121 C B 5 SEQCRD A S SER SER 122 122 H C 5 SEQCRD A F PHE PHE 123 123 H H - SEQCRD A K LYS LYS 124 124 H H - SEQCRD A N ASN ASN 125 125 H H - SEQCRD A L LEU LEU 126 126 H H - SEQCRD A K LYS LYS 127 127 H H - SEQCRD A R ARG ARG 128 128 H H - SEQCRD A V VAL VAL 129 129 H H - SEQCRD A A ALA ALA 130 130 H H - SEQCRD A K LYS LYS 131 131 H H - SEQCRD A E GLU GLU 132 132 H H - SEQCRD A L LEU LEU 133 133 H H - SEQCRD A G GLY GLY 134 134 C C - SEQCRD A E GLU GLU 135 135 C C - SEQCRD A N ASN ASN 136 136 C C - SEQCRD A L LEU LEU 137 137 C C - SEQCRD A T THR THR 138 138 H C 5 SEQCRD A D ASP ASP 139 139 H H - SEQCRD A E GLU GLU 140 140 H H - SEQCRD A E GLU GLU 141 141 H H - SEQCRD A L LEU LEU 142 142 H H - SEQCRD A Q GLN GLN 143 143 H H - SEQCRD A E GLU GLU 144 144 H H - SEQCRD A M MSE MSE 145 145 H H - SEQCRD A I ILE ILE 146 146 H H - SEQCRD A D ASP ASP 147 147 H H - SEQCRD A E GLU GLU 148 148 H H - SEQCRD A A ALA ALA 149 149 H H - SEQCRD A D ASP ASP 150 150 H T 5 SEQCRD A R ARG ARG 151 151 C T 5 SEQCRD A D ASP ASP 152 152 C T 5 SEQCRD A G GLY GLY 153 153 C T 5 SEQCRD A D ASP ASP 154 154 C C - SEQCRD A G GLY GLY 155 155 C C - SEQCRD A E GLU GLU 156 156 C C - SEQCRD A V VAL VAL 157 157 C B 5 SEQCRD A S SER SER 158 158 H C 5 SEQCRD A E GLU GLU 159 159 H H - SEQCRD A Q GLN GLN 160 160 H H - SEQCRD A E GLU GLU 161 161 H H - SEQCRD A F PHE PHE 162 162 H H - SEQCRD A L LEU LEU 163 163 H H - SEQCRD A R ARG ARG 164 164 H H - SEQCRD A I ILE ILE 165 165 H H - SEQCRD A M MSE MSE 166 166 H H - SEQCRD A K LYS LYS 167 167 H H - SEQCRD A K LYS LYS 168 168 H H - SEQCRD A T THR THR 169 169 C C - SEQCRD A S SER SER 170 170 C C - SEQCRD A L LEU LEU 171 171 C C - SEQCRD A Y TYR TYR 172 172 C C - SEQCRD B M MET --- 1 - - - 367 SEQCRD B A ALA --- 2 - - - 367 SEQCRD B S SER --- 3 - - - 367 SEQCRD B N ASN --- 4 - - - 367 SEQCRD B F PHE --- 5 - - - 367 SEQCRD B K LYS --- 6 - - - 367 SEQCRD B K LYS --- 7 - - - 367 SEQCRD B A ALA --- 8 - - - 367 SEQCRD B N ASN --- 9 - - - 367 SEQCRD B M MET --- 10 - - - 367 SEQCRD B A ALA --- 11 - - - 367 SEQCRD B S SER --- 12 - - - 367 SEQCRD B S SER --- 13 - - - 367 SEQCRD B S SER --- 14 - - - 367 SEQCRD B Q GLN --- 15 - - - 367 SEQCRD B R ARG --- 16 - - - 367 SEQCRD B K LYS --- 17 - - - 367 SEQCRD B R ARG --- 18 - - - 367 SEQCRD B M MET --- 19 - - - 367 SEQCRD B S SER --- 20 - - - 367 SEQCRD B P PRO --- 21 - - - 367 SEQCRD B K LYS --- 22 - - - 367 SEQCRD B P PRO --- 23 - - - 367 SEQCRD B E GLU --- 24 - - - 367 SEQCRD B L LEU LEU 25 25 C C - SEQCRD B T THR THR 26 26 C C - SEQCRD B E GLU GLU 27 27 C C - SEQCRD B E GLU GLU 28 28 H H - SEQCRD B Q GLN GLN 29 29 H H - SEQCRD B K LYS LYS 30 30 H H - SEQCRD B Q GLN GLN 31 31 H H - SEQCRD B E GLU GLU 32 32 H H - SEQCRD B I ILE ILE 33 33 H H - SEQCRD B R ARG ARG 34 34 H H - SEQCRD B E GLU GLU 35 35 H H - SEQCRD B A ALA ALA 36 36 H H - SEQCRD B F PHE PHE 37 37 H H - SEQCRD B D ASP ASP 38 38 H G 5 SEQCRD B L LEU LEU 39 39 H G 5 SEQCRD B F PHE PHE 40 40 H G 5 SEQCRD B D ASP ASP 41 41 H T 5 SEQCRD B A ALA ALA 42 42 C T 5 SEQCRD B D ASP ASP 43 43 C T 5 SEQCRD B G GLY GLY 44 44 C T 5 SEQCRD B T THR THR 45 45 C C - SEQCRD B G GLY GLY 46 46 C C - SEQCRD B T THR THR 47 47 E E - SEQCRD B I ILE ILE 48 48 E E - SEQCRD B D ASP ASP 49 49 E E - SEQCRD B V VAL VAL 50 50 C G 5 SEQCRD B K LYS LYS 51 51 C G 5 SEQCRD B E GLU GLU 52 52 H G 5 SEQCRD B L LEU LEU 53 53 H H - SEQCRD B K LYS LYS 54 54 H H - SEQCRD B V VAL VAL 55 55 H H - SEQCRD B A ALA ALA 56 56 H H - SEQCRD B M MSE MSE 57 57 H H - SEQCRD B R ARG ARG 58 58 H H - SEQCRD B A ALA ALA 59 59 H H - SEQCRD B L LEU LEU 60 60 H H - SEQCRD B G GLY GLY 61 61 C C - SEQCRD B F PHE PHE 62 62 C C - SEQCRD B E GLU GLU 63 63 C C - SEQCRD B P PRO PRO 64 64 C C - SEQCRD B K LYS LYS 65 65 H C 5 SEQCRD B K LYS LYS 66 66 H H - SEQCRD B E GLU GLU 67 67 H H - SEQCRD B E GLU GLU 68 68 H H - SEQCRD B I ILE ILE 69 69 H H - SEQCRD B K LYS LYS 70 70 H H - SEQCRD B K LYS LYS 71 71 H H - SEQCRD B M MSE MSE 72 72 H H - SEQCRD B I ILE ILE 73 73 H H - SEQCRD B S SER SER 74 74 H H - SEQCRD B E GLU GLU 75 75 H H - SEQCRD B I ILE ILE 76 76 H H - SEQCRD B D ASP ASP 77 77 H H - SEQCRD B K LYS LYS 78 78 C T 5 SEQCRD B E GLU GLU 79 79 C T 5 SEQCRD B G GLY GLY 80 80 C T 5 SEQCRD B T THR THR 81 81 C C - SEQCRD B G GLY GLY 82 82 C C - SEQCRD B K LYS LYS 83 83 E E - SEQCRD B M MSE MSE 84 84 E E - SEQCRD B N ASN ASN 85 85 E E - SEQCRD B F PHE PHE 86 86 H H - SEQCRD B G GLY GLY 87 87 H H - SEQCRD B D ASP ASP 88 88 H H - SEQCRD B F PHE PHE 89 89 H H - SEQCRD B L LEU LEU 90 90 H H - SEQCRD B T THR THR 91 91 H H - SEQCRD B V VAL VAL 92 92 H H - SEQCRD B M MSE MSE 93 93 H H - SEQCRD B T THR THR 94 94 H H - SEQCRD B Q GLN GLN 95 95 H H - SEQCRD B K LYS LYS 96 96 H H - SEQCRD B M MSE MSE 97 97 H H - SEQCRD B S SER SER 98 98 H H - SEQCRD B E GLU GLU 99 99 H H - SEQCRD B K LYS LYS 100 100 H H - SEQCRD B D ASP ASP 101 101 H H - SEQCRD B T THR THR 102 102 H H - SEQCRD B K LYS LYS 103 103 H H - SEQCRD B E GLU GLU 104 104 H H - SEQCRD B E GLU GLU 105 105 H H - SEQCRD B I ILE ILE 106 106 H H - SEQCRD B L LEU LEU 107 107 H H - SEQCRD B K LYS LYS 108 108 H H - SEQCRD B A ALA ALA 109 109 H H - SEQCRD B F PHE PHE 110 110 H H - SEQCRD B K LYS LYS 111 111 H H - SEQCRD B L LEU LEU 112 112 H H - SEQCRD B F PHE PHE 113 113 H H - SEQCRD B D ASP ASP 114 114 H T 5 SEQCRD B D ASP ASP 115 115 C T 5 SEQCRD B D ASP ASP 116 116 C T 5 SEQCRD B E GLU GLU 117 117 C T 5 SEQCRD B T THR THR 118 118 C C - SEQCRD B G GLY GLY 119 119 C C - SEQCRD B K LYS LYS 120 120 C C - SEQCRD B I ILE ILE 121 121 C E 5 SEQCRD B S SER SER 122 122 H E 5 SEQCRD B F PHE PHE 123 123 H H - SEQCRD B K LYS LYS 124 124 H H - SEQCRD B N ASN ASN 125 125 H H - SEQCRD B L LEU LEU 126 126 H H - SEQCRD B K LYS LYS 127 127 H H - SEQCRD B R ARG ARG 128 128 H H - SEQCRD B V VAL VAL 129 129 H H - SEQCRD B A ALA ALA 130 130 H H - SEQCRD B K LYS LYS 131 131 H H - SEQCRD B E GLU GLU 132 132 H H - SEQCRD B L LEU LEU 133 133 H H - SEQCRD B G GLY GLY 134 134 C C - SEQCRD B E GLU GLU 135 135 C C - SEQCRD B N ASN ASN 136 136 C C - SEQCRD B L LEU LEU 137 137 C C - SEQCRD B T THR THR 138 138 H C 5 SEQCRD B D ASP ASP 139 139 H H - SEQCRD B E GLU GLU 140 140 H H - SEQCRD B E GLU GLU 141 141 H H - SEQCRD B L LEU LEU 142 142 H H - SEQCRD B Q GLN GLN 143 143 H H - SEQCRD B E GLU GLU 144 144 H H - SEQCRD B M MSE MSE 145 145 H H - SEQCRD B I ILE ILE 146 146 H H - SEQCRD B D ASP ASP 147 147 H H - SEQCRD B E GLU GLU 148 148 H H - SEQCRD B A ALA ALA 149 149 H H - SEQCRD B D ASP ASP 150 150 H T 5 SEQCRD B R ARG ARG 151 151 C T 5 SEQCRD B D ASP ASP 152 152 C T 5 SEQCRD B G GLY GLY 153 153 C T 5 SEQCRD B D ASP ASP 154 154 C C - SEQCRD B G GLY GLY 155 155 C C - SEQCRD B E GLU GLU 156 156 C E 5 SEQCRD B V VAL VAL 157 157 C E 5 SEQCRD B S SER SER 158 158 H C 5 SEQCRD B E GLU GLU 159 159 H H - SEQCRD B Q GLN GLN 160 160 H H - SEQCRD B E GLU GLU 161 161 H H - SEQCRD B F PHE PHE 162 162 H H - SEQCRD B L LEU LEU 163 163 H H - SEQCRD B R ARG ARG 164 164 H H - SEQCRD B I ILE ILE 165 165 H H - SEQCRD B M MSE MSE 166 166 H H - SEQCRD B K LYS LYS 167 167 H H - SEQCRD B K LYS LYS 168 168 H C 5 SEQCRD B T THR --- 169 - - - 367 SEQCRD B S SER --- 170 - - - 367 SEQCRD B L LEU --- 171 - - - 367 SEQCRD B Y TYR --- 172 - - - 367 SEQCRD D N ASN ASN 1 847 H C 45 SEQCRD D W TRP TRP 2 848 H H 4 SEQCRD D K LYS LYS 3 849 H H 4 SEQCRD D L LEU LEU 4 850 H H 4 SEQCRD D L LEU LEU 5 851 H H 4 SEQCRD D A ALA ALA 6 852 H H 4 SEQCRD D K LYS LYS 7 853 H H 4 SEQCRD D G GLY GLY 8 854 H H 4 SEQCRD D L LEU LEU 9 855 H H 4 SEQCRD D L LEU LEU 10 856 H H 4 SEQCRD D I ILE ILE 11 857 H H 4 SEQCRD D R ARG ARG 12 858 H H 4 SEQCRD D E GLU GLU 13 859 H T 45 SEQCRD D R ARG ARG 14 860 H T 45 SEQCRD D L LEU LEU 15 861 H T 45 SEQCRD D K LYS LYS 16 862 H T 45 SEQCRD D R ARG ARG 17 863 H C 45 SEQCRD C N ASN ASN 1 847 H C 45 SEQCRD C W TRP TRP 2 848 H H 4 SEQCRD C K LYS LYS 3 849 H H 4 SEQCRD C L LEU LEU 4 850 H H 4 SEQCRD C L LEU LEU 5 851 H H 4 SEQCRD C A ALA ALA 6 852 H H 4 SEQCRD C K LYS LYS 7 853 H H 4 SEQCRD C G GLY GLY 8 854 H H 4 SEQCRD C L LEU LEU 9 855 H H 4 SEQCRD C L LEU LEU 10 856 H H 4 SEQCRD C I ILE ILE 11 857 H H 4 SEQCRD C R ARG ARG 12 858 H H 4 SEQCRD C E GLU GLU 13 859 H H 4 SEQCRD C R ARG ARG 14 860 H H 4 SEQCRD C L LEU LEU 15 861 H H 4 SEQCRD C K LYS LYS 16 862 H H 4 SEQCRD C R ARG ARG 17 863 H C 45 COMMNT S2CERR 1 0 No standard amino acid code S2CERR 2 0 SEQRES and ATOM residue names differ S2CERR 3 51 No ATOM record S2CERR 4 34 SEQRES and ATOM residue numbers differ S2CERR 5 61 PDB and STRIDE secondary structures differ S2CERR 6 51 PDB secondary structure is absent S2CERR 7 51 STRIDE secondary structure is absent COMMNT COMMNT Crystallographic technical parameters: PARAME method 'X-RAY DIFFRACTION' PARAME resolution 2.35 PARAME R-factor 0.19439 PARAME B-factor 50.142 COMMNT COMMNT Reference database information: DATABA source: DATABA UNP: CETN2_HUMAN (P41208) DATABA UNP: XPC_HUMAN (Q01831) COMMNT DATABA mutation: DATABA MSE A 57 --> MET 57 'MODIFIED RESIDUE' DATABA MSE B 84 --> MET 84 'MODIFIED RESIDUE' DATABA MSE B 166 --> MET 166 'MODIFIED RESIDUE' DATABA MSE A 97 --> MET 97 'MODIFIED RESIDUE' DATABA MSE A 145 --> MET 145 'MODIFIED RESIDUE' DATABA MSE B 57 --> MET 57 'MODIFIED RESIDUE' DATABA MSE B 72 --> MET 72 'MODIFIED RESIDUE' DATABA MSE A 93 --> MET 93 'MODIFIED RESIDUE' DATABA MSE B 145 --> MET 145 'MODIFIED RESIDUE' DATABA MSE B 97 --> MET 97 'MODIFIED RESIDUE' DATABA MSE A 84 --> MET 84 'MODIFIED RESIDUE' DATABA MSE A 72 --> MET 72 'MODIFIED RESIDUE' DATABA MSE B 93 --> MET 93 'MODIFIED RESIDUE' DATABA MSE A 166 --> MET 166 'MODIFIED RESIDUE'