Complet list of 2crq hssp file
Complete list of 2crq.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2CRQ
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-17
HEADER TRANSLATION 20-MAY-05 2CRQ
COMPND MOL_ID: 1; MOLECULE: MITOCHONDRIAL TRANSLATIONAL INITIATION FACTOR 3;
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: MUS MUSCULUS; ORGANISM_COMMON: HOUSE M
AUTHOR T.NAGASHIMA,F.HAYASHI,S.YOKOYAMA,RIKEN STRUCTURAL GENOMICS/PROTEOMICS
DBREF 2CRQ A 8 106 UNP Q9CZD5 Q9CZD5_MOUSE 148 246
SEQLENGTH 112
NCHAIN 1 chain(s) in 2CRQ data set
NALIGN 69
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : A4QMQ0_MOUSE 0.99 1.00 8 107 148 247 100 0 0 276 A4QMQ0 Mitochondrial translational initiation factor 3 OS=Mus musculus GN=Mtif3 PE=2 SV=1
2 : IF3M_MOUSE 2CRQ 0.99 1.00 8 107 148 247 100 0 0 276 Q9CZD5 Translation initiation factor IF-3, mitochondrial OS=Mus musculus GN=Mtif3 PE=1 SV=1
3 : B0BNE9_RAT 0.87 0.96 11 107 152 248 97 0 0 276 B0BNE9 LOC684274 protein OS=Rattus norvegicus GN=Mtif3 PE=2 SV=1
4 : G3GWI6_CRIGR 0.82 0.91 8 107 142 241 100 0 0 270 G3GWI6 Translation initiation factor IF-3, mitochondrial OS=Cricetulus griseus GN=I79_002111 PE=4 SV=1
5 : S7NB93_MYOBR 0.78 0.93 8 105 120 217 98 0 0 247 S7NB93 Translation initiation factor IF-3, mitochondrial OS=Myotis brandtii GN=D623_10026416 PE=4 SV=1
6 : L8YC18_TUPCH 0.77 0.92 12 107 59 154 96 0 0 181 L8YC18 Translation initiation factor IF-3, mitochondrial OS=Tupaia chinensis GN=TREES_T100016460 PE=4 SV=1
7 : G1NX84_MYOLU 0.75 0.90 5 105 149 250 102 1 1 280 G1NX84 Uncharacterized protein OS=Myotis lucifugus GN=MTIF3 PE=4 SV=1
8 : K9IZG3_DESRO 0.75 0.91 5 107 149 251 103 0 0 278 K9IZG3 Putative translation initiation factor if-3 mitochondrial OS=Desmodus rotundus PE=2 SV=1
9 : L5L651_PTEAL 0.75 0.91 11 105 15 109 95 0 0 139 L5L651 Translation initiation factor IF-3, mitochondrial OS=Pteropus alecto GN=PAL_GLEAN10006839 PE=4 SV=1
10 : D2HWR1_AILME 0.73 0.89 6 107 150 251 102 0 0 279 D2HWR1 Uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=LOC100470040 PE=4 SV=1
11 : F6X1C9_HORSE 0.73 0.89 2 107 146 251 106 0 0 272 F6X1C9 Uncharacterized protein OS=Equus caballus GN=MTIF3 PE=4 SV=1
12 : G9KBP3_MUSPF 0.73 0.91 6 107 150 251 102 0 0 278 G9KBP3 Mitochondrial translational initiation factor 3 (Fragment) OS=Mustela putorius furo PE=2 SV=1
13 : M3Y394_MUSPF 0.73 0.91 6 107 150 251 102 0 0 279 M3Y394 Uncharacterized protein OS=Mustela putorius furo GN=MTIF3 PE=4 SV=1
14 : M3W6V3_FELCA 0.72 0.88 4 107 147 249 104 1 1 277 M3W6V3 Uncharacterized protein OS=Felis catus GN=MTIF3 PE=4 SV=1
15 : G3QZB1_GORGO 0.70 0.87 5 107 148 250 103 0 0 278 G3QZB1 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101123946 PE=4 SV=1
16 : H2Q7C1_PANTR 0.70 0.87 5 107 148 250 103 0 0 278 H2Q7C1 Uncharacterized protein OS=Pan troglodytes GN=MTIF3 PE=4 SV=1
17 : G1QHM2_NOMLE 0.69 0.87 5 107 127 229 103 0 0 257 G1QHM2 Uncharacterized protein (Fragment) OS=Nomascus leucogenys GN=MTIF3 PE=4 SV=1
18 : G1T1U3_RABIT 0.69 0.89 2 107 172 277 106 0 0 305 G1T1U3 Uncharacterized protein OS=Oryctolagus cuniculus GN=MTIF3 PE=4 SV=2
19 : G3TF97_LOXAF 0.69 0.83 5 107 139 241 103 0 0 269 G3TF97 Uncharacterized protein OS=Loxodonta africana GN=MTIF3 PE=4 SV=1
20 : F1RSX7_PIG 0.68 0.84 3 112 146 256 111 1 1 278 F1RSX7 Uncharacterized protein OS=Sus scrofa GN=SSC.79080 PE=4 SV=2
21 : G5ATV2_HETGA 0.68 0.88 4 106 147 249 103 0 0 276 G5ATV2 Translation initiation factor IF-3, mitochondrial OS=Heterocephalus glaber GN=GW7_12525 PE=4 SV=1
22 : H2NJH3_PONAB 0.68 0.86 5 107 148 249 103 1 1 277 H2NJH3 Uncharacterized protein OS=Pongo abelii GN=MTIF3 PE=4 SV=1
23 : IF3M_BOVIN 0.68 0.87 3 107 141 245 105 0 0 273 Q32KZ1 Translation initiation factor IF-3, mitochondrial OS=Bos taurus GN=MTIF3 PE=2 SV=1
24 : IF3M_HUMAN 0.68 0.87 4 107 147 250 104 0 0 278 Q9H2K0 Translation initiation factor IF-3, mitochondrial OS=Homo sapiens GN=MTIF3 PE=1 SV=4
25 : L8I6F7_9CETA 0.68 0.87 3 107 141 245 105 0 0 273 L8I6F7 Translation initiation factor IF-3, mitochondrial OS=Bos mutus GN=M91_00437 PE=4 SV=1
26 : E2RDI7_CANFA 0.67 0.85 4 107 149 252 104 0 0 279 E2RDI7 Uncharacterized protein OS=Canis familiaris GN=MTIF3 PE=4 SV=1
27 : F7FGJ0_MACMU 0.67 0.86 5 107 148 250 103 0 0 278 F7FGJ0 Uncharacterized protein OS=Macaca mulatta GN=MTIF3 PE=4 SV=1
28 : G7NJU2_MACMU 0.67 0.86 5 107 148 250 103 0 0 278 G7NJU2 Translation initiation factor IF-3, mitochondrial OS=Macaca mulatta GN=EGK_09193 PE=4 SV=1
29 : G7PVW8_MACFA 0.67 0.86 5 107 148 250 103 0 0 278 G7PVW8 Translation initiation factor IF-3, mitochondrial OS=Macaca fascicularis GN=EGM_08356 PE=4 SV=1
30 : F7E1Z4_CALJA 0.66 0.88 4 107 147 250 104 0 0 280 F7E1Z4 Uncharacterized protein OS=Callithrix jacchus GN=MTIF3 PE=4 SV=1
31 : W5PSL1_SHEEP 0.65 0.87 3 107 137 241 105 0 0 269 W5PSL1 Uncharacterized protein OS=Ovis aries GN=MTIF3 PE=4 SV=1
32 : F6VVP7_CALJA 0.64 0.86 4 106 147 249 103 0 0 262 F6VVP7 Uncharacterized protein OS=Callithrix jacchus GN=MTIF3 PE=4 SV=1
33 : H0VG87_CAVPO 0.64 0.88 4 111 142 249 108 0 0 272 H0VG87 Uncharacterized protein OS=Cavia porcellus GN=MTIF3 PE=4 SV=1
34 : H0XBJ0_OTOGA 0.63 0.83 2 107 145 250 106 0 0 278 H0XBJ0 Uncharacterized protein OS=Otolemur garnettii GN=MTIF3 PE=4 SV=1
35 : F7G2D6_MONDO 0.62 0.84 2 104 145 246 103 1 1 279 F7G2D6 Uncharacterized protein OS=Monodelphis domestica GN=MTIF3 PE=4 SV=1
36 : I3MLU1_SPETR 0.61 0.81 2 110 145 253 109 0 0 278 I3MLU1 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=MTIF3 PE=4 SV=1
37 : G3WX92_SARHA 0.57 0.80 2 104 144 245 103 1 1 278 G3WX92 Uncharacterized protein OS=Sarcophilus harrisii GN=MTIF3 PE=4 SV=1
38 : V9L3A3_CALMI 0.51 0.74 2 104 161 262 103 1 1 288 V9L3A3 Translation initiation factor IF-3, mitochondrial-like protein (Fragment) OS=Callorhynchus milii PE=2 SV=1
39 : V9L496_CALMI 0.51 0.74 2 104 161 262 103 1 1 298 V9L496 Translation initiation factor IF-3, mitochondrial-like protein OS=Callorhynchus milii PE=2 SV=1
40 : V9L8A3_CALMI 0.51 0.74 2 104 161 262 103 1 1 289 V9L8A3 Translation initiation factor IF-3, mitochondrial-like protein (Fragment) OS=Callorhynchus milii PE=2 SV=1
41 : F7D1L4_ORNAN 0.50 0.79 2 104 147 249 103 0 0 283 F7D1L4 Uncharacterized protein OS=Ornithorhynchus anatinus GN=MTIF3 PE=4 SV=1
42 : F7D1T9_ORNAN 0.50 0.79 2 104 147 249 103 0 0 278 F7D1T9 Uncharacterized protein OS=Ornithorhynchus anatinus GN=MTIF3 PE=4 SV=1
43 : F1P2M3_CHICK 0.49 0.75 1 108 100 202 108 3 5 227 F1P2M3 Uncharacterized protein (Fragment) OS=Gallus gallus GN=MTIF3 PE=4 SV=2
44 : H3B460_LATCH 0.48 0.74 2 104 157 258 103 1 1 293 H3B460 Uncharacterized protein OS=Latimeria chalumnae PE=4 SV=1
45 : G1NQD1_MELGA 0.46 0.75 1 93 145 236 93 1 1 277 G1NQD1 Uncharacterized protein OS=Meleagris gallopavo GN=MTIF3 PE=4 SV=1
46 : U3IEM8_ANAPL 0.46 0.76 1 92 145 232 92 2 4 273 U3IEM8 Uncharacterized protein OS=Anas platyrhynchos GN=MTIF3 PE=4 SV=1
47 : U3K1S6_FICAL 0.46 0.70 1 90 145 231 90 1 3 274 U3K1S6 Uncharacterized protein OS=Ficedula albicollis GN=MTIF3 PE=4 SV=1
48 : H9G780_ANOCA 0.45 0.69 2 92 153 242 91 1 1 282 H9G780 Uncharacterized protein OS=Anolis carolinensis GN=MTIF3 PE=4 SV=1
49 : K7FYB2_PELSI 0.45 0.71 2 92 152 241 91 1 1 282 K7FYB2 Uncharacterized protein OS=Pelodiscus sinensis GN=MTIF3 PE=4 SV=1
50 : T1DAF6_CROHD 0.43 0.65 4 104 146 244 101 2 2 275 T1DAF6 Translation initiation factor IF-3, mitochondrial-like protein OS=Crotalus horridus PE=2 SV=1
51 : W5M9G5_LEPOC 0.43 0.68 2 104 156 255 103 2 3 290 W5M9G5 Uncharacterized protein OS=Lepisosteus oculatus PE=4 SV=1
52 : F1QA65_DANRE 0.42 0.63 4 106 106 205 103 1 3 236 F1QA65 Uncharacterized protein (Fragment) OS=Danio rerio GN=mtif3 PE=4 SV=1
53 : G3NXG4_GASAC 0.42 0.65 4 103 139 236 100 1 2 236 G3NXG4 Uncharacterized protein OS=Gasterosteus aculeatus PE=4 SV=1
54 : A2BEV1_DANRE 0.41 0.61 3 108 149 251 106 1 3 280 A2BEV1 Uncharacterized protein OS=Danio rerio GN=mtif3 PE=4 SV=1
55 : H3BYR9_TETNG 0.41 0.60 2 111 67 174 112 2 6 199 H3BYR9 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis PE=4 SV=1
56 : H0ZMW6_TAEGU 0.40 0.62 1 110 76 181 110 2 4 205 H0ZMW6 Uncharacterized protein (Fragment) OS=Taeniopygia guttata GN=MTIF3 PE=4 SV=1
57 : M7BI40_CHEMY 0.40 0.73 2 92 109 198 91 1 1 240 M7BI40 Translation initiation factor IF-3 OS=Chelonia mydas GN=UY3_05216 PE=4 SV=1
58 : R0L8T8_ANAPL 0.40 0.69 1 111 75 180 111 3 5 199 R0L8T8 Translation initiation factor IF-3, mitochondrial (Fragment) OS=Anas platyrhynchos GN=Anapl_14258 PE=4 SV=1
59 : C3KI71_ANOFI 0.39 0.63 3 111 146 252 109 1 2 298 C3KI71 Translation initiation factor IF-3, mitochondrial OS=Anoplopoma fimbria GN=IF3M PE=2 SV=1
60 : F7EGE9_XENTR 0.39 0.65 2 111 132 239 110 1 2 260 F7EGE9 Uncharacterized protein OS=Xenopus tropicalis GN=mtif3 PE=4 SV=1
61 : I3JE24_ORENI 0.37 0.61 3 111 146 252 109 1 2 286 I3JE24 Uncharacterized protein OS=Oreochromis niloticus GN=LOC100711381 PE=4 SV=1
62 : Q0VFN0_XENTR 0.36 0.64 2 111 130 237 110 1 2 258 Q0VFN0 MGC145669 protein OS=Xenopus tropicalis GN=mtif3 PE=2 SV=1
63 : V8PC28_OPHHA 0.36 0.61 4 92 146 233 89 1 1 273 V8PC28 Translation initiation factor IF-3, mitochondrial (Fragment) OS=Ophiophagus hannah GN=Mtif3 PE=4 SV=1
64 : A8WH50_XENLA 0.35 0.63 2 111 118 225 110 1 2 246 A8WH50 LOC100127256 protein (Fragment) OS=Xenopus laevis GN=LOC100127256 PE=2 SV=1
65 : H2MTU7_ORYLA 0.34 0.57 2 111 143 250 110 1 2 264 H2MTU7 Uncharacterized protein OS=Oryzias latipes GN=LOC101156700 PE=4 SV=1
66 : W5UJV8_ICTPU 0.33 0.59 3 112 153 261 111 2 3 293 W5UJV8 Translation initiation factor IF-3, mitochondrial OS=Ictalurus punctatus GN=Mtif3 PE=2 SV=1
67 : H2UB21_TAKRU 0.32 0.59 2 112 84 192 111 1 2 212 H2UB21 Uncharacterized protein (Fragment) OS=Takifugu rubripes GN=LOC101077278 PE=4 SV=1
68 : S4R6Z0_PETMA 0.31 0.57 2 111 116 225 111 2 2 225 S4R6Z0 Uncharacterized protein (Fragment) OS=Petromyzon marinus PE=4 SV=1
69 : S4R6Z1_PETMA 0.30 0.58 2 108 76 183 109 3 3 211 S4R6Z1 Uncharacterized protein (Fragment) OS=Petromyzon marinus PE=4 SV=1
## ALIGNMENTS 1 - 69
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 135 7 24 A AAA A A
2 2 A S + 0 0 132 31 27 A E DEDEEEEEEEEEEEEE E EEEE Q E EQ EQQ
3 3 A S + 0 0 130 39 53 S M A A A A RKMIKKKKKKKKKKKK R RKKKKRKRK KRRKKQ
4 4 A G + 0 0 79 50 51 G G E GE EEEE GEGEEEEEQQQSSEQEKKQEEQEEEQKQKELEQEQEEQQD
5 5 A S + 0 0 116 60 14 KK K KKKKKKRKKRKRKKKKKRKKKKKKKKKKKKKKKKKKKKKKKKKKKKSKKKKKKKQK
6 6 A S - 0 0 131 63 52 AA AAAAAAAAAAAAAAAAAAAAAAAAAAAAEEEAAAVAASAAAAQQQSAAAQSKLALNQSDP
7 7 A G - 0 0 44 63 70 RT HRNNHNNSQEEKSANAHSSSNANKRQKQNNNKKSNSSSNHSQKKKKSSSKGKSSSKKKKQ
8 8 A P + 0 0 139 67 53 PP PP PP PPPPPPPPPPPPPPPPPPPPPPPPPSLSPPPSSSSLSPPSPPDADAPSSAHASPSTTAPS
9 9 A K - 0 0 176 67 34 KK KK KK KTKKKKKKKKKKKKKKKKKKKKKKKKKKKKKQQKKKKKSKKAKKKAKKKKAKGKSKKGQA
10 10 A T - 0 0 106 67 67 TT TT tA PPAAPTTTATpATPTPPTTTTPTSITPTSSSTTTNTTPNTAPKAKTPAAAIAHAHAaMSg
11 11 A G - 0 0 25 64 24 GGGGG gGGGGGGGGGGGGgGGGGGGGGGGGGGGGGETTTGG.A..DGGG.GAGGGG.AGVGGGPgSGg
12 12 A P - 0 0 90 69 44 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPRAAAPPVPVGGPPP.VPVPIPGPPPPLTPVPTQ
13 13 A T S S+ 0 0 146 70 58 TTTATTTTTATTTTTTTITTTTTTTVTTTTTTTTPAPTTTTTLTLLVVVVTVVVVTVLLVVVVVPVATT
14 14 A M - 0 0 85 70 70 MMVALLLLVLLLLLLLLLLLVVLLLLLLLLLLVLQVQQQQQQVQVVVQQQQQQQQFQVQQQQQQQQQQQ
15 15 A T E -a 45 0A 89 70 82 TTTTTMTTTTTTTTRRRTTTTRTRTTTTTTTTRMLRLMMMLLQIQQQQLQVSVSVSLQVIVIQIVVVII
16 16 A K E -a 46 0A 69 70 0 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
17 17 A E E -a 47 0A 135 70 13 EEEEEEEEEEEEEEEEEEKEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEIEEEEEEEEEEEEIEEDEEE
18 18 A L E -a 48 0A 27 70 17 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQLLLLLLLILILLLLLLLLLLLLILLLLII
19 19 A V E +a 49 0A 92 70 58 VVIIITITTSTSSTIITTTTTITITTTTTTTTTTITITTTTTSTSSSTTKTNTNTSTSTSSSKSTTTSS
20 20 A F E -a 50 0A 15 70 8 FFFFFFFFFFFFFFLLLFFFFLFLFLFFFFFFFFFFFFFFFFFFFFFLFMLFFFFFFFFFFFMFFLFFF
21 21 A S E > -a 51 0A 44 70 3 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSS
22 22 A S T 3 S+ 0 0 12 70 43 SSSSSSSSASSSSSSSSSSSSSSSSSSSSSSSSSSSSAAASSSASSSTATSSSSHSASSAAATALSHAA
23 23 A N T 3 + 0 0 119 70 62 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDDDSSADAAATAVDDGDGAAAGCGCVGGDGDD
24 24 A I < - 0 0 29 70 6 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIITIIIIIIIIIIIITIIIIII
25 25 A G > - 0 0 36 70 41 GGGGGGGGGGGGGGGGGGEGGGGGGGGGGGGGGGAEAAAATTAAAAAAAEASASAAAAGAAAEAAAAGG
26 26 A Q H > S+ 0 0 119 70 58 QQQQQQQRQQQQQRQQQQQQQQQQQQQQQQQQQQQQQKKKQQKKKKKQKQRLPLAKKKAKSKQKPPAKK
27 27 A H H > S+ 0 0 172 70 25 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHNHNHHHHHHHHHHNHNNNHHHRHHHHNHNHHHHHHHHHSS
28 28 A D H >> S+ 0 0 95 70 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
29 29 A L H 3X S+ 0 0 2 70 1 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFLLLLLLLLLLLLFLLLLLL
30 30 A D H 3X S+ 0 0 92 70 42 DDDDDDDDDDDDDDDDDDDDNDDDDDDDDDNDEDNDDQQQHHEDEEEDESDDSDTEEESDTDSDSDTEE
31 31 A T H <>S+ 0 0 48 70 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
40 40 A I H ><5S+ 0 0 0 70 16 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVLLLLIIILILIIILLLVV
41 41 A E H 3<5S+ 0 0 95 70 35 EEEEEEEEEGEEEEKKKEKEEKEKEEKKKKEKEEEEEEEEEEEQEEEDDDEEEEEEDEEEEEDEEEELL
42 42 A K T <<5S- 0 0 112 70 6 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKEE
43 43 A K T < 5 + 0 0 116 70 26 KKKKKKKKKKKQQKKKKKKKKKKKKKEEEQKQNKKKKKKKKKKRKKRKKKKNKNNRKKKKKKKKKKNKK
44 44 A Y < - 0 0 29 70 62 YYYYYYYYYYYYYYHHHYHYYHYHYYRRRHYHYFYYYHHHHHYHYYHHHYLNNNNHHYNHHHHHHHNrr
45 45 A H E -a 15 0A 92 70 55 HHHLKQKKKRKKKRLLLQQKQLKLKKQQQQKQQKQQKHHHQQHHHHHHHHHHHHHHHHHHHHHHHHHhh
46 46 A V E -aB 16 97A 0 69 1 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVV.VVVVVVVVIVV
47 47 A Q E -aB 17 96A 67 69 47 QQQQQQQQQQQQQQQQRQQQQQQQQQRRRRQRQQQQQRRRRRKRKKKRRKKKRKRKR.RRKRKRRRRRR
48 48 A V E -aB 18 95A 0 69 18 VVVVVIVIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVIVVVVIIVLILVVI.LIIIIIMLVVV
49 49 A T E -aB 19 94A 22 70 27 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSMTTTTTTSVTSTSMSTTTSS
50 50 A I E -aB 20 93A 0 70 25 IIIIIIIIVIIIIIIIIIIIIIVIVIIIIIVIIIVIVVVVIIILIVIVVLVILILIVKLVLVLVVLLII
51 51 A K E -aB 21 92A 104 70 37 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRRRRQQHRRRRrRQVRQRQRQRKRKK
52 52 A R - 0 0 66 44 27 RRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK...KK..Q.........rQ.T...........
53 53 A R > - 0 0 181 56 63 RRRAAGAAGGGGGGGGGGGGGGGGGGGGGGGGGGDGGKKKGG.KG..QQQK...GAQV....Q....NN
54 54 A K T 3 S- 0 0 195 65 50 KKKKKKKKKKKKKKKKKKKRKKKKKKKKKKKKKKKKKGGGKK.GR..KKKG.S.QKRRSKAKKKGPATT
55 55 A D T 3 S- 0 0 142 70 69 DDDNNNNNNNNNNNNNNNNNNNSNSNNNNNSNNNTNTNNNFFQSDQQRNSSAGAPGNQGGVGTGGRKKK
56 56 A A < - 0 0 29 70 74 AAAAAVATATAPPVVVVAVAAVAVAVVVVVAVATAAIAAAVVASSAAAVIPGRGASVARSPSISRRRKK
57 57 A E S S+ 0 0 186 68 71 EEEEEDEEEEEEEEDDDDEEDDDDDEDDDDDDDD.D.LLLKKKVNKKDASATGTDSAKGPRPSSADGKK
58 58 A Q - 0 0 100 69 68 QQQQEEEEEEEEE.VVMEEEEVEVEEVVVEEEDEDEDHHHEEDNTDGDDDANGNNTDDRAEADAERQSS
59 59 A S >> - 0 0 81 70 63 SSPPPPPPPPPPPPSSSPPPPSPSPPSSSSPSPPPQPTTTEESSDSSEGGESSSLDGSPGSGGEPSPDD
60 60 A E H 3> S+ 0 0 76 70 65 EEGREEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEIIIEENIMNSPPPIEAEEMPNAGAGPGAEATT
61 61 A E H 3> S+ 0 0 149 70 68 EESNNENNKHNQQNNNNNANNNDNDHNNNNDNHNDNDEEEHHTDHTTKEEATVTTFETDNVNENVQDDD
62 62 A E H X> S+ 0 0 60 70 61 EEEEKKKKKKKKKKEEEKKKKEKERKKKKKKKKKNKNKKKNNDEQDDHKKEPNPAMKDNDNDKDPANKK
63 63 A T H 3X S+ 0 0 9 70 44 TTMMMTMTVMIMMMMMMTTMTMMMMMMMMGMGPMMMMMMMLLTMLMMTMMLLLLLLMMLKLKMKLLLMM
64 64 A E H 3X S+ 0 0 86 70 65 EEDDEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEFEIFFLLLEDDDEFLFDLDLLLDDELL
65 65 A E H X S+ 0 0 0 69 44 FFFFFFFLFCFCCCFFFFFCFFCFCCFFFFCFFFFLFLLLFFFIFFFFFFLLLLV.FFLLLLFLLLLAA
68 68 A N H 3X S+ 0 0 66 69 53 NNNNNNNNNNNNNNHHHNNNHHNHNNHHHHNHDNNNNDDDDDDEDDDSDDETETQ.DDENENDNEDEGG
69 69 A Q H 3< S+ 0 0 68 70 47 QQQQQQQQQQHQQQQQQQQQEQRQRKQQQQRQHQKQKYYYEEQKQRQQQKQQQQQQQRQQQQKQQEESS
70 70 A I H XX S+ 0 0 0 70 13 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVMIMVIIIIIIIIIMMVVV
71 71 A L H 3< S+ 0 0 29 70 36 LLLLLLLLLLLLLLLLLLLLLLVLVLLLLLVLLLLVLIIILLLVLLLLVLAVVVGLVLVAVALTVVVLL
72 72 A Q T 3< S+ 0 0 145 70 44 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQEKQKKKKQQEQEEEDEDQEQEMEAEQEEEDDEGHEE
73 73 A T T <4 S+ 0 0 60 70 58 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTTTTSSSSAATHATTATTHKQKTTTTQNQNTAQRKPP
74 74 A M S >X S+ 0 0 0 70 25 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMVMMMMMMMMMVVMMMVVMMMLIMIVVMVMMMMMVMIVII
75 75 A P T 34 S+ 0 0 71 69 54 PPPTPPPPPPPPPPPPPPPHPPSPSPPPPPSPPPPPPPPPPPSPSSSPPRASES.SPSESESRSDTGAA
76 76 A D T 34 S+ 0 0 69 69 59 DDGAGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGEGEDDDIIDDDDEEAEGVAV.EEDMEVEEEVAVDD
77 77 A I T <4 S+ 0 0 20 69 79 IIIIIIIITIILLIIIIIIIVIIIIIMMMIIIMILILKKKLLKKKKKKKKVPMP.KKKMHMHKHMPMNN
78 78 A A < - 0 0 1 69 28 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTAAAAAAAAAVVV.AAAVAVAAAVLVAA
79 79 A T E -C 98 0A 53 70 22 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGAGTTTGTGTTTGGGTT
80 80 A F E +C 97 0A 33 70 12 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFYYYFFYFYYYYYYFFFFFYYYFCFCYCFFFYY
81 81 A S E S+ 0 0A 36 70 76 SSLSSSSSSSSSSSSSSKSSLSSSSLLLLSSSLSTLAVVVSSLVLLLLLHIVVVVLLLVVVVHVVVVVV
82 82 A S E S-C 96 0A 58 69 15 SSSSSSSSSSSSSTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS.SSSVSVSVSNSSS
83 83 A R E - 0 0A 189 70 48 RRRRRRRRRRRRRRRRRKKRRKRRRRRRRRRRRKKKRRRRKKKKKKKEQMKNKNKSQKKKRKMKKKKAA
84 84 A P E - 0 0A 30 70 3 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPKPPPPPPPPPPPPP
85 85 A K E -C 94 0A 99 70 56 KKKKQQQQQQQQQQQQQQHQQQQQQQQQQQQQQQKQKKKKRRKKKKKRKSKTKTKPRKKKKKSKKKKRR
86 86 A A E -C 93 0A 58 69 65 AAAAPAPPAASPPPAAAAALAAPAPPAAAAPAPATPIAAAVVASAAVIV.ALALAKVATLVLILVLVAA
87 87 A I E > > +C 92 0A 35 70 28 IIVIVVVIVVVVVVVVVVIIVVIVIVVVVVIVVTTVTVVVIIILIIAAIIIIIIIVVIIVIVAIIIVVV
88 88 A R G > 5S- 0 0 210 69 46 RRRRKKKRKRKRRRQQQKRRK.RQRRQQQHRHKKRKRRRRKKKKKKRKRAKRRRRAKKRKRKAKKRRNN
89 89 A G G 3 5S- 0 0 84 70 45 GGGGGGGGGGGGGGGGGGGGGRGGGGGGGGGGGGGGGEEEDDEEEEEEEADGDGDREEDEDEEEDEDDD
90 90 A G G < 5S+ 0 0 16 70 4 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGEGGGGGEGGGGGGGGGGGGG
91 91 A T T < 5S+ 0 0 68 69 58 TTTTKKKKKKKKKKKKKKRKRKKKKKKKKKKKRKKKKKKKRRTRTT RRGKRQRQGKTHRQRRRRRQAA
92 92 A A E