Complet list of 4trx hssp file
Complete list of 4trx.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 4TRX
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-02-12
HEADER ELECTRON TRANSPORT 1992-01-15 4TRX
COMPND THIOREDOXIN
SOURCE Homo sapiens
AUTHOR Forman-Kay, J.D.; Clore, G.M.; Gronenborn, A.M.
SEQLENGTH 105
NCHAIN 1 chain(s) in 4TRX data set
NALIGN 69
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : G3QVK1_GORGO 0.99 0.99 1 105 1 105 105 0 0 105 G3QVK1 Thioredoxin OS=Gorilla gorilla gorilla GN=101141924 PE=3 SV=1
2 : H2QXP0_PANTR 0.99 0.99 1 105 1 105 105 0 0 105 H2QXP0 Thioredoxin OS=Pan troglodytes GN=TXN PE=3 SV=1
3 : H9ZYJ2_HUMAN 0.99 0.99 1 105 1 105 105 0 0 105 H9ZYJ2 Thioredoxin OS=Homo sapiens GN=TXN PE=2 SV=1
4 : THIO_HUMAN 3TRX 0.99 0.99 1 105 1 105 105 0 0 105 P10599 Thioredoxin OS=Homo sapiens GN=TXN PE=1 SV=3
5 : THIO_MACMU 0.96 0.98 1 105 1 105 105 0 0 105 P29451 Thioredoxin OS=Macaca mulatta GN=TXN PE=3 SV=2
6 : F6RHG3_MACMU 0.95 0.99 1 105 1 105 105 0 0 105 F6RHG3 Thioredoxin OS=Macaca mulatta GN=LOC100426263 PE=3 SV=1
7 : H0X010_OTOGA 0.95 0.98 1 105 1 105 105 0 0 105 H0X010 Thioredoxin OS=Otolemur garnettii GN=TXN PE=3 SV=1
8 : U3CTN3_CALJA 0.95 0.98 1 105 1 105 105 0 0 105 U3CTN3 Thioredoxin OS=Callithrix jacchus GN=TXN PE=3 SV=1
9 : D2H4F0_AILME 0.94 0.99 10 97 1 88 88 0 0 88 D2H4F0 Putative uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_004661 PE=4 SV=1
10 : G1MI46_AILME 0.93 0.98 1 105 1 105 105 0 0 105 G1MI46 Thioredoxin OS=Ailuropoda melanoleuca GN=TXN PE=3 SV=1
11 : G7P3Z5_MACFA 0.93 0.99 1 102 1 102 102 0 0 102 G7P3Z5 Thioredoxin OS=Macaca fascicularis GN=EGM_13623 PE=3 SV=1
12 : G8JKZ8_BOVIN 0.93 0.99 10 105 2 97 96 0 0 97 G8JKZ8 Thioredoxin (Fragment) OS=Bos taurus GN=TXN PE=2 SV=1
13 : H6TBN0_PIG 0.93 0.98 1 105 1 105 105 0 0 105 H6TBN0 Thioredoxin OS=Sus scrofa GN=TRX1 PE=2 SV=1
14 : I3MZF3_SPETR 0.93 0.99 10 105 11 106 96 0 0 106 I3MZF3 Thioredoxin (Fragment) OS=Spermophilus tridecemlineatus PE=3 SV=1
15 : F7G6L6_CALJA 0.92 0.96 3 105 2 104 103 0 0 104 F7G6L6 Thioredoxin (Fragment) OS=Callithrix jacchus GN=LOC100896650 PE=3 SV=1
16 : G5C094_HETGA 0.92 0.97 10 105 1 96 96 0 0 96 G5C094 Thioredoxin (Fragment) OS=Heterocephalus glaber GN=GW7_17004 PE=4 SV=1
17 : I3N4D9_SPETR 0.92 0.98 1 105 1 105 105 0 0 105 I3N4D9 Thioredoxin OS=Spermophilus tridecemlineatus PE=3 SV=1
18 : J9NWJ5_CANFA 0.92 0.96 5 105 1 101 101 0 0 101 J9NWJ5 Thioredoxin (Fragment) OS=Canis familiaris GN=TXN PE=3 SV=1
19 : K9IG40_DESRO 0.91 0.98 1 105 1 105 105 0 0 105 K9IG40 Thioredoxin OS=Desmodus rotundus PE=3 SV=1
20 : THIO_SHEEP 0.91 0.98 1 105 1 105 105 0 0 105 P50413 Thioredoxin OS=Ovis aries GN=TXN PE=1 SV=2
21 : F7ALV0_HORSE 0.90 0.98 10 105 2 97 96 0 0 97 F7ALV0 Thioredoxin (Fragment) OS=Equus caballus GN=TXN PE=4 SV=1
22 : G3TA82_LOXAF 0.90 0.95 7 104 1 98 98 0 0 98 G3TA82 Uncharacterized protein (Fragment) OS=Loxodonta africana GN=LOC100668902 PE=4 SV=1
23 : H0WA07_CAVPO 0.90 0.97 1 105 1 105 105 0 0 105 H0WA07 Thioredoxin OS=Cavia porcellus GN=LOC100729881 PE=3 SV=1
24 : THIO_HORSE 0.90 0.98 1 105 1 105 105 0 0 105 O97508 Thioredoxin OS=Equus caballus GN=TXN PE=3 SV=3
25 : G3S9Y6_GORGO 0.89 0.93 1 105 1 104 105 1 1 104 G3S9Y6 Thioredoxin OS=Gorilla gorilla gorilla PE=3 SV=1
26 : THIO_MOUSE 0.88 0.94 1 103 1 103 103 0 0 105 P10639 Thioredoxin OS=Mus musculus GN=Txn PE=1 SV=3
27 : THIO_RABIT 0.88 0.98 1 105 1 105 105 0 0 105 P08628 Thioredoxin OS=Oryctolagus cuniculus GN=TXN PE=1 SV=2
28 : G7P547_MACFA 0.87 0.94 1 105 1 109 109 1 4 109 G7P547 Thioredoxin OS=Macaca fascicularis GN=EGM_14384 PE=3 SV=1
29 : H2PDP6_PONAB 0.86 0.92 1 105 1 104 105 1 1 104 H2PDP6 Thioredoxin OS=Pongo abelii PE=3 SV=1
30 : F6YVA9_CALJA 0.82 0.92 1 105 1 105 105 0 0 105 F6YVA9 Thioredoxin OS=Callithrix jacchus GN=LOC100415403 PE=3 SV=1
31 : G3VX55_SARHA 0.82 0.93 1 104 1 104 104 0 0 105 G3VX55 Thioredoxin OS=Sarcophilus harrisii PE=3 SV=1
32 : F7A8L2_MACMU 0.81 0.87 1 105 1 100 105 1 5 100 F7A8L2 Thioredoxin OS=Macaca mulatta PE=3 SV=1
33 : F7H4M5_MACMU 0.77 0.80 1 105 1 85 105 1 20 85 F7H4M5 Uncharacterized protein OS=Macaca mulatta GN=LOC100426263 PE=4 SV=1
34 : T1E3X8_CROHD 0.72 0.86 1 102 1 102 102 0 0 105 T1E3X8 Thioredoxin OS=Crotalus horridus PE=3 SV=1
35 : D2GW32_AILME 0.71 0.79 1 105 1 104 105 1 1 104 D2GW32 Putative uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_000989 PE=4 SV=1
36 : D2HQ54_AILME 0.70 0.77 1 104 1 125 125 2 21 125 D2HQ54 Thioredoxin (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_013993 PE=3 SV=1
37 : U3FYX3_MICFL 0.67 0.83 1 105 1 105 105 0 0 105 U3FYX3 Thioredoxin OS=Micrurus fulvius PE=3 SV=1
38 : B6VFL5_BOMOR 0.65 0.85 1 104 1 104 104 0 0 105 B6VFL5 Thioredoxin OS=Bombina orientalis PE=3 SV=1
39 : B9EPV5_SALSA 0.62 0.78 1 97 1 99 99 1 2 99 B9EPV5 Thioredoxin OS=Salmo salar GN=THIO PE=4 SV=1
40 : C1BFF6_ONCMY 0.62 0.76 1 104 1 106 106 1 2 108 C1BFF6 Thioredoxin OS=Oncorhynchus mykiss GN=THIO PE=3 SV=1
41 : B2RYZ1_XENTR 0.61 0.80 1 104 1 104 104 0 0 105 B2RYZ1 Thioredoxin OS=Xenopus tropicalis GN=LOC100170420 PE=3 SV=1
42 : B5X6M7_SALSA 0.61 0.76 1 104 1 106 106 1 2 108 B5X6M7 Thioredoxin OS=Salmo salar GN=THIO PE=3 SV=1
43 : B9ELU3_SALSA 0.61 0.76 1 103 1 105 105 1 2 109 B9ELU3 Thioredoxin OS=Salmo salar GN=THIO PE=3 SV=1
44 : H0W6G8_CAVPO 0.61 0.78 1 104 12 115 104 0 0 116 H0W6G8 Thioredoxin (Fragment) OS=Cavia porcellus GN=LOC100735435 PE=3 SV=1
45 : B9ENS2_SALSA 0.60 0.76 1 104 1 106 106 1 2 108 B9ENS2 Thioredoxin OS=Salmo salar GN=THIO PE=3 SV=1
46 : C1BX46_ESOLU 0.60 0.76 1 104 1 106 106 1 2 108 C1BX46 Thioredoxin OS=Esox lucius GN=THIO PE=3 SV=1
47 : Q6GQ64_XENLA 0.60 0.77 1 104 1 104 104 0 0 105 Q6GQ64 Thioredoxin OS=Xenopus laevis GN=MGC80314 PE=3 SV=1
48 : B5XE86_SALSA 0.58 0.76 1 103 1 105 105 1 2 105 B5XE86 Thioredoxin OS=Salmo salar GN=THIO PE=3 SV=2
49 : B5XF92_SALSA 0.58 0.76 1 104 1 106 106 1 2 108 B5XF92 Thioredoxin OS=Salmo salar GN=THIO PE=3 SV=1
50 : C1BH28_ONCMY 0.58 0.76 1 104 1 106 106 1 2 108 C1BH28 Thioredoxin OS=Oncorhynchus mykiss GN=THIO PE=3 SV=1
51 : C1BH85_ONCMY 0.58 0.76 1 104 1 106 106 1 2 108 C1BH85 Thioredoxin OS=Oncorhynchus mykiss GN=THIO PE=3 SV=1
52 : G1FKJ7_9PERO 0.56 0.73 1 104 1 106 106 1 2 108 G1FKJ7 Thioredoxin (Fragment) OS=Epinephelus bruneus PE=2 SV=1
53 : Q7ZUI4_DANRE 0.56 0.79 1 103 1 105 105 1 2 108 Q7ZUI4 Thioredoxin OS=Danio rerio GN=zgc:56493 PE=3 SV=1
54 : H2ZY23_LATCH 0.55 0.72 1 104 1 104 104 0 0 105 H2ZY23 Thioredoxin OS=Latimeria chalumnae PE=3 SV=1
55 : G3UT77_MELGA 0.54 0.69 2 105 2 129 128 2 24 129 G3UT77 Thioredoxin (Fragment) OS=Meleagris gallopavo GN=LOC100540930 PE=3 SV=1
56 : F6S2N6_CALJA 0.52 0.64 7 105 6 92 101 3 16 92 F6S2N6 Uncharacterized protein OS=Callithrix jacchus PE=4 SV=1
57 : A4S3L5_OSTLU 0.50 0.73 1 103 1 101 103 2 2 105 A4S3L5 Thioredoxin OS=Ostreococcus lucimarinus (strain CCE9901) GN=OSTLU_26095 PE=3 SV=1
58 : B9EP13_SALSA 0.47 0.63 1 104 1 95 105 2 11 97 B9EP13 Thioredoxin OS=Salmo salar GN=THIO PE=4 SV=1
59 : B9ELH6_SALSA 0.46 0.63 1 104 1 95 105 2 11 97 B9ELH6 Thioredoxin OS=Salmo salar GN=THIO PE=4 SV=1
60 : TXND8_RAT 0.44 0.64 1 105 1 127 127 1 22 127 Q69AB1 Thioredoxin domain-containing protein 8 OS=Rattus norvegicus GN=Txndc8 PE=2 SV=1
61 : U3KPP5_RABIT 0.43 0.60 1 105 1 127 127 1 22 146 U3KPP5 Uncharacterized protein OS=Oryctolagus cuniculus GN=TXNDC8 PE=4 SV=1
62 : G3SDY1_GORGO 0.42 0.59 1 105 1 127 127 1 22 127 G3SDY1 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101143145 PE=4 SV=1
63 : F7I2U3_CALJA 0.41 0.60 1 105 1 127 127 1 22 127 F7I2U3 Uncharacterized protein OS=Callithrix jacchus GN=LOC100895386 PE=4 SV=1
64 : G3U840_LOXAF 0.40 0.60 9 105 1 119 119 1 22 119 G3U840 Uncharacterized protein (Fragment) OS=Loxodonta africana PE=4 SV=1
65 : TXND8_HUMAN 0.40 0.58 1 105 1 127 127 1 22 127 Q6A555 Thioredoxin domain-containing protein 8 OS=Homo sapiens GN=TXNDC8 PE=1 SV=2
66 : Q14AP0_MOUSE 0.39 0.58 1 105 27 153 127 1 22 153 Q14AP0 Thioredoxin domain containing 8 OS=Mus musculus GN=Txndc8 PE=2 SV=1
67 : TXND8_MOUSE 0.39 0.58 1 105 1 127 127 1 22 127 Q69AB2 Thioredoxin domain-containing protein 8 OS=Mus musculus GN=Txndc8 PE=1 SV=1
68 : F6Y264_HORSE 0.36 0.51 8 105 1 121 121 2 23 121 F6Y264 Uncharacterized protein (Fragment) OS=Equus caballus GN=TXNDC8 PE=4 SV=1
69 : H0VJY6_CAVPO 0.36 0.52 1 105 1 127 127 1 22 127 H0VJY6 Uncharacterized protein OS=Cavia porcellus PE=4 SV=1
## ALIGNMENTS 1 - 69
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A M 0 0 120 58 2 MMMMMMMM MM M M MM MMMMMMMMMMMMMMMMMMMMMVMMMMMMMMMM MMMMMMM MMM M
2 2 A V - 0 0 26 59 11 VVVVVVVV VV V V VV VVVVVVVVVVVVVVVVIIVIIVIIVIIIIVIVV VIIVVVV VVV V
3 3 A K E -a 55 0A 152 60 72 KKKKKKKK KK K K K KK QKKKKKKKKKKKKKKRLIRLLKLIRLLIIRVRE KLLQQQQ QKK Q
4 4 A Q E -a 56 0A 88 60 78 QQQQQQQQ QQ Q Q Q QQ QQQLQQQQQQQIQQIHVIHVVVVVHEEEEQVRK QVVKVII IRR V
5 5 A I - 0 0 13 61 9 IIIIIIII II I I IIII IIIIIIIIIIIVMIIIIIVIIIIVIIIIIVIIS IIIIIII III I
6 6 A E - 0 0 140 61 52 EEEEEEED EE E D EKEE EEKEEKKEDKEGEKGEDDEDDTDEEEEEEEEDP ADDKKKK KKK K
7 7 A S S > S- 0 0 48 63 59 SSSSSSSS SS S S SSSS SSSSSSSSSSQSSSSDNDDSDDGDDNDDDDTDNRSSDDSDDD DNN D
8 8 A K H > S+ 0 0 139 64 59 KKKKKKKK KK K K KKKK KKKKKKKKKKKKLEKMLKKLKKKKKLKKKKLQMEKAKKMTMM TMMQT
9 9 A T H > S+ 0 0 112 65 75 TTTTAACD YA Y D EYEY EESVESAVVEFATYYTEDEDDDEDEEDDDDDDESSEDDRDNNDNSSDD
10 10 A A H > S+ 0 0 25 70 59 AAAAAAAAAAAAAAAAAAAAADAAAAAAAANAAEAAQEDAEDDDDGESSSSDGQEAEDDEEEEEEEEEE
11 11 A F H X S+ 0 0 29 70 2 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFLFFFFLLFF
12 12 A Q H X S+ 0 0 112 70 77 QQQQQQKQQQQQQQQQQQQQQQQQQQQQQQEQQRQQRKLLQLLNLLQFFFFNDDEQALLKKKKQKKKKR
13 13 A E H >X S+ 0 0 132 69 65 EEEEEEEEEEEEEEEQEEEEEAEEEEEEEESEETEEAQNSNNNENKLSSSSAKAAE.NNETTTTTEETA
14 14 A A H >X S+ 0 0 30 70 64 AAAAAAAAAAAAAAAAAAAAAAAAAAVAAAEAAEAAEAAAIAAAAAVAAAAIAAEGAAALFFFFFLLFF
15 15 A L H 3< S+ 0 0 17 70 5 LLLLLLLLLLLLLLMLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVILLLLLLLLFFLL
16 16 A D H << S+ 0 0 120 70 69 DDDDDNDDNNNNNNDANNNNNNNNDADDDEKENSSNSVKTKKKKKEKKKKKKAKKAKKKGKTTKTSSKK
17 17 A A H << S+ 0 0 91 70 64 AAAADTAAGGTSSSAASSSSSSDSTASTTAGVTDGSNDEEEEEEENEEEEEEGESAAEEAAAATADDAA
18 18 A A S >< S- 0 0 8 70 4 AAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAVAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
19 19 A G T 3 - 0 0 41 69 3 GGGGGGGGGGGGGGGGGGGGGGGGGGGG.SGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
20 20 A D T 3 S+ 0 0 87 70 49 DDDDDDDDDDDEEDDDDDDEEDDENDDNGEEDDDDDSDDDDDDEDDGDDDDDDSEDKDDNYHHCHNNCH
21 21 A K S < S- 0 0 97 70 10 KKKKKKKKKKKKKKKKKKKKKKKKKKKKNKKKKKKKKKKKKKKRKKKKKKKKKKKRPKKRKKKKKKKKK
22 22 A L - 0 0 12 70 11 LLLLLLLLLLLLLLLLLLLLLLLLLLLLKLLPLLLPLLLLLLLLLLLLLLLLLLLLVLLLLLLLLLLLL
23 23 A V E -bC 53 80A 3 69 22 VVVVVVVVVVVVVVVVVVVVVVVV.VVVLVVGVIAVIVVVVVVVVVVVVVVVVVVVVVVVVAAVAVVAI
24 24 A V E -bC 54 79A 0 70 0 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
25 25 A V E -bC 55 78A 0 70 10 VVVVVVVVVVVVVVVVVVVVVVVVVVVdVVVVVVVVVVVVVVVVVVVVVVVVVVVADVVVVVVVVVVVV
26 26 A D E -bC 56 77A 10 69 19 DDDDDDDDDDDDDDDDDDDDDDDDDDDdDNDDDDDDDDHDDDDDDDDDDGDDDDDD.DDEEQEEQEEEE
27 27 A F E +bC 57 76A 0 70 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
28 28 A S E -b 58 0A 3 70 42 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTTTTTTSTTTTTTTTTFSSTTTSSSSSSSSSS
29 29 A A - 0 0 2 70 13 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAVAAAAAAAAAAAAAAAAAAAAAAAAAALSASAAAA
30 30 A T S S+ 0 0 47 70 46 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSSSSSTETTSTTQKKTKKKKKK
31 31 A W S S+ 0 0 170 70 4 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWHWWWWWWWWWWWWWWWWWWWWRWWWWWRWWRWWWW
32 32 A C S > S- 0 0 19 70 4 CCCCCCCCCCCCCCYCCCCCCCCCCCCCCYCCCCPCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
33 33 A G H > S+ 0 0 45 69 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG.GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
34 34 A P H 4 S+ 0 0 100 70 8 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPGPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPAP
35 35 A C H >4 S+ 0 0 14 70 2 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCVCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
36 36 A K H >< S+ 0 0 94 70 7 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKPKKKKKKKKRKKKKKKKKQKKKKKKKKKKKKKKKK
37 37 A M T 3< S+ 0 0 139 70 77 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMQMTMMVMMMNNMNNANNMNNNNQSMMMSNNMRRRRRTTRL
38 38 A I T <> S+ 0 0 18 70 17 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGTIIIIIIIIIIIIIIIIIIIIIIIIMMMMMIIII
39 39 A K H <>> + 0 0 91 70 81 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKEKKAAASAARAAAAAAAGAAKKAAAASFCCFAACG
40 40 A P H 45S+ 0 0 84 70 3 PPPPPPPPPPPPPPPPPPPPPPPPSPPPSPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
41 41 A F H 45S+ 0 0 98 70 41 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFSFFFFFFFFFFVFFLFFVFFFFHFKFFFFFALVILVVVLV
42 42 A F H <5S+ 0 0 2 70 7 FFFFFFYFFFFFFFFFFFFFFFFFFFFFFSFFFFFFFFFFFFFFFFFFFFFFYIFFFFFFFFFIFFFFF
43 43 A H T ><5S+ 0 0 71 70 52 HHHHHHHHHHHHHHNHHHHHHHHHHHHHHHHHHHHHHKKKEKKHKKEKKKKEKEHHEKKQHHHQHQQHH
44 44 A S T >> + 0 0 22 69 15 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL.MLLMLLLLLLLLLLLLLLLLLLLLLLMLMMMMMMSS
46 46 A S H <4 S+ 0 0 27 69 25 SSSSSSSSSSSSSSSSSSSSSSSSSCSSSSVS.VSSVSSSSSSSSSSSSSSSSSCAASSSSSSSSSSII
47 47 A E H <4 S+ 0 0 166 69 58 EEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEE.EEQEEEEVEELEEVEEEENEKDEAEELQVVLVLLTT
48 48 A K H < S+ 0 0 126 69 26 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK.KKKKKKKEKKKKKDKKKKKNTKKKKKQQKKQKKKSG
49 49 A Y < + 0 0 67 69 77 YYYYYYYYYYYYYYYYYYYYYYYYYYFYYNYY.YSYYYPPNPPHPPNPPPPAPYFNYPPYYYYYYYYIF
50 50 A S S S+ 0 0 87 69 80 SSSSSSSSSSSSSSSSSSTSSGSSSSNSSSPS.PSSPPEEPEEKEEPEEEEEDTGPPEERQQQQQQQYY
51 51 A N S S+ 0 0 99 69 37 NNNNNNNNNNNNNNSNNNSNNNKNNNNNNHDN.DNnDDnnDnnDnnDnnnnnyNDSENNNNNNNNNNfL
52 52 A V S S- 0 0 12 66 13 VVVVVVVVVVMVVVVVVVVVVVVVMVVVVMVM.VVvVVvvVvvVvvAvvvvvvVV.V..VVVVVVVVvT
53 53 A I E - b 0 23A 37 66 42 IIIIVVVVVVVVVLGLLVLVVVVVVVVVVVII.IVVVVVVVVVTVVVVVVVIVVV.E..MMFFMFTTLV
54 54 A F E - b 0 24A 1 66 12 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF.FFSFFFFFFFFFFFFFFFFFFF.F..FFFFFFFFEI
55 55 A L E -ab 3 25A 4 66 45 LLLLLLLLLLLLLLLLLLLLLLLLLLILLFIL.ILLILLLILLLLLLLLLLLLLI.V..AAVAAAAALT
56 56 A E E -ab 4 26A 21 66 58 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE.EQKEKKKKKKEKKKKKKKKKKE.K..QNNNNNQQHS
57 57 A V E - b 0 27A 0 66 12 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV.IVVIVVVVVVVVVVVVVVVVVI.I..VVVVVVVVAY
58 58 A D E > - b 0 28A 10 66 11 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDHDD.DDDDDDDDDDDDDDDDDDDDDD.D..DDDDDDDDLH
59 59 A V T 4 S+ 0 0 28 67 14 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV.VVVVVVVVVVAVVVVVVVVVVVIV..VIVVVVVVSF
60 60 A D T 4 S+ 0 0 87 67 28 DDDDDDDDDDDDDDDDDDDDDDDDDDDGDDDG.DDDDDDDDDDDDDDDDDDDDDDVD..DDNNDNDDFF
61 61 A D T 4 S+ 0 0 114 67 43 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD.DDDEDDDDDDDDDDEEEEDDEDVE..SDNNKNSSLL
62 62 A C >X + 0 0 1 69 61 CCCCCCCCCCCCCCCCCCCCCCCCYCCCCCCC.ASCAAAAAAACAAAAAAAAAAARLSSSSSSSSSSSS
63 63 A Q H 3> + 0 0 109 69 54 QQQQQQQQQQQQQQQQQQQQQQQQQQKQQQQQ.QQQQQAAQAAEAAQAAAAQQEQKENNQPPPRPKKQQ
64 64 A D H 34 S+ 0 0 71 70 25 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDDDDDDEDDDADVVEEEEEEEEEE
65 65 A V H <> S+ 0 0 5 70 24 VVVVVVVVVVVVVVVIVVVVVVVVVVIVVLVVVVVVVVVVVVVLVVVVVVVVVVVDVVVLLLLLLLLLL
66 66 A A H >X>S+ 0 0 7 70 20 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAVAAAAAAAAAAAVAFFTAAAAAAAAA
67 67 A S H 3<5S+ 0 0 89 70 79 SSSSSSSSAASASASAASSAASAALAASLSASSSSSSAQQAQQIQQAKKKKAQQTDALLEKEEQEEEQE
68 68 A E H 345S+ 0 0 132 70 66 EEEEEEEEEEEEEDEEDEEEEEEEEDEEEEEEEHEEHSHHHHHEHHHHHHHESHHDEKKHTTTTTHHTF
69 69 A C H <<5S- 0 0 23 70 21 CCCCCCCCCCCCCCCCCCCCCCCCYCCCWCCYCCYCCCCCCCCCCCCCCCCWCYCCCVVCCCCCCCCYC
70 70 A E T <5 - 0 0 161 70 37 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEDDEEDEEEEEDDDDDEEEDQGDDSHHHNHDDHN
71 71 A V < - 0 0 35 70 24 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVAVVVVSVVIIIVIIIIIVIIIIIIIVDIVVIVIIIIIIIV
72 72 A K + 0 0 188 70 35 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKVKKKKKKKRKTKVSDDQKKRKKTTKQ
73 73 A C S S- 0 0 70 70 54 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCYCCCCCCCCCCCCCCCCACACCADDVATRITMMAA
74 74 A T S S+ 0 0 67 70 35 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMTMMRMMMMMMMMLMMMMMMMMMMMFMAAVIIIIILLII
75 75 A P S S+ 0 0 13 70 8 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPpPPPPPPPPPPPPPPPPPPPRPaaPPPPPPPPPP
76 76 A T E -CD 27 90A 4 69 8 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTsTTTTTTTTTTTTTTTTTTT.TvvTTTTTTTTTT
77 77 A F E -CD 26 89A 1 68 10 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF.FFFFFFFFFFFFFFFFFFFFFFF.FAAFFFFFFFFFF
78 78 A Q E -CD 25 88A 0 68 32 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ.QQQQQHHHHHHHHHQHHHHIHIQ.QHHQQQQQQQQQQ
79 79 A F E -CD 24 87A 1 68 29 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF.FFFFFFFFFFFFFFFFFFFFFFF.VFFMMMMMMMMMM
80 80 A F E +CD 23 85A 18 68 5 FFFFFFFFFFFFFFFFFFFFFYFFFYFFFFY.FYFFYYYYYYYYYYYYYYYLYFY.YYYFFFFFFFFFF
81 81 A K E > S- D 0 84A 67 68 5 KKKKKKKKKKKKKKKKKKKKKKKKKKKKNKK.KKKKKKKKKKKKKKKKKKKKKKK.SKKKKKKKKKKKK
82 82 A K T 3 S- 0 0 146 69 58 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKNKKNNNNNNNKNQNNNNNNNDN.NNNHKKKQKYYQQ
83 83 A G T 3 S+ 0 0 38 70 37 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGNGGGGGGEGGGEEEEGGGgGGGGssssasttat
84 84 A Q E < S-D 81 0A 130 70 62 QQQQQQQQQQQQQQQEQQKQQVKQQQQQQQKEQEQQEEEQQEEEEKIKKKKEKKkMVEEknnnnnkkkk
85 85 A K E +D 80 0A 108 70 39 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKKKKKTKKKKKLLPQLMMPL
86 86 A V E + 0 0A 78 70 20 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVMVVVVVVVVVVVVVVVVLLYGVVVIIIIIIIIII
87 87 A G E +D 79 0A 16 70 87 GGGGGGGGGGGGGGGGGGGSDGSDGGGGGGGGGHSGHHDDHDDGDHDDDDDMDEggSDDFFFFFFFFFY
88 88 A E E +D 78 0A 86 70 20 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDDDEDDEDDEDDDDEDVepEDDEEEEEEEEEE
89 89 A F E -D 77 0A 40 70 20 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFLFFFFFFFVFFFVMFFFFFFMFCCFF
90 90 A S E +D 76 0A 59 70 40 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSCSSSSSSSVSCSFTSSQCCCCCHHCC
91 91 A G S S- 0 0 31 70 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
92 92 A A S S+ 0 0 68 70 13 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAASSSSASAAAAAAAAAAAAAAAA
93 93 A N + 0 0 72 70 25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNQNNNNNNNDNNNNDNNDDDDDDDDDD
94 94 A K S >> S+ 0 0 70 70 68 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKQQQQQQHQQQEEEERQAKKKQQIAAAAAAAAP
95 95 A E H 3> S+ 0 0 154 70 61 EEEEEEEEEEEEEEEEEEEEEEEEEEEEKEDEEEEGENAAAAAEADSAAAAVTDEEDAAEKKKKKKKEK
96 96 A K H 3> S+ 0 0 109 70 38 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKNKKKKKTTTSTTKTTSKKKKKKKKKKTTKKKIKKQQKK
97 97 A L H <> S+ 0 0 4 70 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
98 98 A E H >X S+ 0 0 104 68 26 EEEEEEEE EDEEEEEEEEEEEEEEEEEEEEEEEEEEE EIEEKEAIEEEEGEVEESEEEAEEEEEEEE
99 99 A A H 3X S+ 0 0 60 68 54 AAAAAAAA AAAAAAAAAAAAAAAAAAAAASAAEAAEQ EQEEAEEQEEEEKEQEAAEEETAAAAKKAA
100 100 A T H >X S+ 0 0 19 68 65 TTTTTTTT TTTTTTTTTTTTTTTTSTTTSTTTATTAK KKKKIKKKKKKKNMKTTLKKKKKKKKKKKK
101 101 A I H