Complet list of 2vik hssp file
Complete list of 2vik.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2VIK
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-26
HEADER ACTIN-BINDING PROTEIN 16-JAN-97 2VIK
COMPND MOL_ID: 1; MOLECULE: VILLIN 14T; CHAIN: A; FRAGMENT: RESIDUES 1 - 126;
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: GALLUS GALLUS; ORGANISM_COMMON: CHICKE
AUTHOR M.A.MARKUS,P.MATSUDAIRA,G.WAGNER
DBREF 2VIK A 1 126 UNP P02640 VILI_CHICK 2 127
SEQLENGTH 126
NCHAIN 1 chain(s) in 2VIK data set
NALIGN 81
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : VILI_CHICK 2LLF 1.00 1.00 1 126 2 127 126 0 0 826 P02640 Villin-1 OS=Gallus gallus GN=VIL1 PE=1 SV=2
2 : E1BU00_CHICK 0.98 0.98 1 126 2 126 126 1 1 825 E1BU00 Villin-1 OS=Gallus gallus GN=VIL1 PE=4 SV=2
3 : G1NG38_MELGA 0.98 0.99 1 126 2 127 126 0 0 827 G1NG38 Uncharacterized protein OS=Meleagris gallopavo GN=VIL1 PE=4 SV=2
4 : U3JN84_FICAL 0.89 0.96 3 126 1 124 124 0 0 860 U3JN84 Uncharacterized protein (Fragment) OS=Ficedula albicollis GN=VIL1 PE=4 SV=1
5 : H0Z4U6_TAEGU 0.88 0.95 1 126 2 127 126 0 0 827 H0Z4U6 Uncharacterized protein OS=Taeniopygia guttata GN=VIL1 PE=4 SV=1
6 : G1KAJ0_ANOCA 0.79 0.93 1 126 2 127 126 0 0 826 G1KAJ0 Uncharacterized protein OS=Anolis carolinensis GN=LOC100553035 PE=4 SV=1
7 : M7AK33_CHEMY 0.78 0.93 2 126 3 127 125 0 0 821 M7AK33 Villin-1 OS=Chelonia mydas GN=UY3_17446 PE=4 SV=1
8 : H2ZV12_LATCH 0.75 0.90 2 126 3 127 125 0 0 832 H2ZV12 Uncharacterized protein OS=Latimeria chalumnae PE=4 SV=1
9 : W5MRH2_LEPOC 0.74 0.89 2 126 3 127 125 0 0 825 W5MRH2 Uncharacterized protein OS=Lepisosteus oculatus PE=4 SV=1
10 : W5MRG1_LEPOC 0.73 0.89 2 126 7 131 125 0 0 865 W5MRG1 Uncharacterized protein (Fragment) OS=Lepisosteus oculatus PE=4 SV=1
11 : F8VVU1_HUMAN 0.71 0.89 16 126 14 124 111 0 0 157 F8VVU1 Advillin (Fragment) OS=Homo sapiens GN=AVIL PE=2 SV=1
12 : I3MSG2_SPETR 0.71 0.91 2 126 3 127 125 0 0 827 I3MSG2 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=VIL1 PE=4 SV=1
13 : B5DFA0_RAT 0.70 0.90 2 126 3 127 125 0 0 827 B5DFA0 Protein Vil1 OS=Rattus norvegicus GN=Vil1 PE=2 SV=1
14 : E9QJ38_DANRE 0.70 0.89 2 126 4 128 125 0 0 239 E9QJ38 Uncharacterized protein OS=Danio rerio GN=vil1 PE=4 SV=1
15 : F1QVU3_DANRE 0.70 0.89 2 126 4 128 125 0 0 834 F1QVU3 Uncharacterized protein OS=Danio rerio GN=vil1 PE=4 SV=1
16 : G3HWH4_CRIGR 0.70 0.90 2 126 3 127 125 0 0 827 G3HWH4 Villin-1 OS=Cricetulus griseus GN=I79_015332 PE=4 SV=1
17 : G3WU16_SARHA 0.70 0.89 2 126 3 127 125 0 0 827 G3WU16 Uncharacterized protein OS=Sarcophilus harrisii GN=VIL1 PE=4 SV=1
18 : Q802V3_DANRE 0.70 0.89 2 126 4 128 125 0 0 834 Q802V3 Vil1l protein OS=Danio rerio GN=vil1 PE=2 SV=1
19 : S7NLL5_MYOBR 0.70 0.89 2 126 3 127 125 0 0 827 S7NLL5 Villin-1 OS=Myotis brandtii GN=D623_10032903 PE=4 SV=1
20 : VILI_MOUSE 0.70 0.90 2 126 3 127 125 0 0 827 Q62468 Villin-1 OS=Mus musculus GN=Vil1 PE=1 SV=3
21 : A4II76_XENTR 0.69 0.90 2 126 3 127 125 0 0 824 A4II76 LOC100124960 protein OS=Xenopus tropicalis GN=vil1 PE=2 SV=1
22 : F6XHE5_XENTR 0.69 0.90 2 126 3 127 125 0 0 817 F6XHE5 Uncharacterized protein OS=Xenopus tropicalis GN=vil1 PE=4 SV=1
23 : F7DCH9_XENTR 0.69 0.90 2 126 3 127 125 0 0 820 F7DCH9 Uncharacterized protein OS=Xenopus tropicalis GN=vil1 PE=4 SV=1
24 : G1L8D7_AILME 0.69 0.90 2 126 3 127 125 0 0 827 G1L8D7 Uncharacterized protein OS=Ailuropoda melanoleuca GN=VIL1 PE=4 SV=1
25 : L5LYI0_MYODS 0.69 0.89 2 126 3 127 125 0 0 827 L5LYI0 Villin-1 OS=Myotis davidii GN=MDA_GLEAN10022923 PE=4 SV=1
26 : Q7SY93_XENLA 0.69 0.89 2 126 3 127 125 0 0 824 Q7SY93 Vil1-prov protein OS=Xenopus laevis GN=vil1 PE=2 SV=1
27 : Q8CEJ4_MOUSE 0.69 0.90 2 126 3 127 125 0 0 827 Q8CEJ4 Putative uncharacterized protein OS=Mus musculus GN=Vil1 PE=2 SV=1
28 : W5KFH2_ASTMX 0.69 0.87 1 126 4 129 126 0 0 842 W5KFH2 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
29 : F6QZZ5_HORSE 0.68 0.90 2 126 3 127 125 0 0 790 F6QZZ5 Uncharacterized protein OS=Equus caballus GN=VIL1 PE=4 SV=1
30 : G1TQT8_RABIT 0.68 0.92 2 126 3 127 125 0 0 828 G1TQT8 Uncharacterized protein OS=Oryctolagus cuniculus GN=VIL1 PE=4 SV=1
31 : G5AZI8_HETGA 0.68 0.90 2 126 3 127 125 0 0 827 G5AZI8 Villin-1 OS=Heterocephalus glaber GN=GW7_09495 PE=4 SV=1
32 : H0WPB5_OTOGA 0.68 0.90 2 126 3 127 125 0 0 827 H0WPB5 Uncharacterized protein OS=Otolemur garnettii GN=VIL1 PE=4 SV=1
33 : L8IXS3_9CETA 0.68 0.90 2 126 3 127 125 0 0 827 L8IXS3 Villin-1 OS=Bos mutus GN=M91_13928 PE=4 SV=1
34 : M3YYM2_MUSPF 0.68 0.89 2 126 3 127 125 0 0 827 M3YYM2 Uncharacterized protein OS=Mustela putorius furo GN=VIL1 PE=4 SV=1
35 : Q5E9Z3_BOVIN 0.68 0.90 2 126 3 127 125 0 0 827 Q5E9Z3 Villin 1 OS=Bos taurus GN=VIL1 PE=2 SV=1
36 : W5QE76_SHEEP 0.68 0.90 2 126 3 127 125 0 0 812 W5QE76 Uncharacterized protein OS=Ovis aries GN=VIL1 PE=4 SV=1
37 : VILI_BOVIN 0.67 0.90 2 126 3 127 125 0 0 827 Q3SZP7 Villin-1 OS=Bos taurus GN=VIL1 PE=2 SV=3
38 : W5UJQ2_ICTPU 0.67 0.85 1 126 4 129 126 0 0 830 W5UJQ2 Villin-1 OS=Ictalurus punctatus GN=VIL1 PE=2 SV=1
39 : E2RPX1_CANFA 0.66 0.88 2 126 3 127 125 0 0 827 E2RPX1 Uncharacterized protein OS=Canis familiaris GN=VIL1 PE=4 SV=1
40 : F1SRY3_PIG 0.66 0.90 2 126 3 127 125 0 0 827 F1SRY3 Villin-1 OS=Sus scrofa GN=VIL1 PE=4 SV=1
41 : F6TIS5_CALJA 0.66 0.90 2 126 3 127 125 0 0 750 F6TIS5 Uncharacterized protein OS=Callithrix jacchus GN=VIL1 PE=4 SV=1
42 : F7CQ79_MACMU 0.66 0.90 2 126 3 127 125 0 0 827 F7CQ79 Uncharacterized protein OS=Macaca mulatta GN=VIL1 PE=4 SV=1
43 : F7EEJ6_CALJA 0.66 0.90 2 126 3 127 125 0 0 830 F7EEJ6 Uncharacterized protein OS=Callithrix jacchus GN=VIL1 PE=4 SV=1
44 : G1RBF3_NOMLE 0.66 0.90 2 126 3 127 125 0 0 827 G1RBF3 Uncharacterized protein OS=Nomascus leucogenys GN=VIL1 PE=4 SV=1
45 : G3REI7_GORGO 0.66 0.90 2 126 3 127 125 0 0 827 G3REI7 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101126593 PE=4 SV=1
46 : G3U836_LOXAF 0.66 0.89 2 126 3 127 125 0 0 827 G3U836 Uncharacterized protein OS=Loxodonta africana GN=VIL1 PE=4 SV=1
47 : G7PLF6_MACFA 0.66 0.90 2 126 3 127 125 0 0 827 G7PLF6 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_04317 PE=4 SV=1
48 : H2P8L0_PONAB 0.66 0.90 2 126 15 139 125 0 0 839 H2P8L0 Uncharacterized protein OS=Pongo abelii GN=VIL1 PE=4 SV=2
49 : H2QJE9_PANTR 0.66 0.90 2 126 3 127 125 0 0 827 H2QJE9 Uncharacterized protein OS=Pan troglodytes GN=VIL1 PE=4 SV=1
50 : Q53F91_HUMAN 0.66 0.90 2 126 3 127 125 0 0 827 Q53F91 Villin 1 variant (Fragment) OS=Homo sapiens PE=2 SV=1
51 : S9WT99_9CETA 0.66 0.90 2 126 3 127 125 0 0 833 S9WT99 Villin-1 isoform 1 OS=Camelus ferus GN=CB1_000931037 PE=4 SV=1
52 : VILI_HUMAN 3FG7 0.66 0.90 2 126 3 127 125 0 0 827 P09327 Villin-1 OS=Homo sapiens GN=VIL1 PE=1 SV=4
53 : VILI_PIG 0.66 0.90 2 126 3 127 125 0 0 827 Q29261 Villin-1 OS=Sus scrofa GN=VIL1 PE=2 SV=2
54 : C9J2B5_HUMAN 0.65 0.88 3 126 4 123 124 1 4 157 C9J2B5 Villin-1 (Fragment) OS=Homo sapiens GN=VIL1 PE=2 SV=2
55 : H0WCU7_CAVPO 0.65 0.89 2 126 3 127 125 0 0 827 H0WCU7 Uncharacterized protein OS=Cavia porcellus GN=VIL1 PE=4 SV=1
56 : B5T1M4_EPICO 0.57 0.77 17 126 15 125 111 1 1 149 B5T1M4 Gelsolin a (Fragment) OS=Epinephelus coioides PE=2 SV=1
57 : G1N751_MELGA 0.56 0.74 1 126 47 173 127 1 1 778 G1N751 Uncharacterized protein OS=Meleagris gallopavo GN=GSN PE=4 SV=1
58 : F1NKF3_CHICK 0.55 0.74 1 126 47 173 127 1 1 778 F1NKF3 Uncharacterized protein OS=Gallus gallus GN=GSN PE=4 SV=1
59 : GELS_CHICK 0.55 0.74 1 126 47 173 127 1 1 778 O93510 Gelsolin OS=Gallus gallus GN=GSN PE=2 SV=1
60 : R0L173_ANAPL 0.55 0.74 1 126 47 173 127 1 1 778 R0L173 Gelsolin (Fragment) OS=Anas platyrhynchos GN=Anapl_16652 PE=4 SV=1
61 : U3ICM6_ANAPL 0.55 0.74 1 126 50 176 127 1 1 781 U3ICM6 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=GSN PE=4 SV=1
62 : H3D5C6_TETNG 0.52 0.75 2 126 3 128 127 2 3 836 H3D5C6 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis PE=4 SV=1
63 : C9JGB6_HUMAN 0.48 0.65 21 126 22 154 133 1 27 160 C9JGB6 Adseverin (Fragment) OS=Homo sapiens GN=SCIN PE=2 SV=1
64 : B8BC75_ORYSI 0.47 0.70 1 126 66 192 128 2 3 310 B8BC75 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_28365 PE=4 SV=1
65 : B8BC76_ORYSI 0.47 0.70 1 126 16 142 128 2 3 967 B8BC76 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_28366 PE=4 SV=1
66 : Q6Z3N3_ORYSJ 0.47 0.70 1 126 16 142 128 2 3 911 Q6Z3N3 Putative villin OS=Oryza sativa subsp. japonica GN=OSJNBa0087F21.6 PE=4 SV=1
67 : C9JUR8_HUMAN 0.44 0.69 6 126 2 125 124 1 3 130 C9JUR8 Villin-like protein (Fragment) OS=Homo sapiens GN=VILL PE=2 SV=1
68 : E9PFV5_HUMAN 0.44 0.69 6 126 2 125 124 1 3 187 E9PFV5 Villin-like protein (Fragment) OS=Homo sapiens GN=VILL PE=2 SV=1
69 : L7M9M8_9ACAR 0.44 0.66 11 125 11 138 128 2 13 845 L7M9M8 Putative villin-1 OS=Rhipicephalus pulchellus PE=2 SV=1
70 : M8BBC3_AEGTA 0.44 0.61 17 126 10 141 132 2 22 961 M8BBC3 Villin-4 OS=Aegilops tauschii GN=F775_06800 PE=4 SV=1
71 : V5HJ32_IXORI 0.43 0.62 2 126 341 465 125 0 0 662 V5HJ32 Putative scinderin like b (Fragment) OS=Ixodes ricinus PE=2 SV=1
72 : V5HKG4_IXORI 0.43 0.63 11 126 11 139 129 2 13 496 V5HKG4 Putative actin regulatory gelsolin/villin family OS=Ixodes ricinus PE=2 SV=1
73 : B7PVR6_IXOSC 0.42 0.62 2 126 336 460 125 0 0 693 B7PVR6 Villin, putative (Fragment) OS=Ixodes scapularis GN=IscW_ISCW007199 PE=4 SV=1
74 : U6NVF4_HAECO 0.40 0.67 3 126 44 163 124 2 4 322 U6NVF4 Gelsolin region domain containing protein OS=Haemonchus contortus GN=HCOI_00941700 PE=4 SV=1
75 : B3RRZ6_TRIAD 0.39 0.56 18 126 18 149 136 3 31 834 B3RRZ6 Putative uncharacterized protein OS=Trichoplax adhaerens GN=TRIADDRAFT_54421 PE=4 SV=1
76 : E9FXU5_DAPPU 0.39 0.62 15 126 22 148 127 2 15 833 E9FXU5 Putative uncharacterized protein OS=Daphnia pulex GN=DAPPUDRAFT_311761 PE=4 SV=1
77 : G4N933_MAGO7 0.33 0.47 17 119 52 171 129 2 35 426 G4N933 Gelsolin repeat-containing protein OS=Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) GN=MGG_10059 PE=4 SV=1
78 : L7HRM5_MAGOY 0.33 0.47 17 119 52 171 129 2 35 426 L7HRM5 Gelsolin repeat-containing protein OS=Magnaporthe oryzae (strain Y34) GN=OOU_Y34scaffold00923g6 PE=4 SV=1
79 : L7ITT9_MAGOP 0.33 0.47 17 119 52 171 129 2 35 426 L7ITT9 Gelsolin repeat-containing protein OS=Magnaporthe oryzae (strain P131) GN=OOW_P131scaffold01546g15 PE=4 SV=1
80 : S9XM81_9CETA 0.33 0.54 18 126 131 262 141 3 41 635 S9XM81 Villin-like isoform 1 OS=Camelus ferus GN=CB1_000128033 PE=4 SV=1
81 : L8WX45_THACA 0.31 0.49 21 126 51 177 127 3 21 383 L8WX45 Actin regulatory protein OS=Thanatephorus cucumeris (strain AG1-IA) GN=AG1IA_03407 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A V 0 0 103 16 12 VVV VV V V VVVVV III
2 2 A E > - 0 0 115 64 69 EEE EEEEQQ NKEEKKEKKEEEKKEKEKEDKKKKKKQRKKKKKKKKKKKKKK N SSSSST SSS
3 3 A L T 4 + 0 0 3 67 21 LLLLLLLLVV LLVVLLVLLLLLLLLLVLLLLLLLLLTLLLLLLLLLLLLLLLLL MMMMML III
4 4 A S T 4 S+ 0 0 31 67 65 SSSSSNTSKK STKKTSKSNTTTSSTNKSSSSSSSSSKTNSSSSSSSSSSSSNSS VVVVVK SSS
5 5 A K T 4 S+ 0 0 170 67 75 KKKAAAASSS AATTTATAAEEEAAEASSASAAAAAATTAAAAAAAAAAAAAAAS EEEEES RRR
6 6 A K S < S+ 0 0 181 69 68 KKKKKKKTQQ QQDDQQDQQKKKQQKQEQQQQQQQQQDQQQQQQQQQQQQQQQQQ HHHHHV VVVDD
7 7 A V S > S+ 0 0 10 69 39 VVVVVVVVVV VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV AAAAAF EEEII
8 8 A T T 3 S+ 0 0 16 69 72 TTTSSTTTSS KKPPKKPKKKKKKKKKPKKKKKKKKKKKKKKKKKKKKKKKKKKK EEEEEK VVVSS
9 9 A G T 3 S- 0 0 45 69 90 GGGRRKKKKK GGKKGSKGGKKKGSKGKSGSGGDGGGKNGSGSGGGGGGGGGGGN FFFFFG NNNKK
10 10 A K S < S+ 0 0 174 69 67 KKKTTTTIVV SSVVSSVSSTTTSSSSVSSSSSSSSSVSSSSSSSSSSSSSSSSS SSSSSS EEEGG
11 11 A L S S- 0 0 8 71 46 LLLLLLLLLL LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL KKKKKK IIILLL
12 12 A D + 0 0 24 71 55 DDDNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN AAAAAS TTTPPP
13 13 A K S S- 0 0 119 71 89 KKKKKKRKKK ITKKTIKIIKKKIIKIKIIIIIVIIIKIVIIIIITIIIITIVII GGGGGF MMMGGK
14 14 A T S S+ 0 0 111 71 53 TTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTTTT KKKKKM IIIMMN
15 15 A T - 0 0 71 72 44 TTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT EEEEEK TTTQQS
16 16 A P + 0 0 98 73 31 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP PPPPPP GGGGGT
17 17 A G E S-A 50 0A 31 78 20 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG GGGGGFG
18 18 A I E -A 49 0A 23 80 24 IIIIILLLLLIVILLLILVILLLVVLILVVVLVVVVVLVVLLLLLLLLLLVLVLVLLLLLLL LLLLLFL
19 19 A Q E -A 48 0A 30 80 43 QQQQQQQQQQIQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ EEEHHTE
20 20 A I E -AB 47 31A 0 80 5 IIIIIIIIIIVIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVIIIIII IIIIIII
21 21 A W E -AB 46 30A 1 82 4 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWCWWWWWWWWWWWWWWWWWWWWWWWW
22 22 A R E -A 45 0A 62 82 16 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRTRRRRIIRR
23 23 A I > + 0 0 0 82 14 IIIIIIIIIIIIIVVIIVIIVVVIIVIVIIIIIIIIIVIIIIIIIIIIIIIIIIIVIIIIIVIIIISSII
24 24 A E G > + 0 0 14 82 4 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEENEEEEEEEE
25 25 A N G 3 S- 0 0 29 82 73 NNNSNSKKNNKAANNAANAASSSDAKANAAAAAAAAAKAAAAAAAAAAAASAAAANKKKKKKKNNNNNKN
26 26 A M G < S+ 0 0 70 82 36 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMFFFFFFMLFFFRRMF
27 27 A E S < S- 0 0 91 82 53 EEEEEEENEEEQQEEQKEQQNNNKQEQEQQQQQKQQQEKQQQQQQQQQQQQQQQQDDDDDDKEKKKKKQK
28 28 A M + 0 0 2 82 29 MMMMMMMMMMLMMLLMMLMMLLLMMLMLMMMMMMMMMLMMMMMMMMMMMMMMMMMLLLLLLLLPPPMMLP
29 29 A V - 0 0 38 82 12 VVVVVVVVVVAVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
30 30 A P E -B 21 0A 76 82 14 PPPPPPPPPPLPPPPPPPPPPPPPPPRPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPQP
31 31 A V E -B 20 0A 4 82 40 VVVVVVVVVVVVVCCVVCVVVVVVVVVCVVVVVVVVVCVVVVVVVVVVVVVVVVVVVVVVVVVIIIVVLV
32 32 A P >> - 0 0 65 82 12 PPPPPPPPPPPPPPPPPPPPPPPPPPPPAPPPPPPPPPPSPPPPPPPPPPPPSPPPPPPPPSPPPPPPPP
33 33 A T T 34 S+ 0 0 96 81 73 T.TTTPPKPPVSSSSSSSPSEEESPESSSSSSSSSSSSSSSSSSSSSSSSSSSSSKKKKKKAQAAAEEKT
34 34 A K T 34 S+ 0 0 192 82 67 KTKKKKKKKKSSSKKSNKSSNNNSSNSKNNSSNSNNNKSSSSSSSSSSSSSSSSHNNNNNNRSSSSGGDS
35 35 A S T X4 S+ 0 0 54 82 75 STSSSTSSSSATTTTSSTTTSSSTTSTTTTTTSTSSSTTTTTTTTTTTTTTTTTTLLLLLLSASSSAASS
36 36 A Y T 3< S+ 0 0 99 82 22 YYYYYYYYYYHYFFFFFFFFYYYFFHFYYYFFFFFFFYFYFFFFFFFFFFFFYFVYYYYYYFHYYYYYYH
37 37 A G T 3 S+ 0 0 3 82 4 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
38 38 A N < + 0 0 82 82 79 NNNNNNNNNNNSSQQNSQSSNNNSSNSQNSTSSSSSSQSSSSSSSSSSSSSSSSTGDDDDDTDKKKNNYK
39 39 A F - 0 0 3 82 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
40 40 A Y - 0 0 36 82 6 YYYYYYYFFFYFFFFFFFFFFFFYFFFFFFYFFYFFFYFFFFFFFFFFFFFFFFYYFFFFFFYFFFFFFY
41 41 A E S S+ 0 0 97 82 55 EEEEEEEEEEEDDEEDDEDDDDDDDEDEEEDDDDDDDEDDDDDDDDDDDDDDDDDSTTTTTEVMMMEENM
42 42 A G S S+ 0 0 8 82 9 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGEEGG
43 43 A D S S- 0 0 2 82 7 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDHHDD
44 44 A C - 0 0 0 82 36 CCCCCACCSSCCCSSCCSCCCCCCCCCSCCCCCCCCCSCCCCCCCCCCCCCCCCCASSSSSCASSSCCSS
45 45 A Y E -AC 22 64A 1 82 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
46 46 A V E -AC 21 63A 0 82 21 VVVVVVVIIIVVVVVVIVVVVVVIVVVVVVVVVIVVVLIIIIIIIIIIIIVIIIVLLLLLLVLIIIVVII
47 47 A L E -AC 20 62A 0 82 23 LLLLLIVLIIILVIIVVIVVLLLVVLVTVIVVIVIIIIVVIIIIIVIIIIVIVIVIVVVVVVVIIIIIVI
48 48 A L E -AC 19 61A 0 82 18 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLAL
49 49 A S E -AC 18 60A 17 82 89 SSSSSSVSYYSAAYYAAYAAMMMAAMAYAAAAAAAAAHAAAAAAAAAAAAAAAAANNNNNNNHKKKHHVK
50 50 A T E -AC 17 59A 4 82 62 TTTTTTTTTTTIITTIITIITTTIITITITIIIIIIITIIIIIIIIIIIIIIIIITTTTTTITTTTVVAL
51 51 A R E - C 0 58A 168 82 81 RRRRRHQHHHRHHHHHHHHHHHHHHHHQHHHHHHHHHQHHHHHHHHHHHHHHHHHIIIIIIsAtttppsl
52 52 A K E + C 0 57A 151 81 15 KKKKKKKKKKRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRKKkKkkkkkka
53 53 A T - 0 0 62 81 65 TTTSSSTTTTVTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTQQQQQQSTNNNAAKL
54 54 A G S S+ 0 0 93 82 58 GGGGGGGRSSAGGSSSGSGSGGGGGGSSGSGGGGGGGSGGAAAAAGAAAAGAGAARRRRRRWSGGGTTAK
55 55 A S S S- 0 0 121 82 61 SSSSSSNYNNSSSNNSSNSSNNNSSNSSNNSSSSSSSSSSSSSSSSSSSSSSSSNSSSSNNDRSSSQQVN
56 56 A G S S- 0 0 29 82 67 GGSNNTSSNNLNTNNTTNNTNNNSNNTSNNNTNNNNNTNNNNNNNNNNNSNSNSNggggggQGLLLGGgg
57 57 A F E -C 52 0A 52 72 37 FFFFFFFFYYLLLFFLLFLLFFFLLFLYLLLLLLLLLFLLLLLLLTLLLLLLLLLllllll.F...AAlf
58 58 A S E -C 51 0A 61 76 66 SSSSSTSTTTSSSSSTSSSSTTTSSTSSSSTSSSSSSTSSSSSSSSSSSSSSSSTQQQQQQRTRRRSSDR
59 59 A Y E -C 50 0A 18 77 43 YYYYYYYYYYQYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYAYHHHSSIH
60 60 A N E -C 49 0A 42 77 27 NNNDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDHDDDDDHD
61 61 A I E -Cd 48 94A 0 78 15 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIVIIIIIIIIIIIIIIIIIIIIIIIIILLLLLLILIIILLII
62 62 A H E -Cd 47 95A 0 78 6 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
63 63 A Y E -Cd 46 96A 34 78 3 YYYYYYYYYYFYYYYYYYYYFFFYYFYYYFYYYYYYYYYYYYYYYYYYYYYYYYYFFFFFFFFYYYYYFY
64 64 A W E -Cd 45 97A 2 78 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
65 65 A L E - d 0 98A 9 78 35 LLLLLLLILLIIILLILLIIVVVIIVILLIIIIIIIILIIIIIIIIIIIIIIIIILLLLLLILIIIVVLL
66 66 A G > - 0 0 0 82 7 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGgGGGGGGG
67 67 A K T 3 S+ 0 0 135 81 63 KKKKKNQKKKKKQKKQQKQQNNNQQNQKQQQQQQQQQKQQQQQQQQQQQQQQQQQEDDDDDRkKKKKKAK
68 68 A N T 3 + 0 0 105 81 64 NNNEESETAADDDSSNASADDDDAADDEAADAAAAAAAADNDNDDDDDDDADDDDAEEEEEAEDDDQQQD
69 69 A S S < S- 0 0 31 81 45 SSSSSSSSSSSSSTTSSTSSSSSSSSSTSSSSSSSSSTSSSSSSSSSSSSSSSSSCSSSSSSCTTTAATT
70 70 A S >> - 0 0 53 82 22 SSSSSSWSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSSSSSTSSSSGGSS
71 71 A Q H 3> S+ 0 0 172 82 45 QQQQQQQQQQQQQQQQQQQQMMMQQMQVLQQQQQQQQVQQQLQLLQLLLLQLQLQQQQQQQVQQQQAAQQ
72 72 A D H 3> S+ 0 0 103 82 12 DDDDDDEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEEDD
73 73 A E H <> S+ 0 0 5 82 12 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAEE
74 74 A Q H X S+ 0 0 69 82 58 QQQQQQQQQQQQQMMQQMQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSRRRRRQSSSSQQAA
75 75 A G H X S+ 0 0 30 82 13 GGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTGGGGGGG
76 76 A A H X S+ 0 0 14 82 36 AAAAAAAAAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAAAAAATTTAAVT
77 77 A A H X S+ 0 0 0 82 13 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
78 78 A A H X S+ 0 0 24 82 10 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEEAA
79 79 A I H X S+ 0 0 91 82 32 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIAAYI
80 80 A Y H >X S+ 0 0 33 82 48 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFFFFFFYFLLLFFKL
81 81 A T H 3X S+ 0 0 0 82 20 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTVTTTTQQTT
82 82 A T H 3X S+ 0 0 68 82 59 TTTTTTTSTTTTTTTTTTTTIIITTITTTTTTTTTTTTTTTTTTTTTTTTTTTTTVVVVVVAVVVVQQVV
83 83 A Q H S+ 0 0 38 82 10 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDQQDD
86 86 A E H ><5S+ 0 0 142 82 29 EEDDDEETEEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDAAADDDA
87 87 A Y H 3<5S+ 0 0 149 82 75 YYHHHHHHHHYFYHHYFHFYHHHFFHYHFFFFFFFFFHFFFFFFFFFFFFFFFFFFYYYYYHYAAAEEFA
88 88 A L T 3<5S- 0 0 20 82 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
89 89 A G T < 5 - 0 0 47 82 70 GGGGGGGGGGGKKGGKKGKKGGGKKGKGKKKKKKKKKGKKKKKKKKKKKKKKKKQGQQQQQGGGGGGGGG
90 90 A S S > -d 65 0A 19 82 38 VVVVAVVVAAVVVTTVVTVVVVVVVAVAVVVVVVVVVTVVVVVVVIVVVVVVVVVVVVVVVVLIIIAAVS
99 99 A Q T 34 S+ 0 0 118 82 1 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
100 100 A G T 34 S+ 0 0 58 82 28 GGGGGNGHGGYGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
101 101 A H T <4 S+ 0 0 94 82 63 HHHHHYHYYYHNHHHNHHNNHHHNNHNYNNNNNNNNNYNNNNNNNNNNNNNNNNNYHHHHHNYNNNHHFN
102 102 A E S X>S+ 0 0 19 82 12 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
103 103 A S H >5S- 0 0 61 82 17 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTTTSSST
104 104 A E H >5S+ 0 0 153 82 49 EEEEDENEEEDEDAAEDAEEDDDEEDEPEEEDDEDDDPEEEEEEEEEEEEEEEEEKSSSAAANDDDDDQE
105 105 A T H >5S+ 0 0 53 82 75 TTTTTAIITTTATTTTITTTTTTATTTTATTTTATTTKAAAAAAAAAAAATAAATTTTTTTLDKKKCCRK
106 106 A F H X5S+ 0 0 0 82 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFL
107 107 A R H < + 0 0 44 79 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
121 121 A V T 3 + 0 0 55 79 22 VVVVVVVIVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVLLVV
122 122 A A T 3 S+ 0 0 69 79 13 AAAAAAAAAAAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
123 123 A S < + 0 0 60 79 8 SSSSSSSTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
124 124 A G S S+ 0 0 70 79 4 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDDGG
125 125 A M 0 0 157 79 30 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMLMMMMMMMMMMMMMMMMMMMMLMMMLFFFFFFLLFFFLLLF
126 126 A K 0 0 240 78 31 KKKKKKKKKKKKKKKKKKKKNNNKKNKKKKKKKKKKKKKKKNKKKKNKKKKKKKKKRRRRRQNKKKKK N
## ALIGNMENTS 71 - 81
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A V 0 0 103 16 12
2 2 A E > - 0 0 115 64 69 N N
3 3 A L T 4 + 0 0 3 67 21 L LM
4 4 A S T 4 S+ 0 0 31 67 65 G GA
5 5 A K T 4 S+ 0 0 170 67 75 R RQ
6 6 A K S < S+ 0 0 181 69 68 S SD
7 7 A V S > S+ 0 0 10 69 39 I II
8 8 A T T 3 S+ 0 0 16 69 72 G GD
9 9 A G T 3 S- 0 0 45 69 90 F FV
10 10 A K S < S+ 0 0 174 69 67 M MS
11 11 A L S S- 0 0 8 71 46 PLPS
12 12 A D + 0 0 24 71 55 DPDV
13 13 A K S S- 0 0 119 71 89 DKDG
14 14 A T S S+ 0 0 111 71 53 GNGK
15 15 A T - 0 0 71 72 44 SSSE S
16 16 A P + 0 0 98 73 31 GTGP T
17 17 A G E S-A 50 0A 31 78 20 KFKG AGGG
18 18 A I E -A 49 0A 23 80 24 LFLLLFLLLL
19 19 A Q E -A 48 0A 30 80 43 ETEEEILLLH
20 20 A I E -AB 47 31A 0 80 5 IIIVVIVVVI
21 21 A W E -AB 46 30A 1 82 4 FWFWWWWWWWW
22 22 A R E -A 45 0A 62 82 16 RRRRRRRRRIR
23 23 A I > + 0 0 0 82 14 IIIIIVIIIII
24 24 A E G > + 0 0 14 82 4 EEEKEEEEEEE
25 25 A N G 3 S- 0 0 29 82 73 NKNESDRRRND
26 26 A M G < S+ 0 0 70 82 36 FMFFMLFFFLF
27 27 A E S < S- 0 0 91 82 53 EQETQKEEEKK
28 28 A M + 0 0 2 82 29 LILLVLVVVMV
29 29 A V - 0 0 38 82 12 AVAEVVVVVVV
30 30 A P E -B 21 0A 76 82 14 PQPAPQPPPPP
31 31 A V E -B 20 0A 4 82 40 VFVVYLWWWVW
32 32 A P >> - 0 0 65 82 12 DPDPPPPPPPP
33 33 A T T 34 S+ 0 0 96 81 73 PKPKKKRRRRD
34 34 A K T 34 S+ 0 0 192 82 67 ADANDEDDDRD
35 35 A S T X4 S+ 0 0 54 82 75 IQIQKSRRRAR
36 36 A Y T 3< S+ 0 0 99 82 22 YYYYYHYYYYK
37 37 A G T 3 S+ 0 0 3 82 4 GGGGGGGGGRG
38 38 A N < + 0 0 82 82 79 FTFLEKQQQNQ
39 39 A F - 0 0 3 82 0 FFFFFFFFFFF
40 40 A Y - 0 0 36 82 6 FYFYYHYYYFY
41 41 A E S S+ 0 0 97 82 55 GTGSTADDDED
42 42 A G S S+ 0 0 8 82 9 GGGDGGGGGEG
43 43 A D S S- 0 0 2 82 7 DDDDDDDDDHD
44 44 A C - 0 0 0 82 36 SSSTSSSSSCS
45 45 A Y E -AC 22 64A 1 82 0 YYYYFYYYYYY
46 46 A V E -AC 21 63A 0 82 21 IIIIILIIIII
47 47 A L E -AC 20 62A 0 82 23 IVVVIIVVVVI
48 48 A L E -AC 19 61A 0 82 18 KVKLLYLLLLL
49 49 A S E -AC 18 60A 17 82 89 YVYNHSHHHHH
50 50 A T E -AC 17 59A 4 82 62 TATSTASSSTT
51 51 A R E - C 0 58A 168 82 81 YsYTkfWWWky
52 52 A K E + C 0 57A 151 81 15 KkK.kaKKKrt
53 53 A T - 0 0 62 81 65 KKK.VAVVVIP
54 54 A G S S+ 0 0 93 82 58 GAGDESGGGAG
55 55 A S S S- 0 0 121 82 61 YVYSWSKKKST
56 56 A G S S- 0 0 29 82 67 SgSGNgEEEQq
57 57 A F E -C 52 0A 52 72 37 DlD..l....l
58 58 A S E -C 51 0A 61 76 66 RDR..E....A
59 59 A Y E -C 50 0A 18 77 43 YIYW.R....H
60 60 A N E -C 49 0A 42 77 27 VHVD.F....D
61 61 A I E -Cd 48 94A 0 78 15 IIIVII....L
62 62 A H E -Cd 47 95A 0 78 6 YHYHHH....H
63 63 A Y E -Cd 46 96A 34 78 3 FFFFFY....F
64 64 A W E -Cd 45 97A 2 78 0 WWWWWW....W
65 65 A L E - d 0 98A 9 78 35 QLQILL....L
66 66 A G > - 0 0 0 82 7 GGGGgGnnngg
67 67 A K T 3 S+ 0 0 135 81 63 NANKvTrrrxd
68 68 A N T 3 + 0 0 105 81 64 EQETTEYYYXD
69 69 A S S < S- 0 0 31 81 45 STSAVTTTTXR
70 70 A S >> - 0 0 53 82 22 SSSSVTSSSAK
71 71 A Q H 3> S+ 0 0 172 82 45 QQQQKQQQQGL
72 72 A D H 3> S+ 0 0 103 82 12 DDDDDDDDDTF
73 73 A E H <> S+ 0 0 5 82 12 EEEEEEEEEAI
74 74 A Q H X S+ 0 0 69 82 58 KAKRAAAAAEN
75 75 A G H X S+ 0 0 30 82 13 AGAGGGGGGDH
76 76 A A H X S+ 0 0 14 82 36 AVATVVTTTAF
77 77 A A H X S+ 0 0 0 82 13 SASAAVAAAAR
78 78 A A H X S+ 0 0 24 82 10 AAAAAAAAAGP
79 79 A I H X S+ 0 0 91 82 32 IYIIYIYYYSN
80 80 A Y H >X S+ 0 0 33 82 48 WKWKKKKKKFL
81 81 A T H 3X S+ 0 0 0 82 20 ATATTATTTVT
82 82 A T H 3X S+ 0 0 68 82 59 VVVVVVVVVQT
83 83 A Q H S+ 0 0 38 82 10 DDDDDDDDDQV
86 86 A E H ><5S+ 0 0 142 82 29 NDNQDDEEEEV
87 87 A Y H 3<5S+ 0 0 149 82 75 DHDAHYFFFAD
88 88 A L T 3<5S- 0 0 20 82 0 LLLLLLLLLLL
89 89 A G T < 5 - 0 0 47 82 70 NGNNGGRRRGG
90 90 A S S > -d 65 0A 19 82 38 VVVVVVLLLAC
99 99 A Q T 34 S+ 0 0 118 82 1 QQQQQEQQQQQ
100 100 A G T 34 S+ 0 0 58 82 28 GGGNEGKKKGY
101 101 A H T <4 S+ 0 0 94 82 63 HFHHHSCCCHY
102 102 A E S X>S+ 0 0 19 82 12 EEEEEEMMMEE
103 103 A S H >5S- 0 0 61 82 17 PSPSTSSSSSS
104 104 A E H >5S+ 0 0 153 82 49 EQEPKTDDDAQ
105 105 A T H >5S+ 0 0 53 82 75 HRHLRREEECR
106 106 A F H X5S+ 0 0 0 82 0 FFFFFFFFFFF
107 107 A R H < + 0 0 44 79 0 GGGGGG GG
121 121 A V T 3 + 0 0 55 79 22 HVHYVA LV
122 122 A A T 3 S+ 0 0 69 79 13 AVAKAA AR
123 123 A S < + 0 0 60 79 8 SSSTSS ST
124 124 A G S S+ 0 0 70 79 4 GGGGGG GG
125 125 A M 0 0 157 79 30 FFFFFF LF
126 126 A K 0 0 240 78 31 KKKHKK SS
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 81 0 19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 16 0 0 0.483 16 0.88
2 2 A 0 0 0 0 0 0 0 0 0 0 13 2 0 0 2 45 5 27 6 2 64 0 0 1.482 49 0.30
3 3 A 9 76 4 9 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 67 0 0 0.842 28 0.79
4 4 A 7 0 0 0 0 0 0 3 1 0 57 12 0 0 0 12 0 0 7 0 67 0 0 1.384 46 0.34
5 5 A 0 0 0 0 0 0 0 0 51 0 12 9 0 0 7 6 1 13 0 0 67 0 0 1.508 50 0.24
6 6 A 6 0 0 0 0 0 0 0 0 0 3 1 0 7 0 17 54 1 0 10 69 0 0 1.451 48 0.31
7 7 A 80 0 7 0 1 0 0 0 7 0 0 0 0 0 0 0 0 4 0 0 69 0 0 0.759 25 0.61
8 8 A 4 0 0 0 0 0 0 3 0 6 9 10 0 0 0 59 0 7 0 1 69 0 0 1.409 47 0.27
9 9 A 1 0 0 0 10 0 0 45 0 0 9 0 0 0 3 23 0 0 7 1 69 0 0 1.558 52 0.10
10 10 A 10 0 1 3 0 0 0 3 0 0 62 10 0 0 0 6 0 4 0 0 69 0 0 1.327 44 0.33
11 11 A 0 83 4 0 0 0 0 0 0 3 1 0 0 0 0 8 0 0 0 0 71 0 0 0.657 21 0.54
12 12 A 1 0 0 0 0 0 0 0 7 6 1 4 0 0 0 0 0 0 72 8 71 0 0 1.049 35 0.45
13 13 A 4 0 39 4 1 0 0 11 0 0 0 6 0 0 1 30 0 0 0 3 71 0 0 1.623 54 0.11
14 14 A 0 0 4 4 0 0 0 3 0 0 0 76 0 0 0 8 0 0 4 0 71 0 0 0.919 30 0.46
15 15 A 0 0 0 0 0 0 0 0 0 0 7 81 0 0 0 1 3 8 0 0 72 0 0 0.725 24 0.55
16 16 A 0 0 0 0 0 0 0 10 0 86 0 4 0 0 0 0 0 0 0 0 73 0 0 0.483 16 0.69
17 17 A 0 0 0 0 3 0 0 94 1 0 0 0 0 0 0 3 0 0 0 0 78 0 0 0.306 10 0.80
18 18 A 21 61 14 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 80 0 0 1.025 34 0.75
19 19 A 0 4 3 0 0 0 0 0 0 0 0 3 0 4 0 0 77 10 0 0 80 0 0 0.859 28 0.57
20 20 A 9 0 91 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 80 0 0 0.297 9 0.94
21 21 A 0 0 0 0 2 96 0 0 0 0 0 0 1 0 0 0 0 0 0 0 82 0 0 0.180 6 0.96
22 22 A 0 0 4 0 0 0 0 0 0 0 0 1 0 0 95 0 0 0 0 0 82 0 0 0.222 7 0.84
23 23 A 15 0 83 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 82 0 0 0.527 17 0.85
24 24 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 98 1 0 82 0 0 0.132 4 0.95
25 25 A 0 0 0 0 0 0 0 0 40 0 9 0 0 0 4 17 0 1 26 4 82 0 0 1.523 50 0.27
26 26 A 0 4 0 73 21 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 82 0 0 0.766 25 0.64
27 27 A 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 17 41 28 5 7 82 0 0 1.416 47 0.47
28 28 A 6 28 1 60 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 82 0 0 1.036 34 0.70
29 29 A 95 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 1 0 0 82 0 0 0.222 7 0.87
30 30 A 0 1 0 0 0 0 0 0 1 93 0 0 0 0 1 0 4 0 0 0 82 0 0 0.353 11 0.85
31 31 A 80 2 4 0 1 5 1 0 0 0 0 0 6 0 0 0 0 0 0 0 82 0 0 0.812 27 0.59
32 32 A 0 0 0 0 0 0 0 0 1 93 4 0 0 0 0 0 0 0 0 2 82 1 0 0.336 11 0.87
33 33 A 1 0 0 0 0 0 0 0 5 10 47 7 0 0 5 15 1 7 0 1 81 0 0 1.712 57 0.26
34 34 A 0 0 0 0 0 0 0 2 2 0 40 1 0 1 2 18 0 1 22 9 82 0 0 1.653 55 0.32
35 35 A 0 7 2 0 0 0 0 0 6 0 32 44 0 0 5 1 2 0 0 0 82 0 0 1.470 49 0.25
36 36 A 1 0 0 0 40 0 51 0 0 0 0 0 0 6 0 1 0 0 0 0 82 0 0 0.987 32 0.77
37 37 A 0 0 0 0 0 0 0 99 0 0 0 0 0 0 1 0 0 0 0 0 82 0 0 0.066 2 0.96
38 38 A 0 1 0 0 2 0 1 1 0 0 38 5 0 0 0 6 11 1 26 7 82 0 0 1.774 59 0.20
39 39 A 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.000 0 1.00
40 40 A 0 0 0 0 70 0 29 0 0 0 0 0 0 1 0 0 0 0 0 0 82 0 0 0.666 22 0.93
41 41 A 1 0 0 5 0 0 0 2 1 0 2 9 0 0 0 0 0 29 1 49 82 0 0 1.410 47 0.45
42 42 A 0 0 0 0 0 0 0 95 0 0 0 0 0 0 0 0 0 4 0 1 82 0 0 0.222 7 0.91
43 43 A 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 96 82 0 0 0.157 5 0.92
44 44 A 0 0 0 0 0 0 0 0 4 0 32 1 63 0 0 0 0 0 0 0 82 0 0 0.828 27 0.64
45 45 A 0 0 0 0 1 0 99 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.066 2 1.00
46 46 A 45 11 44 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.963 32 0.78
47 47 A 41 15 43 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 82 0 0 1.063 35 0.76
48 48 A 1 93 0 0 0 0 1 0 1 0 0 0 0 0 0 2 1 0 0 0 82 0 0 0.376 12 0.82
49 49 A 4 0 0 5 0 0 10 0 43 0 12 0 0 12 0 5 0 0 10 0 82 0 0 1.746 58 0.10
50 50 A 2 1 43 0 0 0 0 0 4 0 5 45 0 0 0 0 0 0 0 0 82 0 0 1.135 37 0.38
51 51 A 0 1 7 0 1 4 4 0 1 2 4 5 0 56 9 2 4 0 0 0 82 1 13 1.700 56 0.19
52 52 A 0 0 0 0 0 0 0 0 2 0 0 1 0 0 5 91 0 0 0 0 81 0 0 0.377 12 0.85
53 53 A 6 1 1 0 0 0 0 0 4 1 5 65 0 0 0 5 7 0 4 0 81 0 0 1.346 44 0.34
54 54 A 0 0 0 0 0 1 0 49 20 0 16 2 0 0 9 1 0 1 0 1 82 0 0 1.477 49 0.41
55 55 A 2 0 0 0 0 1 4 0 0 0 63 1 0 0 1 4 2 0 20 1 82 0 0 1.246 41 0.39
56 56 A 0 5 0 0 0 0 0 21 0 0 12 10 0 0 0 0 4 4 45 0 82 10 11 1.558 52 0.33
57 57 A 0 63 0 0 26 0 4 0 3 0 0 1 0 0 0 0 0 0 0 3 72 0 0 1.036 34 0.62
58 58 A 0 0 0 0 0 0 0 0 1 0 61 17 0 0 9 0 8 1 0 3 76 0 0 1.236 41 0.33
59 59 A 0 0 3 0 0 1 83 0 1 0 3 0 0 6 1 0 1 0 0 0 77 0 0 0.747 24 0.57
60 60 A 3 0 0 0 1 0 0 0 0 0 0 0 0 4 0 0 0 0 5 87 77 0 0 0.552 18 0.72
61 61 A 3 13 85 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.499 16 0.84
62 62 A 0 0 0 0 0 0 3 0 0 0 0 0 0 97 0 0 0 0 0 0 78 0 0 0.119 3 0.93
63 63 A 0 0 0 0 27 0 73 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.582 19 0.97
64 64 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
65 65 A 8 38 51 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 78 0 0 1.001 33 0.64
66 66 A 0 0 0 0 0 0 0 96 0 0 0 0 0 0 0 0 0 0 4 0 82 0 7 0.157 5 0.92
67 67 A 1 0 0 0 0 0 0 0 2 0 0 1 0 0 5 30 43 1 9 7 81 0 0 1.530 51 0.37
68 68 A 0 0 0 0 0 0 4 0 23 0 5 4 0 0 0 0 5 16 9 35 81 0 0 1.754 58 0.36
69 69 A 1 0 0 0 0 0 0 0 4 0 73 19 2 0 1 0 0 0 0 0 81 0 0 0.865 28 0.54
70 70 A 1 0 0 0 0 1 0 2 1 0 89 4 0 0 0 1 0 0 0 0 82 0 0 0.530 17 0.78
71 71 A 4 13 0 5 0 0 0 1 2 0 0 0 0 0 0 1 73 0 0 0 82 0 0 0.964 32 0.55
72 72 A 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 4 0 94 82 0 0 0.288 9 0.87
73 73 A 0 0 1 0 0 0 0 0 4 0 0 0 0 0 0 0 0 95 0 0 82 0 0 0.222 7 0.87
74 74 A 0 0 0 4 0 0 0 0 10 0 6 0 0 0 7 2 68 1 1 0 82 0 0 1.168 39 0.41
75 75 A 0 0 0 0 0 0 0 93 2 0 1 1 0 1 0 0 0 0 0 1 82 0 0 0.376 12 0.86
76 76 A 5 0 0 0 1 0 0 0 82 0 1 10 1 0 0 0 0 0 0 0 82 0 0 0.701 23 0.63
77 77 A 1 0 0 0 0 0 0 0 95 0 2 0 0 0 1 0 0 0 0 0 82 0 0 0.246 8 0.87
78 78 A 0 0 0 0 0 0 0 1 95 1 0 0 0 0 0 0 0 2 0 0 82 0 0 0.246 8 0.90
79 79 A 0 0 88 0 0 0 7 0 2 0 1 0 0 0 0 0 0 0 1 0 82 0 0 0.504 16 0.67
80 80 A 0 6 0 0 12 2 70 0 0 0 0 0 0 0 0 10 0 0 0 0 82 0 0 0.998 33 0.52
81 81 A 2 0 0 0 0 0 0 0 4 0 0 91 0 0 0 0 2 0 0 0 82 0 0 0.384 12 0.79
82 82 A 26 0 5 0 0 0 0 0 1 0 1 63 0 0 0 0 4 0 0 0 82 0 0 1.014 33 0.40
83 83 A 0 5 0 1 0 0 0 0 0 0 0 0 0 1 2 2 73 15 0 0 82 0 0 0.946 31 0.59
84 84 A 0 21 4 74 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 82 0 0 0.721 24 0.86
85 85 A 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 95 82 0 0 0.222 7 0.90
86 86 A 1 0 0 0 0 0 0 0 5 0 0 1 0 0 0 0 1 15 2 74 82 0 0 0.900 30 0.70
87 87 A 0 0 0 0 44 0 18 0 7 0 0 0 0 24 0 0 0 2 0 4 82 0 0 1.419 47 0.25
88 88 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.000 0 1.00
89 89 A 0 0 0 0 0 0 0 44 0 0 0 0 0 0 4 41 7 0 4 0 82 0 0 1.160 38 0.30
90 90 A 0 0 0 0 0 0 0 91 0 0 9 0 0 0 0 0 0 0 0 0 82 0 0 0.292 9 0.91
91 91 A 17 2 0 0 0 0 0 1 6 0 6 2 0 2 49 10 4 0 0 0 82 0 0 1.667 55 0.20
92 92 A 0 0 0 0 0 0 0 0 84 12 0 4 0 0 0 0 0 0 0 0 82 0 0 0.523 17 0.74
93 93 A 88 0 7 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 82 0 0 0.453 15 0.84
94 94 A 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 98 0 0 0 82 0 0 0.115 3 0.92
95 95 A 2 0 0 0 0 0 7 0 0 0 0 0 0 88 0 0 1 0 1 0 82 0 0 0.504 16 0.73
96 96 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 82 0 0 0.000 0 1.00
97 97 A 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 98 0 0 82 0 0 0.115 3 0.92
98 98 A 73 5 5 0 0 0 0 0 10 0 1 5 1 0 0 0 0 0 0 0 82 0 0 1.005 33 0.61
99 99 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 99 1 0 0 82 0 0 0.066 2 0.98
100 100 A 0 0 0 0 0 0 2 89 0 0 0 0 0 1 0 4 0 1 2 0 82 0 0 0.513 17 0.72
101 101 A 0 0 0 0 2 0 11 0 0 0 1 0 4 35 0 0 0 0 46 0 82 0 0 1.232 41 0.37
102 102 A 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 96 0 0 82 0 0 0.157 5 0.88
103 103 A 0 0 0 0 0 0 0 0 0 2 91 6 0 0 0 0 0 0 0 0 82 0 0 0.343 11 0.83
104 104 A 0 0 0 0 0 0 0 0 9 4 4 1 0 0 0 2 4 49 2 26 82 0 0 1.507 50 0.50
105 105 A 0 2 4 0 0 0 0 0 24 0 0 46 4 2 6 6 0 4 0 1 82 0 0 1.640 54 0.25
106 106 A 0 1 0 0 99 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.066 2 0.99
107 107 A 5 21 1 1 0 0 0 0 0 0 0 0 2 0 55 9 6 0 0 0 82 0 0 1.382 46 0.17
108 108 A 0 0 0 0 0 0 0 61 12 0 23 1 0 0 2 0 0 0 0 0 82 0 0 1.041 34 0.56
109 109 A 0 4 0 2 0 0 90 0 0 0 0 0 0 4 0 0 0 0 0 0 82 0 0 0.425 14 0.79
110 110 A 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.000 0 1.00
111 111 A 0 0 0 0 0 0 0 0 0 9 0 0 0 0 9 83 0 0 0 0 82 1 0 0.575 19 0.69
112 112 A 0 0 0 0 0 0 0 4 0 7 7 0 0 5 6 12 54 0 2 1 81 0 0 1.564 52 0.36
113 113 A 0 0 4 0 0 0 0 89 0 0 0 0 5 0 2 0 0 0 0 0 81 0 0 0.467 15 0.68
114 114 A 1 42 49 2 1 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 81 0 0 1.035 34 0.64
115 115 A 43 0 34 6 0 0 0 0 0 0 1 0 0 0 8 9 0 0 0 0 80 0 0 1.366 45 0.41
116 116 A 0 4 50 0 0 0 41 0 0 5 0 0 0 0 0 0 0 0 0 0 82 0 0 0.980 32 0.34
117 117 A 0 5 0 1 2 0 0 0 0 0 4 0 0 0 33 34 21 0 0 0 82 0 0 1.472 49 0.33
118 118 A 0 0 0 0 0 0 0 4 9 6 9 1 0 0 0 63 7 0 1 0 82 0 0 1.299 43 0.37
119 119 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 82 0 0 0.000 0 1.00
120 120 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 79 0 0 0.000 0 1.00
121 121 A 90 4 1 0 0 0 1 0 1 0 0 0 0 3 0 0 0 0 0 0 79 0 0 0.479 15 0.77
122 122 A 1 0 0 0 0 0 0 0 95 0 1 0 0 0 1 1 0 0 0 0 79 0 0 0.271 9 0.87
123 123 A 0 0 0 0 0 0 0 0 0 0 96 4 0 0 0 0 0 0 0 0 79 0 0 0.161 5 0.92
124 124 A 0 0 0 0 0 0 0 97 0 0 0 0 0 0 0 0 0 0 0 3 79 0 0 0.118 3 0.96
125 125 A 0 11 0 67 22 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 79 0 0 0.846 28 0.69
126 126 A 0 0 0 0 0 0 0 0 0 0 3 0 0 1 6 78 1 0 10 0 78 0 0 0.808 26 0.68
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
56 41 55 1 gNl
57 57 103 1 gNl
58 57 103 1 gNl
59 57 103 1 gNl
60 57 103 1 gNl
61 57 106 1 gNl
62 51 53 2 sQNk
63 47 68 27 gFYPHHAACGSGRQKLRFIRKLASSDRRk
64 52 117 2 tALk
65 52 67 2 tALk
66 52 67 2 tALk
67 47 48 3 pQSPk
68 47 48 3 pQSPk
69 42 52 12 sDSTEKPNCYMKSk
69 47 69 1 gNl
70 36 45 21 lRYCLRNQKIPPSCSFVSLQTTa
70 41 71 1 gSf
72 42 52 12 sDNTEKPNCFMKSk
72 47 69 1 gAl
75 35 52 5 kTLPSGk
75 46 68 22 gKDTSRVRKFNVLIRTKVIFNYKv
76 38 59 14 fETGQPCGTLLQQIKa
76 43 78 1 gKl
77 42 93 26 nQEGQQQAADSEQEQNKLGHEIFFWLGr
78 42 93 26 nQEGQQQAADSEQEQNKLGHEIFFWLGr
79 42 93 26 nQEGQQQAADSEQEQNKLGHEIFFWLGr
80 35 165 11 kLRPGESLARSPr
80 41 182 21 gGLQRLALLGGEGGEPGRGGRXx
81 32 82 2 yKKt
81 37 89 1 qAl
81 47 100 18 gSQTSLDEAGTAAYKTVELd
//