Complete list of 2dgz.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2DGZ
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-18
HEADER DNA BINDING PROTEIN 16-MAR-06 2DGZ
COMPND MOL_ID: 1; MOLECULE: WERNER SYNDROME PROTEIN VARIANT; CHAIN: A; FRAGME
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR C.ABE,Y.MUTO,M.INOUE,T.KIGAWA,T.TERADA,M.SHIROUZU, S.YOKOYAMA,RIKEN ST
DBREF 2DGZ A 1140 1239 UNP Q14191 WRN_HUMAN 1140 1239
SEQLENGTH 113
NCHAIN 1 chain(s) in 2DGZ data set
NALIGN 77
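The THRESHOLD line in the header above gives the homology cutoff as a function of length. A minimal sketch of that formula follows, assuming that L is the alignment length in residues and that t(L) is a percent-identity value; both readings are assumptions, and the function name hssp_threshold is hypothetical.

```python
def hssp_threshold(length: int) -> float:
    """Evaluate t(L) = (290.15 * L ** -0.562) + 5 from the THRESHOLD header line.

    Assumption: `length` is the alignment length in residues and the
    result is a percent-identity cutoff.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    return 290.15 * length ** -0.562 + 5


if __name__ == "__main__":
    # Print the cutoff for a few illustrative alignment lengths.
    for n in (50, 100, 113):
        print(n, round(hssp_threshold(n), 2))
```

The curve falls as the length grows, so longer alignments are accepted at lower identity under this reading.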
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the aligned protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637 (1983))
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the aligned sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the aligned sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
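The ENTROPY and RELENT notes above can be illustrated with a short sketch. It assumes a Shannon entropy with natural logarithm over the 20 profile columns, and a RELENT obtained by dividing by ln(20), scaling to 100, and truncating. These conventions are inferred from the values printed in this file rather than stated in it, and the function names are hypothetical. Because the profile percentages are rounded to integers, the recomputed entropy only approximates the printed one.

```python
import math

# Column order of the SEQUENCE PROFILE block in this file.
PROFILE_COLUMNS = "V L I M F W Y G A P S T C H R K Q E N D".split()


def profile_entropy(percentages):
    """Shannon entropy (natural log) of one profile row given in percent."""
    probs = [p / 100.0 for p in percentages if p > 0]
    return -sum(p * math.log(p) for p in probs)


def relative_entropy(entropy, alphabet_size=20):
    """Entropy rescaled to 0-100 by ln(alphabet_size), truncated (assumed)."""
    return int(100 * entropy / math.log(alphabet_size))


# Profile row for SeqNo 1 as printed in this file (G=10, A=2, S=78, T=10).
row_1 = [0, 0, 0, 0, 0, 0, 0, 10, 2, 0, 78, 10, 0, 0, 0, 0, 0, 0, 0, 0]

if __name__ == "__main__":
    h = profile_entropy(row_1)
    print(round(h, 3), relative_entropy(h))
```

For comparison, the file lists ENTROPY 0.738 and RELENT 24 for that position; a fully conserved position such as SeqNo 24 (100% L) has entropy 0.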
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : Q59F09_HUMAN 0.94 0.96 8 113 550 655 106 0 0 842 Q59F09 Werner syndrome protein variant (Fragment) OS=Homo sapiens PE=2 SV=1
2 : WRN_HUMAN 2FBX 0.93 0.95 7 113 1139 1245 107 0 0 1432 Q14191 Werner syndrome ATP-dependent helicase OS=Homo sapiens GN=WRN PE=1 SV=2
3 : F7FM29_MACMU 0.90 0.93 7 111 971 1075 105 0 0 1265 F7FM29 Uncharacterized protein (Fragment) OS=Macaca mulatta GN=WRN PE=4 SV=1
4 : G7PD58_MACFA 0.90 0.93 7 111 944 1048 105 0 0 1237 G7PD58 Putative uncharacterized protein (Fragment) OS=Macaca fascicularis GN=EGM_17222 PE=4 SV=1
5 : G3S1P3_GORGO 0.87 0.93 1 113 1103 1215 113 0 0 1402 G3S1P3 Uncharacterized protein (Fragment) OS=Gorilla gorilla gorilla PE=4 SV=1
6 : G3S912_GORGO 0.87 0.93 1 113 1105 1217 113 0 0 1404 G3S912 Uncharacterized protein OS=Gorilla gorilla gorilla PE=4 SV=1
7 : F1RX70_PIG 0.86 0.91 8 113 1111 1216 106 0 0 1409 F1RX70 Uncharacterized protein OS=Sus scrofa GN=WRN PE=4 SV=2
8 : G1RN00_NOMLE 0.86 0.93 1 113 1134 1246 113 0 0 1433 G1RN00 Uncharacterized protein OS=Nomascus leucogenys GN=WRN PE=4 SV=1
9 : G7MZ24_MACMU 0.86 0.91 1 111 1132 1242 111 0 0 1431 G7MZ24 Werner syndrome ATP-dependent helicase OS=Macaca mulatta GN=EGK_18829 PE=4 SV=1
10 : H2QW01_PANTR 0.86 0.93 1 113 1133 1245 113 0 0 1432 H2QW01 Uncharacterized protein OS=Pan troglodytes GN=WRN PE=4 SV=1
11 : F7I5U1_CALJA 0.84 0.90 7 113 1196 1304 109 1 2 1491 F7I5U1 Uncharacterized protein OS=Callithrix jacchus GN=WRN PE=4 SV=1
12 : G5C9Z1_HETGA 0.84 0.93 8 112 1114 1218 105 0 0 1411 G5C9Z1 Werner syndrome ATP-dependent helicase OS=Heterocephalus glaber GN=GW7_06043 PE=4 SV=1
13 : H2PQ01_PONAB 0.84 0.92 1 113 1133 1245 113 0 0 1432 H2PQ01 Uncharacterized protein OS=Pongo abelii GN=WRN PE=4 SV=1
14 : Q5RDV4_PONAB 0.83 0.91 1 113 1187 1299 113 0 0 1486 Q5RDV4 Putative uncharacterized protein DKFZp459L2333 OS=Pongo abelii GN=DKFZp459L2333 PE=2 SV=1
15 : D2I555_AILME 0.82 0.89 8 113 854 959 106 0 0 1113 D2I555 Putative uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_020784 PE=4 SV=1
16 : G1L0X7_AILME 0.82 0.89 8 113 920 1025 106 0 0 1218 G1L0X7 Uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=WRN PE=4 SV=1
17 : G1T3R3_RABIT 0.81 0.90 1 113 1245 1357 113 0 0 1548 G1T3R3 Uncharacterized protein OS=Oryctolagus cuniculus GN=WRN PE=4 SV=2
18 : I3LC91_PIG 0.81 0.87 1 113 1202 1314 113 0 0 1507 I3LC91 Uncharacterized protein OS=Sus scrofa GN=WRN PE=4 SV=1
19 : H0UZQ5_CAVPO 0.80 0.88 2 113 1136 1247 112 0 0 1433 H0UZQ5 Uncharacterized protein OS=Cavia porcellus GN=WRN PE=4 SV=1
20 : G9KY41_MUSPF 0.79 0.87 8 113 11 119 109 1 3 272 G9KY41 Werner syndrome, RecQ helicase-like protein (Fragment) OS=Mustela putorius furo PE=2 SV=1
21 : F7D3Z6_HORSE 0.78 0.88 1 113 1124 1236 113 0 0 1388 F7D3Z6 Uncharacterized protein OS=Equus caballus GN=WRN PE=4 SV=1
22 : F7DK54_HORSE 0.78 0.88 1 113 1095 1207 113 0 0 1360 F7DK54 Uncharacterized protein (Fragment) OS=Equus caballus GN=WRN PE=4 SV=1
23 : S7PEM3_MYOBR 0.77 0.89 1 113 1102 1214 113 0 0 1642 S7PEM3 Werner syndrome ATP-dependent helicase OS=Myotis brandtii GN=D623_10033321 PE=4 SV=1
24 : G1P5R1_MYOLU 0.76 0.88 1 113 1081 1193 113 0 0 1386 G1P5R1 Uncharacterized protein (Fragment) OS=Myotis lucifugus GN=WRN PE=4 SV=1
25 : L5KDB0_PTEAL 0.76 0.88 1 113 1008 1120 113 0 0 1338 L5KDB0 Werner syndrome ATP-dependent helicase OS=Pteropus alecto GN=PAL_GLEAN10021595 PE=4 SV=1
26 : L8INR5_9CETA 0.76 0.81 1 113 1125 1237 113 0 0 1430 L8INR5 Werner syndrome ATP-dependent helicase OS=Bos mutus GN=M91_11317 PE=4 SV=1
27 : M3YFL8_MUSPF 0.76 0.88 1 113 1092 1204 113 0 0 1397 M3YFL8 Uncharacterized protein OS=Mustela putorius furo GN=WRN PE=4 SV=1
28 : U6CWW3_NEOVI 0.76 0.88 1 113 1092 1204 113 0 0 1397 U6CWW3 Werner syndrome ATP-dependent helicase OS=Neovison vison GN=WRN PE=2 SV=1
29 : E1BEE6_BOVIN 0.75 0.81 1 113 1099 1211 113 0 0 1404 E1BEE6 Uncharacterized protein OS=Bos taurus GN=WRN PE=4 SV=2
30 : F1PUF8_CANFA 0.75 0.88 1 113 1225 1337 113 0 0 1574 F1PUF8 Uncharacterized protein OS=Canis familiaris GN=WRN PE=4 SV=2
31 : F1PZR2_CANFA 0.75 0.88 1 113 1112 1224 113 0 0 1336 F1PZR2 Uncharacterized protein (Fragment) OS=Canis familiaris GN=WRN PE=4 SV=2
32 : F1PZR3_CANFA 0.75 0.88 1 113 1193 1305 113 0 0 1499 F1PZR3 Uncharacterized protein OS=Canis familiaris GN=WRN PE=4 SV=2
33 : G3H625_CRIGR 0.75 0.87 1 113 1104 1216 113 0 0 1405 G3H625 Werner syndrome ATP-dependent helicase-like OS=Cricetulus griseus GN=I79_005758 PE=4 SV=1
34 : L5LTT1_MYODS 0.75 0.88 1 113 1093 1205 113 0 0 1398 L5LTT1 Werner syndrome ATP-dependent helicase OS=Myotis davidii GN=MDA_GLEAN10024960 PE=4 SV=1
35 : G3SU95_LOXAF 0.74 0.88 3 113 1103 1215 113 1 2 1410 G3SU95 Uncharacterized protein (Fragment) OS=Loxodonta africana GN=WRN PE=4 SV=1
36 : W5NPQ2_SHEEP 0.74 0.82 1 113 1101 1213 113 0 0 1406 W5NPQ2 Uncharacterized protein OS=Ovis aries GN=WRN PE=4 SV=1
37 : F1LTH9_RAT 0.73 0.89 1 113 1098 1210 113 0 0 1400 F1LTH9 Protein Wrn OS=Rattus norvegicus GN=Wrn PE=4 SV=2
38 : M3WUH5_FELCA 0.73 0.87 1 113 1189 1301 113 0 0 1494 M3WUH5 Uncharacterized protein OS=Felis catus GN=WRN PE=4 SV=1
39 : H0X0X3_OTOGA 0.72 0.85 1 112 1106 1217 112 0 0 1371 H0X0X3 Uncharacterized protein (Fragment) OS=Otolemur garnettii GN=WRN PE=4 SV=1
40 : Q8BWH5_MOUSE 0.70 0.88 1 113 133 245 113 0 0 436 Q8BWH5 Putative uncharacterized protein (Fragment) OS=Mus musculus GN=Wrn PE=2 SV=1
41 : WRN_MOUSE 2E6L 0.70 0.88 1 113 1098 1210 113 0 0 1401 O09053 Werner syndrome ATP-dependent helicase homolog OS=Mus musculus GN=Wrn PE=1 SV=3
42 : I3MLC3_SPETR 0.69 0.82 1 113 1133 1247 115 1 2 1441 I3MLC3 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=WRN PE=4 SV=1
43 : F7CIS0_ORNAN 0.62 0.83 3 109 989 1095 107 0 0 1333 F7CIS0 Uncharacterized protein OS=Ornithorhynchus anatinus GN=WRN PE=4 SV=2
44 : G3VTA2_SARHA 0.61 0.81 13 113 1101 1201 101 0 0 1387 G3VTA2 Uncharacterized protein OS=Sarcophilus harrisii GN=WRN PE=4 SV=1
45 : G3VTA3_SARHA 0.61 0.81 13 113 1147 1247 101 0 0 1393 G3VTA3 Uncharacterized protein OS=Sarcophilus harrisii GN=WRN PE=4 SV=1
46 : F7AMX7_MONDO 0.60 0.78 13 112 1054 1153 100 0 0 1333 F7AMX7 Uncharacterized protein OS=Monodelphis domestica GN=WRN PE=4 SV=2
47 : Q6GPM6_XENLA 0.60 0.85 8 109 1088 1189 102 0 0 1434 Q6GPM6 FFA-1 protein OS=Xenopus laevis GN=FFA-1 PE=2 SV=1
48 : V8PG70_OPHHA 0.60 0.83 19 111 954 1045 93 1 1 1191 V8PG70 Werner syndrome ATP-dependent helicase (Fragment) OS=Ophiophagus hannah GN=WRN PE=4 SV=1
49 : F1NAR0_CHICK 0.59 0.80 11 113 1125 1227 103 0 0 1497 F1NAR0 Uncharacterized protein OS=Gallus gallus GN=WRN PE=4 SV=2
50 : D3KR65_CHICK 0.58 0.79 11 113 1125 1227 103 0 0 1498 D3KR65 WRN helicase OS=Gallus gallus GN=WRN PE=2 SV=1
51 : K7G3X4_PELSI 0.58 0.83 8 113 1111 1216 106 0 0 1490 K7G3X4 Uncharacterized protein OS=Pelodiscus sinensis GN=WRN PE=4 SV=1
52 : W5KRY0_ASTMX 0.58 0.75 1 97 1051 1147 97 0 0 1409 W5KRY0 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
53 : H2ZWI5_LATCH 0.57 0.79 10 111 1114 1215 102 0 0 1469 H2ZWI5 Uncharacterized protein (Fragment) OS=Latimeria chalumnae PE=4 SV=2
54 : M7B4V2_CHEMY 0.57 0.85 9 113 1014 1118 105 0 0 1551 M7B4V2 Werner syndrome ATP-dependent helicase (Fragment) OS=Chelonia mydas GN=UY3_09899 PE=4 SV=1
55 : Q28D23_XENTR 0.57 0.83 1 100 1072 1171 100 0 0 1171 Q28D23 Werner syndrome homolog (Human) (Fragment) OS=Xenopus tropicalis GN=wrn PE=2 SV=1
56 : W5MXX5_LEPOC 0.57 0.82 7 104 1063 1160 98 0 0 1382 W5MXX5 Uncharacterized protein OS=Lepisosteus oculatus PE=4 SV=1
57 : W5MXW8_LEPOC 0.56 0.81 7 106 1084 1183 100 0 0 1464 W5MXW8 Uncharacterized protein OS=Lepisosteus oculatus PE=4 SV=1
58 : WRN_XENLA 0.56 0.82 1 109 1081 1189 109 0 0 1436 O93530 Werner syndrome ATP-dependent helicase homolog OS=Xenopus laevis GN=wrn PE=2 SV=1
59 : F7BIE2_XENTR 0.55 0.80 1 100 1072 1173 102 1 2 1173 F7BIE2 Uncharacterized protein OS=Xenopus tropicalis GN=wrn PE=4 SV=1
60 : F7E8G8_XENTR 0.55 0.81 1 113 1096 1208 113 0 0 1400 F7E8G8 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=wrn PE=4 SV=1
61 : H9G760_ANOCA 0.55 0.81 2 111 1093 1202 110 0 0 1315 H9G760 Uncharacterized protein OS=Anolis carolinensis GN=WRN PE=4 SV=1
62 : R0L7C8_ANAPL 0.54 0.78 1 112 1029 1140 112 0 0 1340 R0L7C8 Werner syndrome ATP-dependent helicase-like protein (Fragment) OS=Anas platyrhynchos GN=Anapl_03850 PE=4 SV=1
63 : U3I2H1_ANAPL 0.54 0.78 1 112 1038 1149 112 0 0 1387 U3I2H1 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=WRN PE=4 SV=1
64 : G1N0H1_MELGA 0.53 0.75 4 113 1120 1231 112 1 2 1501 G1N0H1 Uncharacterized protein (Fragment) OS=Meleagris gallopavo GN=WRN PE=4 SV=2
65 : E7FCY8_DANRE 0.50 0.75 7 113 1031 1137 107 0 0 1387 E7FCY8 Uncharacterized protein OS=Danio rerio GN=wrn PE=4 SV=1
66 : E9QGF6_DANRE 0.50 0.75 7 113 1080 1186 107 0 0 1436 E9QGF6 Uncharacterized protein OS=Danio rerio GN=wrn PE=4 SV=2
67 : V9KG59_CALMI 0.48 0.76 1 113 524 636 113 0 0 831 V9KG59 Werner syndrome ATP-dependent helicase-like protein (Fragment) OS=Callorhynchus milii PE=2 SV=1
68 : Q71J84_LACDL 0.35 0.56 1 90 21 113 93 2 3 114 Q71J84 ATP-dependent DNA helicase recQ (Fragment) OS=Lactobacillus delbrueckii subsp. lactis PE=4 SV=1
69 : I1F3R8_AMPQE 0.34 0.63 3 95 546 640 95 2 2 847 I1F3R8 Uncharacterized protein OS=Amphimedon queenslandica PE=4 SV=1
70 : W4ZGZ4_STRPU 0.34 0.70 1 108 7 115 109 1 1 346 W4ZGZ4 Uncharacterized protein OS=Strongylocentrotus purpuratus GN=Sp-WrnL_2 PE=4 SV=1
71 : I1EHQ8_AMPQE 0.33 0.65 4 95 205 298 94 2 2 502 I1EHQ8 Uncharacterized protein (Fragment) OS=Amphimedon queenslandica PE=4 SV=1
72 : I1F3T2_AMPQE 0.33 0.65 3 95 707 801 95 2 2 1005 I1F3T2 Uncharacterized protein OS=Amphimedon queenslandica PE=4 SV=1
73 : B3RTQ9_TRIAD 0.31 0.62 8 105 749 847 99 1 1 1020 B3RTQ9 Putative uncharacterized protein OS=Trichoplax adhaerens GN=TRIADDRAFT_56013 PE=4 SV=1
74 : F6Z6H7_CIOIN 0.31 0.57 19 107 853 943 91 2 2 1194 F6Z6H7 Uncharacterized protein OS=Ciona intestinalis GN=LOC100179894 PE=4 SV=2
75 : H2ZHZ6_CIOSA 0.31 0.57 9 105 230 328 99 2 2 580 H2ZHZ6 Uncharacterized protein OS=Ciona savignyi GN=Csa.1405 PE=4 SV=1
76 : F8JB96_HYPSM 0.30 0.56 2 100 530 628 100 2 2 727 F8JB96 ATP-dependent DNA helicase RecQ OS=Hyphomicrobium sp. (strain MC1) GN=recQ PE=4 SV=1
77 : H5SK29_9BACT 0.30 0.59 10 100 514 603 92 2 3 717 H5SK29 ATP-dependent DNA helicase RecQ OS=uncultured Bacteroidetes bacterium GN=HGMM_F40B03C03 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1133 A G 0 0 139 41 37 SS SAS SS SS SSSSSSSSSSSSSS SSSSSSS S T TTT SS GG G
2 1134 A S - 0 0 119 44 33 PP PPP PP PPP PPPPPPPPPPPPPP PPPPPPS Q P PPPPPP TR A
3 1135 A S - 0 0 122 48 79 EE EEE EE GRG RRGGGRGGRGGGGGPGGGEGGEP A C CCCLVV LQGT
4 1136 A G - 0 0 81 50 63 NN KKK KK KKN KKKKKKKKKKKKKKAKTKKTTKE P K KKKARRS EGGE
5 1137 A S - 0 0 127 50 68 AA SSA SS SYT SSSSSYPPYSSSSSSYSSSSSSP S P LPPPSSP KQED
6 1138 A S - 0 0 117 50 85 YY YYY YY YYY YYYYYYHHYYYYYYFYYYYSSYY A L SLLVPPL SSVI
7 1139 A G - 0 0 66 58 76 SSSSS SSSS SS SMG KKKKNKKKKKKKGKKKSKTSSSK K QPPRQQSLLRRRAKGV
8 1140 A S - 0 0 130 67 56 SSSSSSSSSSASSSSSFSSSSSSSSSSSSSSSSSSSSSSPPFS P SP ASSPAAPKKSAAPRGA
9 1141 A S - 0 0 107 69 65 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSWSRLLSA P PQ PAHHPAATSSRQQSRAT
10 1142 A Q - 0 0 152 71 55 QQEEEEEEEEEEEEEEEEGEEEEEEEEEEEEEEEEEEEEEEDK E EPQEEAAEEEVRRVTTMQNQ
11 1143 A P - 0 0 110 73 45 PPPPPPPPPPPPPPPPPPAPPPPPLPPPPPPPPPPPPPPPPPT P PPPPPPPPPPPPPEESPPVEVK
12 1144 A V - 0 0 131 73 64 VVVVVVVVVIVVAAVVVVVVVVVVVVVVVVVVAVVVVVVAAVA E VVAAPADPPEDDAPPVGGAVPP
13 1145 A I - 0 0 49 76 42 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIITIIVIILIIIVV VVVVVVVVVVVVVVVVAAVKIV
14 1146 A S >> - 0 0 55 76 35 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSLSSSSLSSLSSSSS SSSSSSSSSSSSSSSSSSSSKD
15 1147 A A H 3>>S+ 0 0 56 76 60 AAAAAAAAAAAASSAAAAAAAAAASAAAAAAAAATTAAAAAASSSSP SSSAPSPPPPPPQSSSAASEDP
16 1148 A Q H 345S+ 0 0 160 76 60 QQQQQQQQQQQQQQQQQQQQQQQQKQQQQEEEQQQQQEQQQQSKKKR RRRRRRKRRRKKRPPQKKRQKK
17 1149 A E H <>5S+ 0 0 92 76 16 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEEEEEDDEEEEEEEEEELEE
18 1150 A Q H X5S+ 0 0 67 76 75 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLQQQVQQLLQPQQQM NNKLLKMLLRMMKKKNIILATQ
19 1151 A E H X5S+ 0 0 124 78 23 EEEEEEEEEEEKEEEEEEKEEEEEEEEEEEEEDEEEDKVDDEDQQEEEEEEEEEEEEEEEEEEKQQEEEE
20 1152 A T H >X S+ 0 0 0 78 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
25 1157 A Y H 3X S+ 0 0 69 78 3 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFYY
26 1158 A G H 3X S+ 0 0 35 78 50 GGGGGGGGGGGGGGIIGGGGGGGGGGGGGGGGAGGGASAAARGCCCGGGGGGGSGSSGGGGGGGGGAEES
27 1159 A K H X S+ 0 0 98 78 25 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQRNN
34 1166 A K H 3X S+ 0 0 159 78 25 KKKKKKKKKKKRKKKKKKKKKKKKKKKKKKKKKKKKKEKKKRKKKKKKKKKKKKKKKKKKKKKKKKKIQN
35 1167 A H H 3X S+ 0 0 17 78 81 HHHHHHHHHHQHHHQQHHHQRRLLYHQQHHHHHLQHHHQHHHYQQQILVVLLLLILLIIILVVVLLILII
36 1168 A A H <<>S+ 0 0 0 78 1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
37 1169 A N H ><5S+ 0 0 114 78 61 NNNNNNNNNNNNNNNNNNNrHHNNSNSSNGGGNNNNNNNNNNNSSSSSNNSSTSSTTSSSNNNNSSSQHS
38 1170 A K H 3<5S+ 0 0 143 78 63 KKKKKKKKKKKKKKKKKKKkNNNNKMKKMKKKKNKTEQEKKIEEEEEEEEEIAEEEEEEEEEEEIIEKEE
39 1171 A M T 3<5S- 0 0 55 78 68 MMMMMMMMMMMMMMMMMMRMMMLLMMMMMMMMMLMMVKMMMMKKKKRIKKKKRRRKKRRRKKKKRRKRTM
40 1172 A D T < 5 + 0 0 140 78 27 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDDDGDDDDDDDDDDVVDDDDDDDDDDDAAVDDDGDG
41 1173 A V < - 0 0 40 78 22 VVVVVVVVVVVVVVIIVVVVVVIIVVVVVVVVVIVVVVVVVVIIIIIIIIIVIIIIIIIIIVVIIIVVIA
42 1174 A P >> - 0 0 74 78 8 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPLPPPPPPPPPPPA
43 1175 A P H 3> S+ 0 0 58 78 0 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
44 1176 A A H 34 S+ 0 0 64 78 46 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFFY
45 1177 A I H <4 S+ 0 0 107 78 30 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVIVIIIILLLVVVVVIVVVVVVVVVVVVLLMVMM
46 1178 A L H < S- 0 0 1 78 18 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLMMLLLLLLLLLLIIV
47 1179 A A < - 0 0 1 78 11 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAFAA
48 1180 A T >> - 0 0 71 78 34 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSSN
49 1181 A N H 3> S+ 0 0 82 78 7 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDNN
50 1182 A K H 3> S+ 0 0 99 78 9 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKLKK
51 1183 A I H <> S+ 0 0 8 78 42 IIIIIIIIIIIIIIIIIILIIIIIIIIIIIIIIIIIVIVVVIVVVVVVIIIIIIVIIVVVVIIIIIISLN
52 1184 A L H X S+ 0 0 0 78 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
53 1185 A V H X S+ 0 0 60 78 40 VVVVVVVLVVVLVVLLIVVLLLLLVLLLLLLLLLLVLLVLLLVVVVVVVVVLLVVLLVVVVVVVLLLRGL
54 1186 A D H >X>S+ 0 0 18 78 20 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDEEEDDDDDDDDDDDDDDDDEEDDDDDDDDDDDEEEDDEDID
55 1187 A M H 3X5S+ 0 0 0 78 8 MMMMMMMMMMMLMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMLMMMMMMMMMMMMMFMLL
56 1188 A A H 3<5S+ 0 0 4 78 24 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAATAAASSAASA
57 1189 A K H <<5S+ 0 0 92 78 43 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKKKKVRRKKKKKKKKKKTRRRKKKALV
58 1190 A M H <5S- 0 0 47 78 56 MMMMMMMMMMMMMMMMMMMMMMMMIMMMMMMMMMIIMDVMMMMMMMLIMMIILILIILLLIIITLLIRSI
59 1191 A R << - 0 0 12 78 11 RRRRRRRRRRrRRRRRRRRRRRRRRRRRRRRRRRsRRRRRRsRRRRRRRRRRRRRRRRRRRRRrRRRKRR
60 1192 A P + 0 0 0 78 0 PPPPPPPPPPpPPPPPPPPPPPPPPPPPPPPPPPpPPPPPPpPPPPPPPPPPPPPPPPPPPPPpPPPPPP
61 1193 A T + 0 0 19 78 34 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSSSTTTTTCTTTTTTTTTTTTCCTQTA
62 1194 A T S >> S- 0 0 49 78 14 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTTTTTS
63 1195 A V H 3> S+ 0 0 35 78 62 VVVVVVVVVVVVVVFFVVVFVVVVVVFFVFFFAVVVVVVVVVVLLLSVVVVVMVIVVSIIIVVVMMAPEI
64 1196 A E H 3> S+ 0 0 117 78 28 EEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEQEEEEEEEEALEEEEEEEEEEESSDEEK
65 1197 A N H X> S+ 0 0 5 78 41 NNNNNNNNNNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSSSNINNNRNNNNNNNNSNNNSSNESS
66 1198 A V H >< S+ 0 0 0 78 31 VVIIVVVVIVVVIIVVVVLVVVVVVVVVVMMMVVVVVVIMMVLLLLMMVVVLLVMLLMMMIVVVLLLLLM
67 1199 A K H 3< S+ 0 0 64 78 38 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKIKKKKKKKKKKKKKKKKKKKKKKKKLRL
68 1200 A R H << S+ 0 0 124 78 55 RRRRRRRRRRRQRRRRRRLRRRRRRRRRRRRRQRRRQRKQQQTNNNKRRRRQRRKMMKKKRRRRQQRQLK
69 1201 A I S << S- 0 0 4 78 16 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIILIIIIVIILVVLLLIIIIVVVCII
70 1202 A D S S+ 0 0 102 78 21 DDDDDDDDDDDDDDDDDDDDDDNNDDDDDNNNDNDDDDDDDDDDDDDDDDDDDDDDDDDDDEEDDDDSDD
71 1203 A G S S+ 0 0 50 78 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
72 1204 A V - 0 0 14 78 9 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVVVVVVVVVVVVVVVVVVVVVI
73 1205 A S >> - 0 0 76 78 12 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPGSA
74 1206 A E T 34 S+ 0 0 180 78 29 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEEEEEDQQ
75 1207 A G T >4 S+ 0 0 36 78 49 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAQA
76 1208 A K G X> S+ 0 0 67 78 23 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKEKKKKKKKKKKKKKKKKKKFR
77 1209 A A G >< S+ 0 0 0 78 57 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASCCCSSAASASSSSSSSSSSSSAASLIA
78 1210 A A G X4 S+ 0 0 51 78 63 AAAAAAMAAAATAANNAMTNTTTTIVNNVTTTTTTVALAAATTTTTATTTANSAASSAAAATTTAASATD
79 1211 A M G <4 S+ 0 0 66 78 41 MMMMMMMVMMMMMMMMMMMMMMMMMMMMMMMMLMMMLMMLLVMMMMMMMMMMMMMMMMMMMMMMMMVNKN
80 1212 A L G + 0 0 25 78 48 AAAAAAAAAAAAAATTAAATAAAAAATTATTTDATAAATAAAAAASAAVVVAAVAAAAAAEVVGTTAggg
82 1214 A P H > S+ 0 0 51 78 37 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPLLPPPPPPPPPPSPPLPPPakq
83 1215 A L H >> S+ 0 0 0 78 10 LLLLLLLLLLLLLPLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLMLF
84 1216 A W H 3X S+ 0 0 21 78 17 WLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTLLLLLLVVLLLLLLLLLVLLLVVLLLLIILLLI
85 1217 A E H 3X S+ 0 0 101 78 45 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEENAAHEDDEEEEEEIAAADDVATE
86 1218 A V H S+ 0 0 5 77 20 CCCCCCCCCCCCCCCCCCCCSSCCCCCCCCCCCCCCCCCCCCCSSSCSCCCCACCCCCCCSCCCCCC CC
92 1224 A Q H <5S+ 0 0 149 77 58 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQIQQQQIQQQQQQHICCIIIQQQQQQE TK
93 1225 A T T <5S+ 0 0 90 77 77 TTTTTTITTTTATTIIVIEITTIVVIIIIIIIVIVIVKLVVETTTMATAAEKIAASSAAATAAATTA AE
94 1226 A N T 5S- 0 0 97 77 25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTNNNNNNNNNNYNNNNNNNNNNNNHHN HN
95 1227 A S T 5 + 0 0 118 77 57 SSSSSSSSSSSSNNSSSSSSSSSSSSSSSNNNSSSSSSSSSSNNNNSNGGNNKNSGGSSSNGGGGGE PD
96 1228 A V < - 0 0 32 74 39 VVVVVVVVVVVVVVVVVVVIIIVVIVIIVVVVVVLVVIVVVILLLLLFLLLLLLLLLLLLLLLLLLL I
97 1229 A Q + 0 0 145 74 38 QQQQQQQQQQQQQQQQEQQQQQQQQQQQQQQQQQQQQQQQQQQPPLKQQQEQEEKQQKKKQQQQQQE Q
98 1230 A T + 0 0 19 73 54 TTTTTTTTTTTATTTTTTTTTTTTTTTTTSSSTTTTTTTTTTTTTTVTTTT VTMVVVVMTTTTVVA L
99 1231 A D + 0 0 65 73 19 DDDDDDNDDDDDDDDDDNDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD DDDNNDSDDDDDDDD D
100 1232 A L + 0 0 76 73 53 LLLLLLVLLLLLLLLLLVLLLLLLLLLLLLLLLLILLLLLLFLLLLVIIIT KTIMMVIIIAAIVVM N
101 1233 A F - 0 0 38 69 12 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFLLFFFFFFFFFF FF LLF FFFFFSSF F
102 1234 A S S S+ 0 0 115 69 41 SSSSSSSSSSSSSSSSPSSSPPSSSSSSSSSSSSSSAPPSSSSSSSSKPPP SP SSS SKPPPSSS P
103 1235 A S S S+ 0 0 76 69 63 SSSSSSSSSSTNSSRRTSSRSSSSSSRRSSSSSSSSSNSSSNSSSSGNKKT ST SSG RTTTKNNL A
104 1236 A T - 0 0 85 69 64 TTTTTTATTTTTTTTTAATTTTTTSTTTSTTTTTTTTTIAAISPPPSSPPC LC TTS SSDDPSSS S
105 1237 A K - 0 0 101 68 73 KKEEKKKKEKKKKKKKKKKKKKNNKEKKEKKKKNKEKKKKKIRSSSVGEEG DG PV GGQQEAAN A
106 1238 A P - 0 0 118 66 57 PPPPPPPPPPPSPPPPPPPPPPPPPLPPLPPPPPPLPLPPPPPPPPSMSSS SS SS SSKKSSSQ P
107 1239 A Q + 0 0 194 65 65 QQQQQQQQQQQQQQQQQQQQQQQPQQQQQWWWQQQQQQQHHQKGGGQRTTR NR Q QQEETSST S
108 1240 A S - 0 0 115 64 58 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKKEELLLSDDDE AE S SDAADVVA S
109 1241 A G - 0 0 58 63 41 EEEEEEEEEEEQEEEEEEEEQQEEEEEEEEEEEEEEDKEEEEEKKEEQQQK EK E EQSSKQQQ
110 1242 A P - 0 0 122 60 52 QQQQQQQQQQQKQQQQQQQQQQQQQEQQEPPPQQQQEQQQQQ EEE PKKQ QQ LPHHKTTQ
111 1243 A S + 0 0 120 60 56 KKKKKKKKKKKKKKRRKKKRKKKKKKRRKRRRKKKREGKEEK KKK DEEP KP TNRREHHK
112 1244 A S 0 0 126 54 66 TT TTKT TMSTTNNEKKNSSKKKKNNKNNNKKKKKNEKKK NNK TTE A S KKTSST
113 1245 A G 0 0 131 49 25 SS SSSS SS SSSSSSSSPPSSSNSSNSSSSSSSSS SSS SS SSA A S SSST
## ALIGNMENTS 71 - 77
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1133 A G 0 0 139 41 37
2 1134 A S - 0 0 119 44 33 S
3 1135 A S - 0 0 122 48 79 G A
4 1136 A G - 0 0 81 50 63 GG K
5 1137 A S - 0 0 127 50 68 AA S
6 1138 A S - 0 0 117 50 85 VV S
7 1139 A G - 0 0 66 58 76 GG K
8 1140 A S - 0 0 130 67 56 GGS R
9 1141 A S - 0 0 107 69 65 AAE SS
10 1142 A Q - 0 0 152 71 55 NNN NAE
11 1143 A P - 0 0 110 73 45 VVV VSP
12 1144 A V - 0 0 131 73 64 PPV VAT
13 1145 A I - 0 0 49 76 42 IIE KSE
14 1146 A S >> - 0 0 55 76 35 KKD KPE
15 1147 A A H 3>>S+ 0 0 56 76 60 DDP AKL
16 1148 A Q H 345S+ 0 0 160 76 60 KKA HVH
17 1149 A E H <>5S+ 0 0 92 76 16 EEE EAL
18 1150 A Q H X5S+ 0 0 67 76 75 TTV VRH
19 1151 A E H X5S+ 0 0 124 78 23 EEEDDEN
20 1152 A T H >X S+ 0 0 0 78 0 LLLLLLL
25 1157 A Y H 3X S+ 0 0 69 78 3 YYYYYFL
26 1158 A G H 3X S+ 0 0 35 78 50 EETRRDA
27 1159 A K H X S+ 0 0 98 78 25 NNNNNQK
34 1166 A K H 3X S+ 0 0 159 78 25 QQKEEKK
35 1167 A H H 3X S+ 0 0 17 78 81 MMVIILL
36 1168 A A H <<>S+ 0 0 0 78 1 AAAAAAA
37 1169 A N H ><5S+ 0 0 114 78 61 HHNQQAQ
38 1170 A K H 3<5S+ 0 0 143 78 63 EEEDDSE
39 1171 A M T 3<5S- 0 0 55 78 68 TTIMMAY
40 1172 A D T < 5 + 0 0 140 78 27 DDGDDKQ
41 1173 A V < - 0 0 40 78 22 IICIILV
42 1174 A P >> - 0 0 74 78 8 PPAPPPP
43 1175 A P H 3> S+ 0 0 58 78 0 PPPPPPP
44 1176 A A H 34 S+ 0 0 64 78 46 FFYHHYY
45 1177 A I H <4 S+ 0 0 107 78 30 MMMLLVV
46 1178 A L H < S- 0 0 1 78 18 IIVVVVI
47 1179 A A < - 0 0 1 78 11 AAAAAAF
48 1180 A T >> - 0 0 71 78 34 SSNNNQQ
49 1181 A N H 3> S+ 0 0 82 78 7 NNNNNDE
50 1182 A K H 3> S+ 0 0 99 78 9 KKKKKKP
51 1183 A I H <> S+ 0 0 8 78 42 LLNDDTT
52 1184 A L H X S+ 0 0 0 78 0 LLLLLLL
53 1185 A V H X S+ 0 0 60 78 40 GGLLLII
54 1186 A D H >X>S+ 0 0 18 78 20 NNEEEEE
55 1187 A M H 3X5S+ 0 0 0 78 8 LLILLLM
56 1188 A A H 3<5S+ 0 0 4 78 24 SSASSSA
57 1189 A K H <<5S+ 0 0 92 78 43 LLTKKET
58 1190 A M H <5S- 0 0 47 78 56 SSIAAKY
59 1191 A R << - 0 0 12 78 11 RRRRRRF
60 1192 A P + 0 0 0 78 0 PPPPPPP
61 1193 A T + 0 0 19 78 34 TTNSSTF
62 1194 A T S >> S- 0 0 49 78 14 TASTSTT
63 1195 A V H 3> S+ 0 0 35 78 62 EEEKNEV
64 1196 A E H 3> S+ 0 0 117 78 28 DEDDTSE
65 1197 A N H X> S+ 0 0 5 78 41 SSNRNAE
66 1198 A V H >< S+ 0 0 0 78 31 LLLLLLL
67 1199 A K H 3< S+ 0 0 64 78 38 RRLLLHE
68 1200 A R H << S+ 0 0 124 78 55 LLKRRDR
69 1201 A I S << S- 0 0 4 78 16 IIIVVII
70 1202 A D S S+ 0 0 102 78 21 DDEDDMA
71 1203 A G S S+ 0 0 50 78 0 GGGGGGG
72 1204 A V - 0 0 14 78 9 VVIMMLV
73 1205 A S >> - 0 0 76 78 12 SSSSSGG
74 1206 A E T 34 S+ 0 0 180 78 29 QQNVVAP
75 1207 A G T >4 S+ 0 0 36 78 49 QQTVVSV
76 1208 A K G X> S+ 0 0 67 78 23 FFRKKKK
77 1209 A A G >< S+ 0 0 0 78 57 IICVVIA
78 1210 A A G X4 S+ 0 0 51 78 63 TTAKKAR
79 1211 A M G <4 S+ 0 0 66 78 41 KKKRRRK
80 1212 A L G + 0 0 25 78 48 ggggggg
82 1214 A P H > S+ 0 0 51 78 37 kkvpqap
83 1215 A L H >> S+ 0 0 0 78 10 LLFVVFF
84 1216 A W H 3X S+ 0 0 21 78 17 LLVLLLL
85 1217 A E H 3X S+ 0 0 101 78 45 TTKVAEE
86 1218 A V H S+ 0 0 5 77 20 CCCCCKV
92 1224 A Q H <5S+ 0 0 149 77 58 TTTTSKE
93 1225 A T T <5S+ 0 0 90 77 77 AAKEEHE
94 1226 A N T 5S- 0 0 97 77 25 HHNHNPN
95 1227 A S T 5 + 0 0 118 77 57 PPNENAE
96 1228 A V < - 0 0 32 74 39 FAALI
97 1229 A Q + 0 0 145 74 38 QSSNE
98 1230 A T + 0 0 19 73 54 TFMNR
99 1231 A D + 0 0 65 73 19 NDDRP
100 1232 A L + 0 0 76 73 53 KNNLL
101 1233 A F - 0 0 38 69 12 FFF
102 1234 A S S S+ 0 0 115 69 41 PDE
103 1235 A S S S+ 0 0 76 69 63 DDE
104 1236 A T - 0 0 85 69 64 TED
105 1237 A K - 0 0 101 68 73 SDE
106 1238 A P - 0 0 118 66 57 E
107 1239 A Q + 0 0 194 65 65 D
108 1240 A S - 0 0 115 64 58
109 1241 A G - 0 0 58 63 41
110 1242 A P - 0 0 122 60 52
111 1243 A S + 0 0 120 60 56
112 1244 A S 0 0 126 54 66
113 1245 A G 0 0 131 49 25
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1133 A 0 0 0 0 0 0 0 10 2 0 78 10 0 0 0 0 0 0 0 0 41 0 0 0.738 24 0.63
2 1134 A 0 0 0 0 0 0 0 0 2 84 7 2 0 0 2 0 2 0 0 0 44 0 0 0.673 22 0.66
3 1135 A 4 4 0 0 0 0 0 40 4 4 2 2 8 0 10 0 2 19 0 0 48 0 0 1.895 63 0.20
4 1136 A 0 0 0 0 0 0 0 10 4 2 2 6 0 0 4 60 0 6 6 0 50 0 0 1.457 48 0.37
5 1137 A 0 2 0 0 0 0 8 0 10 16 54 2 0 0 0 2 2 2 0 2 50 0 0 1.528 50 0.32
6 1138 A 8 8 2 0 2 0 56 0 2 4 14 0 0 4 0 0 0 0 0 0 50 0 0 1.496 49 0.15
7 1139 A 2 3 0 2 0 0 0 10 2 3 29 2 0 0 7 33 5 0 2 0 58 0 0 1.880 62 0.23
8 1140 A 0 0 0 0 3 0 0 4 10 10 66 0 0 0 3 3 0 0 0 0 67 0 0 1.202 40 0.44
9 1141 A 0 3 0 0 0 1 0 0 10 6 62 4 0 3 4 0 4 1 0 0 69 0 0 1.429 47 0.35
10 1142 A 3 0 0 1 0 0 0 1 4 1 0 3 0 0 3 1 8 65 7 1 71 0 0 1.412 47 0.44
11 1143 A 8 1 0 0 0 0 0 0 1 79 3 1 0 0 0 1 0 4 0 0 73 0 0 0.853 28 0.55
12 1144 A 59 0 1 0 0 0 0 3 16 12 0 1 0 0 0 0 0 3 0 4 73 0 0 1.312 43 0.36
13 1145 A 28 1 61 0 0 0 0 0 3 0 1 1 0 0 0 3 0 3 0 0 76 0 0 1.117 37 0.57
14 1146 A 0 4 0 0 0 0 0 0 0 1 86 0 0 0 0 5 0 1 0 3 76 0 0 0.626 20 0.64
15 1147 A 0 1 0 0 0 0 0 0 55 13 20 3 0 0 0 1 1 1 0 4 76 0 0 1.366 45 0.40
16 1148 A 1 0 0 0 0 0 0 0 1 3 1 0 0 3 16 17 53 5 0 0 76 0 0 1.449 48 0.40
17 1149 A 0 3 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 93 0 3 76 0 0 0.312 10 0.84
18 1150 A 4 11 3 5 0 0 0 0 1 1 0 4 0 1 3 7 57 0 4 0 76 0 0 1.638 54 0.24
19 1151 A 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 5 78 1 9 78 0 0 0.825 27 0.77
20 1152 A 0 27 0 0 5 0 1 0 6 0 3 51 0 0 0 3 0 1 3 0 78 0 0 1.418 47 0.21
21 1153 A 0 4 0 0 0 0 0 1 0 0 0 1 0 5 5 0 74 9 0 0 78 3 7 0.978 32 0.61
22 1154 A 0 0 13 1 0 0 0 9 1 0 8 64 0 0 1 0 1 0 0 0 75 0 0 1.208 40 0.43
23 1155 A 45 3 1 6 0 0 0 3 21 4 1 6 0 0 0 3 0 6 0 0 77 0 0 1.741 58 0.28
24 1156 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
25 1157 A 0 1 0 0 3 0 96 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.188 6 0.96
26 1158 A 0 0 3 0 0 0 0 67 9 0 6 1 4 0 4 0 0 5 0 1 78 0 0 1.271 42 0.49
27 1159 A 0 3 4 0 0 0 0 0 1 0 0 0 0 0 10 65 1 15 0 0 78 0 0 1.130 37 0.49
28 1160 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
29 1161 A 73 18 0 5 0 0 0 0 0 0 0 0 0 0 3 1 0 0 0 0 78 0 0 0.840 28 0.65
30 1162 A 8 0 0 0 0 0 0 0 18 0 8 8 0 0 0 1 0 55 3 0 78 0 0 1.378 46 0.31
31 1163 A 0 6 0 1 0 0 4 3 79 0 1 0 0 0 0 0 0 5 0 0 78 0 0 0.842 28 0.50
32 1164 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 78 0 0 0.000 0 1.00
33 1165 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 88 0 9 0 78 0 0 0.437 14 0.74
34 1166 A 0 0 1 0 0 0 0 0 0 0 0 0 0 0 3 87 4 4 1 0 78 0 0 0.576 19 0.74
35 1167 A 8 21 13 3 0 0 3 0 0 0 0 0 0 37 3 0 14 0 0 0 78 0 0 1.712 57 0.19
36 1168 A 0 0 0 0 0 0 0 0 99 0 1 0 0 0 0 0 0 0 0 0 78 0 0 0.069 2 0.98
37 1169 A 0 0 0 0 0 0 0 4 1 0 24 4 0 6 1 0 5 0 54 0 78 0 1 1.368 45 0.39
38 1170 A 0 0 5 3 0 0 0 0 1 0 1 1 0 0 0 41 1 37 6 3 78 0 0 1.473 49 0.37
39 1171 A 1 4 3 51 0 0 1 0 1 0 0 4 0 0 14 21 0 0 0 0 78 0 0 1.456 48 0.32
40 1172 A 4 0 0 0 0 0 0 5 3 0 0 0 0 0 0 1 1 0 1 85 78 0 0 0.680 22 0.73
41 1173 A 56 1 40 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 78 0 0 0.857 28 0.77
42 1174 A 0 1 0 0 0 0 0 0 3 96 0 0 0 0 0 0 0 0 0 0 78 0 0 0.188 6 0.91
43 1175 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
44 1176 A 0 0 0 0 5 0 5 0 87 0 0 0 0 3 0 0 0 0 0 0 78 0 0 0.518 17 0.53
45 1177 A 28 9 55 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 1.099 36 0.69
46 1178 A 6 85 6 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.588 19 0.81
47 1179 A 0 0 0 0 3 0 0 0 97 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.119 3 0.89
48 1180 A 0 0 0 0 0 0 0 0 0 0 5 87 0 0 0 0 3 0 5 0 78 0 0 0.518 17 0.65
49 1181 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 96 3 78 0 0 0.188 6 0.93
50 1182 A 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 97 0 0 0 0 78 0 0 0.137 4 0.91
51 1183 A 19 5 67 0 0 0 0 0 0 0 1 3 0 0 0 0 0 0 3 3 78 0 0 1.077 35 0.57
52 1184 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
53 1185 A 46 45 4 0 0 0 0 4 0 0 0 0 0 0 1 0 0 0 0 0 78 0 0 1.023 34 0.60
54 1186 A 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 3 78 78 0 0 0.650 21 0.79
55 1187 A 0 12 1 86 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.491 16 0.92
56 1188 A 0 0 0 0 0 0 0 0 87 0 10 3 0 0 0 0 0 0 0 0 78 0 0 0.447 14 0.75
57 1189 A 3 4 0 0 0 0 0 0 1 0 0 4 0 0 8 79 0 1 0 0 78 0 0 0.836 27 0.56
58 1190 A 1 10 19 56 0 0 1 0 3 0 4 1 0 0 1 1 0 0 0 1 78 0 0 1.428 47 0.43
59 1191 A 0 0 0 0 1 0 0 0 0 0 3 0 0 0 95 1 0 0 0 0 78 0 4 0.256 8 0.88
60 1192 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
61 1193 A 0 0 0 0 1 0 0 0 1 0 6 85 4 0 0 0 1 0 1 0 78 0 0 0.666 22 0.65
62 1194 A 0 0 0 0 0 0 0 0 1 0 5 94 0 0 0 0 0 0 0 0 78 0 0 0.270 9 0.85
63 1195 A 60 4 6 4 10 0 0 0 3 1 3 0 0 0 0 1 0 6 1 0 78 0 0 1.497 49 0.38
64 1196 A 0 1 0 0 0 0 0 0 3 0 4 1 0 0 0 1 1 83 0 5 78 0 0 0.747 24 0.72
65 1197 A 0 0 1 0 0 0 1 0 1 0 13 0 0 0 3 0 0 3 78 0 78 0 0 0.811 27 0.58
66 1198 A 49 27 9 15 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 1.208 40 0.68
67 1199 A 0 6 1 0 0 0 0 0 0 0 0 0 0 1 4 86 0 1 0 0 78 0 0 0.600 20 0.61
68 1200 A 0 5 0 3 0 0 0 0 0 0 0 1 0 0 63 10 13 0 4 1 78 0 0 1.272 42 0.45
69 1201 A 10 6 82 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 78 0 0 0.628 20 0.84
70 1202 A 0 0 0 1 0 0 0 0 1 0 1 0 0 0 0 0 0 4 8 85 78 0 0 0.632 21 0.78
71 1203 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
72 1204 A 92 1 4 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.349 11 0.90
73 1205 A 0 0 0 0 0 0 0 4 1 1 94 0 0 0 0 0 0 0 0 0 78 0 0 0.299 9 0.87
74 1206 A 3 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 5 86 1 3 78 0 0 0.638 21 0.71
75 1207 A 4 0 0 0 0 0 0 55 35 0 1 1 0 0 0 0 4 0 0 0 78 0 0 1.058 35 0.51
76 1208 A 0 0 0 0 4 0 0 0 0 0 0 0 0 0 3 92 0 1 0 0 78 0 0 0.349 11 0.77
77 1209 A 3 1 5 0 0 0 0 0 64 0 22 0 5 0 0 0 0 0 0 0 78 0 0 1.072 35 0.42
78 1210 A 4 1 1 3 0 0 0 0 40 0 5 33 0 0 1 3 0 0 8 1 78 0 0 1.619 54 0.37
79 1211 A 4 5 0 78 0 0 0 0 0 0 0 0 0 0 4 6 0 0 3 0 78 0 0 0.865 28 0.59
80 1212 A 0 87 3 0 6 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.515 17 0.83
81 1213 A 8 0 0 0 0 0 0 14 59 0 1 15 0 0 0 0 0 1 0 1 78 0 10 1.241 41 0.51
82 1214 A 1 4 0 0 0 0 0 0 3 85 1 0 0 0 0 4 3 0 0 0 78 0 0 0.692 23 0.63
83 1215 A 3 90 0 1 5 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 78 0 0 0.455 15 0.90
84 1216 A 8 85 4 0 0 3 0 0 0 0 0 1 0 0 0 0 0 0 0 0 78 0 0 0.614 20 0.83
85 1217 A 3 0 1 0 0 0 0 0 9 0 0 4 0 1 0 1 0 73 1 6 78 0 0 1.064 35 0.54
86 1218 A 72 1 9 1 0 0 0 0 3 0 0 0 0 0 0 5 0 6 0 3 78 0 0 1.082 36 0.51
87 1219 A 13 0 87 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.383 12 0.93
88 1220 A 0 0 1 0 0 0 0 0 3 0 4 1 1 0 8 78 1 1 1 0 78 0 0 0.944 31 0.62
89 1221 A 0 0 1 0 0 0 0 1 3 0 5 0 0 41 4 8 9 21 0 8 78 0 0 1.785 59 0.31
90 1222 A 0 3 0 0 90 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.388 12 0.97
91 1223 A 1 0 0 0 0 0 0 0 1 0 9 0 87 0 0 1 0 0 0 0 77 0 0 0.508 16 0.79
92 1224 A 0 0 8 0 0 0 0 0 0 0 1 6 3 3 0 3 74 3 0 0 77 0 0 1.035 34 0.41
93 1225 A 10 1 21 1 0 0 0 0 21 0 3 29 0 1 0 4 0 9 0 0 77 0 0 1.855 61 0.23
94 1226 A 0 0 0 0 0 0 1 0 0 1 0 3 0 8 0 0 0 0 87 0 77 0 0 0.528 17 0.74
95 1227 A 0 0 0 0 0 0 0 12 1 4 56 0 0 0 0 1 0 4 21 1 77 0 0 1.325 44 0.42
96 1228 A 46 35 14 0 3 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 74 0 0 1.190 39 0.61
97 1229 A 0 1 0 0 0 0 0 0 0 3 3 0 0 0 0 7 77 8 1 0 74 0 0 0.898 29 0.62
98 1230 A 11 1 0 4 1 0 0 0 3 0 4 73 0 0 1 0 0 0 1 0 73 0 0 1.071 35 0.46
99 1231 A 0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 0 0 7 89 73 0 0 0.463 15 0.81
100 1232 A 8 62 12 4 1 0 0 0 3 0 0 3 0 0 0 3 0 0 4 0 73 0 0 1.378 46 0.46
101 1233 A 0 6 0 0 91 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 69 0 0 0.351 11 0.88
102 1234 A 0 0 0 0 0 0 0 0 1 20 72 0 0 0 0 3 0 1 0 1 69 0 0 0.844 28 0.58
103 1235 A 0 1 0 0 0 0 0 3 1 0 58 10 0 0 9 4 0 1 9 3 69 0 0 1.499 50 0.36
104 1236 A 0 1 3 0 0 0 0 0 7 9 17 54 3 0 0 0 0 1 0 4 69 0 0 1.505 50 0.35
105 1237 A 3 0 1 0 0 0 0 7 4 1 6 0 0 0 1 49 3 15 6 3 68 0 0 1.793 59 0.27
106 1238 A 0 6 0 2 0 0 0 0 0 65 21 0 0 0 0 3 2 2 0 0 66 0 0 1.074 35 0.43
107 1239 A 0 0 0 0 0 5 0 5 0 2 5 6 0 3 5 2 63 3 2 2 65 0 0 1.501 50 0.35
108 1240 A 3 5 0 0 0 0 0 0 6 0 8 0 0 0 0 3 0 67 0 8 64 0 0 1.199 40 0.41
109 1241 A 0 0 0 0 0 0 0 2 0 0 3 0 0 0 0 10 16 68 0 2 63 0 0 1.018 33 0.58
110 1242 A 0 2 0 0 0 0 0 0 0 10 0 3 0 3 0 7 65 10 0 0 60 0 0 1.216 40 0.48
111 1243 A 0 0 0 0 0 0 0 2 0 3 2 2 0 3 18 57 0 10 2 2 60 0 0 1.431 47 0.44
112 1244 A 0 0 0 2 0 0 0 0 2 0 13 22 0 0 0 35 0 6 20 0 54 0 0 1.599 53 0.33
113 1245 A 0 0 0 0 0 0 0 2 4 4 84 2 0 0 0 0 0 0 4 0 49 0 0 0.700 23 0.75
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
11 54 1249 2 rHRp
20 31 41 3 rKASk
35 58 1160 2 sHRp
42 60 1192 2 sDRp
59 22 1093 2 qAAt
64 57 1176 2 rTRp
68 22 42 2 gGDq
68 82 104 1 gKa
69 20 565 1 lEs
69 80 626 1 gMk
70 82 88 1 gAq
71 19 223 1 lEs
71 79 284 1 gMk
72 20 726 1 lEs
72 80 787 1 gMk
73 75 823 1 gAv
74 4 856 1 hSg
74 64 917 1 gEp
75 14 243 1 hSg
75 74 304 1 gLq
76 80 609 1 gAa
77 71 584 1 gPp
//
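The INSERTION LIST rows above follow the convention from the NOTATION block: a pair of lower-case characters brackets the inserted residues. A minimal parsing sketch follows, assuming whitespace-separated fields in the order AliNo, IPOS, JPOS, Len, Sequence; the function name and the dictionary keys are hypothetical.

```python
def parse_insertion(line: str) -> dict:
    """Parse one INSERTION LIST row such as '11 54 1249 2 rHRp'."""
    ali_no, ipos, jpos, length, sequence = line.split()
    # The first and last characters are the lower-case flanking residues;
    # everything between them is the inserted segment.
    inserted = sequence[1:-1]
    record = {
        "ali_no": int(ali_no),
        "ipos": int(ipos),
        "jpos": int(jpos),
        "len": int(length),
        "flank_left": sequence[0].upper(),
        "flank_right": sequence[-1].upper(),
        "inserted": inserted,
    }
    if len(inserted) != record["len"]:
        raise ValueError(f"Len does not match inserted segment: {line!r}")
    return record


if __name__ == "__main__":
    # Two rows copied from the list above.
    for row in ("11 54 1249 2 rHRp", "20 31 41 3 rKASk"):
        print(parse_insertion(row))
```

The length check relies on Len counting only the residues between the two flanking characters, which holds for every row in this list.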