Complet list of 1epi hssp file
Complete list of 1epi.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1EPI
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-28
HEADER GROWTH FACTOR 24-MAR-92 1EPI
COMPND MOL_ID: 1; MOLECULE: EPIDERMAL GROWTH FACTOR; CHAIN: A; ENGINEERED: YE
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: MUS MUSCULUS; ORGANISM_COMMON: HOUSE M
AUTHOR D.KOHDA,F.INAGAKI
DBREF 1EPI A 1 53 UNP P01132 EGF_MOUSE 977 1029
SEQLENGTH 53
NCHAIN 1 chain(s) in 1EPI data set
NALIGN 90
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : EGF_MOUSE 1.00 1.00 1 53 977 1029 53 0 0 1217 P01132 Pro-epidermal growth factor OS=Mus musculus GN=Egf PE=1 SV=2
2 : Q3UWD7_MOUSE 1.00 1.00 1 53 476 528 53 0 0 716 Q3UWD7 Putative uncharacterized protein OS=Mus musculus GN=Egf PE=2 SV=1
3 : Q8VD07_MOUSE 1.00 1.00 1 53 960 1012 53 0 0 1200 Q8VD07 Egf protein OS=Mus musculus GN=Egf PE=2 SV=1
4 : I3MW38_SPETR 0.79 0.91 1 53 920 972 53 0 0 1118 I3MW38 Uncharacterized protein (Fragment) OS=Spermophilus tridecemlineatus GN=EGF PE=4 SV=1
5 : EGF_RAT 0.77 0.85 1 53 974 1026 53 0 0 1133 P07522 Pro-epidermal growth factor OS=Rattus norvegicus GN=Egf PE=1 SV=2
6 : F1M959_RAT 0.77 0.85 1 53 974 1026 53 0 0 1085 F1M959 Pro-epidermal growth factor OS=Rattus norvegicus GN=Egf PE=4 SV=2
7 : J9SG97_RAT 0.77 0.85 1 53 974 1026 53 0 0 1061 J9SG97 Epidermal growth factor OS=Rattus norvegicus GN=EGF PE=2 SV=1
8 : M0RAK7_RAT 0.77 0.85 1 53 973 1025 53 0 0 1130 M0RAK7 Uncharacterized protein OS=Rattus norvegicus PE=4 SV=1
9 : Q6P6T8_RAT 0.77 0.85 1 53 973 1025 53 0 0 1132 Q6P6T8 Egf protein OS=Rattus norvegicus GN=Egf PE=2 SV=1
10 : EGF_PIG 0.74 0.91 1 53 970 1022 53 0 0 1214 Q00968 Pro-epidermal growth factor OS=Sus scrofa GN=EGF PE=1 SV=2
11 : G1TBJ6_RABIT 0.74 0.92 1 53 970 1022 53 0 0 1198 G1TBJ6 Uncharacterized protein OS=Oryctolagus cuniculus GN=EGF PE=4 SV=2
12 : G3ID03_CRIGR 0.74 0.91 1 53 612 664 53 0 0 699 G3ID03 Pro-epidermal growth factor (Fragment) OS=Cricetulus griseus GN=I79_021564 PE=4 SV=1
13 : F6TFM0_MACMU 0.72 0.88 4 53 932 981 50 0 0 1166 F6TFM0 Uncharacterized protein (Fragment) OS=Macaca mulatta GN=EGF PE=4 SV=1
14 : G7MTM7_MACMU 0.72 0.88 4 53 974 1023 50 0 0 1207 G7MTM7 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_16015 PE=4 SV=1
15 : G7P635_MACFA 0.72 0.88 4 53 974 1023 50 0 0 1207 G7P635 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_14614 PE=4 SV=1
16 : H0UUP9_CAVPO 0.72 0.88 4 53 974 1022 50 1 1 1189 H0UUP9 Uncharacterized protein (Fragment) OS=Cavia porcellus GN=EGF PE=4 SV=1
17 : H2PE43_PONAB 0.72 0.88 4 53 974 1023 50 0 0 1177 H2PE43 Uncharacterized protein OS=Pongo abelii GN=EGF PE=4 SV=2
18 : EGF_HUMAN 0.70 0.86 4 53 974 1023 50 0 0 1207 P01133 Pro-epidermal growth factor OS=Homo sapiens GN=EGF PE=1 SV=2
19 : G1S3J7_NOMLE 0.70 0.88 4 53 974 1023 50 0 0 1207 G1S3J7 Uncharacterized protein OS=Nomascus leucogenys GN=EGF PE=4 SV=1
20 : G3QSQ3_GORGO 0.70 0.86 4 53 974 1023 50 0 0 1165 G3QSQ3 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153700 PE=4 SV=1
21 : G3S6T2_GORGO 0.70 0.86 4 53 977 1026 50 0 0 1205 G3S6T2 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153700 PE=4 SV=1
22 : H0WQH5_OTOGA 0.70 0.91 1 53 931 983 53 0 0 1143 H0WQH5 Uncharacterized protein (Fragment) OS=Otolemur garnettii GN=EGF PE=4 SV=1
23 : H2QQ13_PANTR 0.70 0.86 4 53 974 1023 50 0 0 1207 H2QQ13 Uncharacterized protein OS=Pan troglodytes GN=EGF PE=4 SV=1
24 : K9IVS0_DESRO 0.70 0.87 1 53 906 958 53 0 0 1077 K9IVS0 Putative pro-epidermal growth factor (Fragment) OS=Desmodus rotundus PE=2 SV=1
25 : K9J0A7_DESRO 0.70 0.87 1 53 965 1017 53 0 0 1136 K9J0A7 Putative pro-epidermal growth factor OS=Desmodus rotundus PE=2 SV=1
26 : L5K6M6_PTEAL 0.70 0.89 1 53 754 806 53 0 0 991 L5K6M6 Pro-epidermal growth factor OS=Pteropus alecto GN=PAL_GLEAN10022612 PE=4 SV=1
27 : Q6QBS2_HUMAN 0.70 0.86 4 53 4 53 50 0 0 53 Q6QBS2 Epidermal growth factor (Fragment) OS=Homo sapiens GN=EGF PE=2 SV=1
28 : G1M8V3_AILME 0.69 0.81 6 53 663 709 48 1 1 864 G1M8V3 Uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=EGF PE=4 SV=1
29 : EGF_CANFA 0.68 0.79 1 53 973 1024 53 1 1 1216 Q9BEA0 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=2 SV=1
30 : EGF_FELCA 0.68 0.79 1 53 969 1020 53 1 1 1210 Q95ND4 Pro-epidermal growth factor OS=Felis catus GN=EGF PE=2 SV=1
31 : F1PWS8_CANFA 0.68 0.79 1 53 974 1025 53 1 1 1216 F1PWS8 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=4 SV=2
32 : J9NZ75_CANFA 0.68 0.79 1 53 973 1024 53 1 1 1211 J9NZ75 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=4 SV=1
33 : M3VUW6_FELCA 0.68 0.79 1 53 930 981 53 1 1 1171 M3VUW6 Pro-epidermal growth factor (Fragment) OS=Felis catus GN=EGF PE=4 SV=1
34 : M3WWG8_FELCA 0.68 0.79 1 53 971 1022 53 1 1 1122 M3WWG8 Pro-epidermal growth factor OS=Felis catus GN=EGF PE=4 SV=1
35 : G3TC14_LOXAF 0.67 0.86 3 53 934 984 51 0 0 1168 G3TC14 Uncharacterized protein (Fragment) OS=Loxodonta africana GN=EGF PE=4 SV=1
36 : F6T5D5_CALJA 0.66 0.88 4 53 932 981 50 0 0 1165 F6T5D5 Uncharacterized protein OS=Callithrix jacchus GN=EGF PE=4 SV=1
37 : F7GXF4_CALJA 0.66 0.88 4 53 974 1023 50 0 0 1207 F7GXF4 Uncharacterized protein OS=Callithrix jacchus GN=EGF PE=4 SV=1
38 : F7HIN6_CALJA 0.66 0.88 4 53 291 340 50 0 0 374 F7HIN6 Uncharacterized protein (Fragment) OS=Callithrix jacchus GN=EGF PE=4 SV=1
39 : G1PSH1_MYOLU 0.66 0.85 1 53 935 987 53 0 0 1178 G1PSH1 Uncharacterized protein (Fragment) OS=Myotis lucifugus GN=EGF PE=4 SV=1
40 : L5MDV4_MYODS 0.66 0.87 1 53 1595 1647 53 0 0 1838 L5MDV4 Pro-epidermal growth factor OS=Myotis davidii GN=MDA_GLEAN10008435 PE=4 SV=1
41 : S7N681_MYOBR 0.66 0.87 1 53 966 1018 53 0 0 1190 S7N681 Pro-epidermal growth factor OS=Myotis brandtii GN=D623_10030582 PE=4 SV=1
42 : U3DWK0_CALJA 0.66 0.88 4 53 974 1023 50 0 0 1208 U3DWK0 Pro-epidermal growth factor isoform 1 preproprotein OS=Callithrix jacchus GN=EGF PE=2 SV=1
43 : M3YAV8_MUSPF 0.64 0.83 1 53 967 1018 53 1 1 1207 M3YAV8 Uncharacterized protein OS=Mustela putorius furo GN=EGF PE=4 SV=1
44 : U6DB28_NEOVI 0.64 0.83 1 53 967 1018 53 1 1 1172 U6DB28 Pro-epidermal growth factor (Fragment) OS=Neovison vison GN=EGF PE=2 SV=1
45 : F7B726_HORSE 0.62 0.81 1 53 972 1024 53 0 0 1210 F7B726 Uncharacterized protein OS=Equus caballus GN=EGF PE=4 SV=1
46 : F7B762_HORSE 0.62 0.81 1 53 973 1025 53 0 0 1211 F7B762 Uncharacterized protein OS=Equus caballus GN=EGF PE=4 SV=1
47 : A8KBG4_XENTR 0.60 0.71 4 51 928 975 48 0 0 1051 A8KBG4 LOC100127663 protein OS=Xenopus tropicalis GN=egf PE=2 SV=1
48 : F7BYJ3_XENTR 0.60 0.71 4 51 930 977 48 0 0 1053 F7BYJ3 Uncharacterized protein OS=Xenopus tropicalis GN=egf PE=4 SV=1
49 : F7BYK2_XENTR 0.60 0.71 4 51 928 975 48 0 0 1051 F7BYK2 Uncharacterized protein OS=Xenopus tropicalis GN=egf PE=4 SV=1
50 : W5N1G8_LEPOC 0.60 0.74 4 53 985 1034 50 0 0 1204 W5N1G8 Uncharacterized protein OS=Lepisosteus oculatus PE=4 SV=1
51 : W5N1H8_LEPOC 0.60 0.74 4 53 977 1026 50 0 0 1049 W5N1H8 Uncharacterized protein (Fragment) OS=Lepisosteus oculatus PE=4 SV=1
52 : Q28867_HORSE 0.58 0.79 1 48 42 89 48 0 0 89 Q28867 Epidermal growth factor (Fragment) OS=Equus caballus GN=epidermal growth factor/ EGF PE=2 SV=1
53 : G3VFM5_SARHA 0.57 0.84 5 53 932 980 49 0 0 1013 G3VFM5 Uncharacterized protein OS=Sarcophilus harrisii GN=EGF PE=4 SV=1
54 : G3VFM6_SARHA 0.57 0.84 5 53 926 974 49 0 0 1129 G3VFM6 Uncharacterized protein OS=Sarcophilus harrisii GN=EGF PE=4 SV=1
55 : H0Z1B5_TAEGU 0.57 0.74 1 53 1025 1077 53 0 0 1253 H0Z1B5 Uncharacterized protein OS=Taeniopygia guttata GN=EGF PE=4 SV=1
56 : H3D665_TETNG 0.57 0.76 5 53 882 930 49 0 0 967 H3D665 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis PE=4 SV=1
57 : L8YGV5_TUPCH 0.57 0.74 1 53 657 706 53 1 3 852 L8YGV5 Pro-epidermal growth factor (Fragment) OS=Tupaia chinensis GN=TREES_T100005710 PE=4 SV=1
58 : M7CIJ5_CHEMY 0.57 0.78 1 51 2047 2097 51 0 0 2276 M7CIJ5 Pro-epidermal growth factor OS=Chelonia mydas GN=UY3_01923 PE=3 SV=1
59 : H2N242_ORYLA 0.56 0.71 6 53 949 996 48 0 0 1038 H2N242 Uncharacterized protein OS=Oryzias latipes PE=4 SV=1
60 : H2USM7_TAKRU 0.56 0.73 6 53 927 974 48 0 0 997 H2USM7 Uncharacterized protein OS=Takifugu rubripes PE=4 SV=1
61 : B3DH82_DANRE 0.55 0.77 1 53 905 957 53 0 0 1113 B3DH82 Egf protein OS=Danio rerio GN=egf PE=2 SV=1
62 : F1QGQ2_DANRE 0.55 0.77 1 53 969 1021 53 0 0 1177 F1QGQ2 Uncharacterized protein OS=Danio rerio GN=egf PE=4 SV=1
63 : F1QKU8_DANRE 0.55 0.77 1 53 906 958 53 0 0 1114 F1QKU8 Uncharacterized protein OS=Danio rerio GN=egf PE=4 SV=1
64 : K7F4H3_PELSI 0.55 0.76 1 51 979 1029 51 0 0 1199 K7F4H3 Uncharacterized protein OS=Pelodiscus sinensis GN=EGF PE=4 SV=1
65 : K7F4H4_PELSI 0.55 0.76 1 51 1029 1079 51 0 0 1184 K7F4H4 Uncharacterized protein OS=Pelodiscus sinensis GN=EGF PE=4 SV=1
66 : Q6VQA2_DANRE 0.55 0.77 1 53 906 958 53 0 0 1114 Q6VQA2 Epidermal growth factor OS=Danio rerio GN=egf PE=2 SV=1
67 : F6R392_MONDO 0.54 0.83 6 53 928 975 48 0 0 1143 F6R392 Uncharacterized protein OS=Monodelphis domestica GN=EGF PE=4 SV=2
68 : M3ZJA3_XIPMA 0.54 0.71 6 53 885 932 48 0 0 997 M3ZJA3 Uncharacterized protein OS=Xiphophorus maculatus PE=4 SV=1
69 : G3PVY7_GASAC 0.53 0.68 1 53 888 940 53 0 0 981 G3PVY7 Uncharacterized protein OS=Gasterosteus aculeatus PE=4 SV=1
70 : I3IXR8_ORENI 0.53 0.70 1 53 980 1032 53 0 0 1065 I3IXR8 Uncharacterized protein (Fragment) OS=Oreochromis niloticus PE=4 SV=1
71 : S9YQV6_9CETA 0.53 0.79 1 53 28 80 53 0 0 191 S9YQV6 Uncharacterized protein OS=Camelus ferus GN=CB1_000078002 PE=4 SV=1
72 : W5L7Z6_ASTMX 0.52 0.74 4 53 972 1021 50 0 0 1179 W5L7Z6 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
73 : I3LVG5_PIG 0.48 0.71 6 53 399 446 48 0 0 481 I3LVG5 Pro-epidermal growth factor (Fragment) OS=Sus scrofa GN=EGF PE=4 SV=1
74 : K9INK7_DESRO 0.42 0.56 1 52 943 978 52 2 16 1057 K9INK7 Putative pro-epidermal growth factor OS=Desmodus rotundus PE=2 SV=1
75 : GRFA_MYXVL 0.41 0.54 6 47 37 81 46 2 5 85 P08072 Growth factor OS=Myxoma virus (strain Lausanne) GN=MGF PE=1 SV=1
76 : W4YMF7_STRPU 0.41 0.55 6 53 1083 1125 49 2 7 1715 W4YMF7 Uncharacterized protein OS=Strongylocentrotus purpuratus GN=Sp-Notch1-13 PE=4 SV=1
77 : G1KL57_ANOCA 0.40 0.54 5 52 60 103 48 2 4 167 G1KL57 Uncharacterized protein OS=Anolis carolinensis GN=EREG PE=4 SV=2
78 : G3WL15_SARHA 0.40 0.60 6 53 241 284 48 1 4 716 G3WL15 Uncharacterized protein OS=Sarcophilus harrisii GN=NRG2 PE=4 SV=1
79 : G3WL17_SARHA 0.40 0.60 6 53 241 284 48 1 4 708 G3WL17 Uncharacterized protein OS=Sarcophilus harrisii GN=NRG2 PE=4 SV=1
80 : L5JX10_PTEAL 0.40 0.60 6 53 113 156 48 1 4 364 L5JX10 Pro-neuregulin-2, membrane-bound isoform OS=Pteropus alecto GN=PAL_GLEAN10016796 PE=4 SV=1
81 : V4APC6_LOTGI 0.40 0.59 1 50 7 64 58 4 8 95 V4APC6 Uncharacterized protein (Fragment) OS=Lottia gigantea GN=LOTGIDRAFT_89155 PE=4 SV=1
82 : D6WGH5_TRICA 0.38 0.54 6 52 135 179 50 3 8 291 D6WGH5 Keren OS=Tribolium castaneum GN=Krn PE=4 SV=1
83 : D8X0C3_TRICA 0.38 0.54 6 52 113 157 50 3 8 271 D8X0C3 Transforming growth factor alpha-like protein OS=Tribolium castaneum PE=2 SV=1
84 : E9I206_DAPPU 0.38 0.55 6 53 54 106 53 3 5 244 E9I206 Putative uncharacterized protein OS=Daphnia pulex GN=DAPPUDRAFT_271340 PE=4 SV=1
85 : R4WQY7_9HEMI 0.38 0.54 6 52 93 137 50 3 8 240 R4WQY7 Uncharacterized protein OS=Riptortus pedestris PE=2 SV=1
86 : V3ZVD4_LOTGI 0.38 0.59 1 50 7 64 58 4 8 95 V3ZVD4 Uncharacterized protein (Fragment) OS=Lottia gigantea GN=LOTGIDRAFT_89401 PE=4 SV=1
87 : B4JPB6_DROGR 0.37 0.56 6 53 201 252 52 3 4 312 B4JPB6 GH13486 OS=Drosophila grimshawi GN=Dgri\GH13486 PE=4 SV=1
88 : L7N137_MYOLU 0.37 0.59 3 53 45 92 51 2 3 152 L7N137 Uncharacterized protein (Fragment) OS=Myotis lucifugus PE=4 SV=1
89 : S7NVI6_MYOBR 0.37 0.59 3 53 173 220 51 2 3 280 S7NVI6 Probetacellulin OS=Myotis brandtii GN=D623_10008857 PE=4 SV=1
90 : V3ZVE9_LOTGI 0.36 0.55 1 50 230 287 58 4 8 393 V3ZVE9 Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_160631 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A N 0 0 150 47 33 NNNNNNNNNNNN N NNN NNNNNN NNN NNNN N G SD NNNDDN NN
2 2 A S + 0 0 80 47 45 SSSSSSSSSSSS S TTS GSGGSS VGG GGSS S D SG GGGGGG SA
3 3 A Y - 0 0 175 50 51 YYYYNIIINYFN Y YYY YYYYYYY YYY FFYY Y F YF VVVFFV AV
4 4 A P + 0 0 116 71 79 PPPPTTTTTSPPSSSPSSSSSPSRRPS RQRRQQPSSSSSSSLLQQRRREEQ V PV QQQMMQ QE
5 5 A G + 0 0 19 75 53 GGGGGGGGGEGGGGGGGEGEEGEEEEE EEEEEEGGGGEEEGEEEEEEESSEEEGSEG SSSGGS SS
6 6 A a - 0 0 44 91 3 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
7 7 A P > - 0 0 63 91 25 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPSSPPPPPSPPPPPPPPPPPPPPPPPP
8 8 A S T 3 S+ 0 0 105 91 74 SSSEPPPPPPPPLLLPLLLLLPLPPPLSSPSSPPPPPPPPPPPPQQLLLSSQPPPSPSSSSSSSSSPPSA
9 9 A S T > S+ 0 0 87 91 47 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSAAASSSSSSTESSTTTTSSTPSST
10 10 A Y T < + 0 0 73 91 55 YYYHYYYYYHHHHHHHHHHHHHHHHHHYYYYYYYHHHHHHHHYYYYYYYHHYYYYHYYHHHHHHHHYHHH
11 11 A D T 3 S+ 0 0 150 91 50 DDDDDDDDDDDDDDDDDDDDDDDAAADEDDDDDDDSSSSAASDDDDDDDEEDDDEQEEEDDDDDDDDDEE
12 12 A G S < S+ 0 0 62 91 50 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTTGSSSNGSANSSSSSSSAST
13 13 A Y S S+ 0 0 59 91 12 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
14 14 A b - 0 0 22 91 3 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
15 15 A L S S+ 0 0 115 91 24 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
16 16 A N S S- 0 0 15 91 60 NNNHNNNNNHHHHHHHHHHHHHHHHHHYYYYYYYNHHHHHHHHHHHNNNYYHHHHYHHYYYYYHHYHYYY
17 17 A G + 0 0 55 91 39 GGGGGGGGGGGGDDDGDDDDDGDGGGDNNNNNNNDDDDGGGDNNGGGGGGGGGGGGGGQEDDDGGDGHQH
18 18 A G - 0 0 25 91 15 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
19 19 A V E -A 32 0A 85 88 36 VVVVVVVVVVVVVVVVVVVVVAVVVVVVVVVVVVLVVVMMMVVVKKVVVVVKVVVVVVVIVVVVVVTVVV
20 20 A a E +A 31 0A 35 91 6 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
21 21 A M E -A 30 0A 21 91 57 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMVVIIIFFVIINFRIFFFFFIIFIVFY
22 22 A H - 0 0 66 91 47 HHHYYYYYYYYYYYYHYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYHHHYYYYYYYYYYYYYYYYYYYYY
23 23 A I - 0 0 54 91 45 IIIVVVVVVIIVIIIIIIIIIVIIIIIIIIIIIIIIIIIIIIIILLFFFIILVVVFIVYFFFFVVFVFYF
24 24 A E S S+ 0 0 126 91 66 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEVVPPPPPVSSSPASPPPPPSSPSPPP
25 25 A S S S- 0 0 103 91 75 SSSASSSSSAAAAPASAAAAATASSAAAAAAAAAQTTTAAATAAQQEEEEEQEEDEVIEEDDDIIEEEEE
26 26 A L S S- 0 0 139 91 44 LLLVVVVVVVVVLLLLLLLLLVLVVVLVVVVVVVVVVVVVVVVVVVLLLIIVLLLMLLMVMMMLLMLLML
27 27 A D + 0 0 99 91 39 DDDDDDDDDDDDDDDNDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDKKKGGDEEQDGQEDEEEQQEEKED
28 28 A S - 0 0 78 91 79 SSSSRRRRRSNSKKKTKKKKKTKTTNKRRRRRRRQKKKRRRKRRTTDDDSSTNNDSSDASSSSDDSNISS
29 29 A Y + 0 0 65 91 44 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYHHYYYYYHYYYYYYYYYYYYYYYYYY
30 30 A T E -A 21 0A 39 89 49 TTTAVVVVVAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGAAAAAAAAAAAAAAAAAAAAA
31 31 A b E -A 20 0A 9 90 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
32 32 A N E -A 19 0A 92 90 57 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKNNNNNNNNNNNRRRNNNKKNNNTNNNNNTTNKNNN
33 33 A c - 0 0 27 90 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
34 34 A V B > -B 37 0B 59 90 52 VVVVVVVVVVVIVVVVVVVVVVVVVIVVVVVVVVIVVVVVVVVVVVVVVVVVMMVVAVVVVVVVVVMIAV
35 35 A I T 3 S+ 0 0 136 90 82 IIIIIIIIIFVVVVVIVVVVVVVVVVVFFFFFFFVVVVVVVVFFVVAAAKKVVVTAVTSALLLTTLVRSS
36 36 A G T 3 S+ 0 0 27 90 15 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
37 37 A Y B < +B 34 0B 16 90 1 YYYYYYYYYYYYYYYYYYYYYFYFFYYYYYYYYYFYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
38 38 A S B +C 44 0C 67 90 53 SSSVIIIIIVVVIIIVIIIIIVIVVVIVVVVVVVILLLVVVLVVVVVVVMMVVVVIVVMIMMMVVMVLIM
39 39 A G S S- 0 0 54 90 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
40 40 A D S S- 0 0 119 90 29 DDDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
41 41 A R S S- 0 0 81 91 10 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
42 42 A c S S+ 0 0 0 91 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
43 43 A Q + 0 0 120 91 25 QQQQQQQQQQQQQQQEQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
44 44 A T B S+C 38 0C 90 91 75 TTTHHHHHHHHHYYYHYYYYYHYYYHYHHHHHHHHYYYHHHYHHHHFFFFFHHHFFHFFFFFFFFFHFFF
45 45 A R - 0 0 95 91 72 RRRRRRRRRRRRRRRQRRQRRRRRRRRRRRRRRRRQQQRRRQRRQQDDDSSQSSSSLSSSSSSSSSSSSS
46 46 A D - 0 0 101 91 22 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
47 47 A L > - 0 0 108 69 29 LLLLLLLLLLLLLLL.LLLLLLLLLLL.......LLLLLLLL..LLLLLLLLLLLL.LLLLLLLLLLLLL
48 48 A R T 3 S+ 0 0 168 78 75 RRRRRRRRRKKRKKKLKKKKKKKRRKKLLLLLLLRKKKKKKKLLRRKKKEEREEEE.EEEEEEEEEEEEE
49 49 A W T 3 S+ 0 0 148 79 30 WWWWWWWWWWWWWWWKWWWWWWWWWWWTKKKKKKWWWWWWWWRRWWSSSWW WWWW.WWWWWWWWWWWWW
50 50 A W S < S+ 0 0 51 81 20 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW WWWWLWLWWWWWWWWLGL
51 51 A E S S- 0 0 118 81 41 EEEEKKKKKEEEEEEEEEEEEEEEEEEEEEEEEEKEEEEEEEEEEEEEEEE EEEDQEEDEEEDDEEEEE
52 52 A L 0 0 129 79 3 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL LL MMQLL LLLLL LMLLL
53 53 A R 0 0 255 75 24 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR QQ RRRQR RQQQQ QRQQQ
## ALIGNMENTS 71 - 90
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A N 0 0 150 47 33 S N S S S
2 2 A S + 0 0 80 47 45 V T S S S
3 3 A Y - 0 0 175 50 51 Y Y C C FFC
4 4 A P + 0 0 116 71 79 RE R S S SSS
5 5 A G + 0 0 19 75 53 KS E G D D PPD
6 6 A a - 0 0 44 91 3 CCCCCPCCCCCCCCCCCCCC
7 7 A P > - 0 0 63 91 25 PPNPNPKNNNPPPPTPSPPP
8 8 A S T 3 S+ 0 0 105 91 74 PSGPDDSEEETPPPANDKKS
9 9 A S T > S+ 0 0 87 91 47 STPSDPDSSSSAAAEYEKKS
10 10 A Y T < + 0 0 73 91 55 YHHHYCMAAACYYYHCYYYC
11 11 A D T 3 S+ 0 0 150 91 50 HDDAKNNKKKKAAATQIEES
12 12 A G S < S+ 0 0 62 91 50 GSGGNSNSSSDAATADKHHD
13 13 A Y S S+ 0 0 59 91 12 YFFYYfYYYYnwwyynyYYn
14 14 A b - 0 0 22 91 3 CCRCCcCCCCcccccccCCc
15 15 A L S S+ 0 0 115 91 24 LLLLLSFVVVDLLLLDLIID
16 16 A N S S- 0 0 15 91 60 NYHHNNNNNNCNNNNCNNNC
17 17 A G + 0 0 55 91 39 GNHGNGGGGGGGGGGGGGGN
18 18 A G - 0 0 25 91 15 GGNGgGDGGGgAAAGgGKKg
19 19 A V E -A 32 0A 85 88 36 VVTVvA.VVVlTTTTlN..i
20 20 A a E +A 31 0A 35 91 6 CCLCACCCCCCCCCCCCCCC
21 21 A M E -A 30 0A 21 91 57 TLMMLSMYYYFFFFFFFRRF
22 22 A H - 0 0 66 91 47 YYLYNRYYYYNTTTANRFFN
23 23 A I - 0 0 54 91 45 LFLINILIIITVVIITWVVG
24 24 A E S S+ 0 0 126 91 66 EPQEVGVEEEGKKKKGAGGG
25 25 A S S S- 0 0 103 91 75 LERSSQEGGGTIIIITNAAT
26 26 A L S S- 0 0 139 91 44 IMIVLSLIIILGGGGLALLL
27 27 A D + 0 0 99 91 39 NEMDNSNNNNDDDEDDNQQD
28 28 A S - 0 0 78 91 79 HSETPSEQQQISSSTISTTI
29 29 A Y + 0 0 65 91 44 SYYYFFHLLLnllilnfPPn
30 30 A T E -A 21 0A 39 89 49 AGS..QSSSStnnnatySSt
31 31 A b E -A 20 0A 9 90 0 CCC.CCCCCCCCCCCCCCCC
32 32 A N E -A 19 0A 92 90 57 TNF.ATRKKKSEEEDSVLLS
33 33 A c - 0 0 27 90 0 CCC.CCCCCCCCCCCCCCCC
34 34 A V B > -B 37 0B 59 90 52 VLV.HTYPPPPAATAPADDP
35 35 A I T 3 S+ 0 0 136 90 82 VPF.INTNNNSEEEDSVDDS
36 36 A G T 3 S+ 0 0 27 90 15 GGG.NGGGGGLGGGGLGGGL
37 37 A Y B < +B 34 0B 16 90 1 FYY.YYYFFFYYYFYYYYYY
38 38 A S B +C 44 0C 67 90 53 FIV.VSVFFFSMMMISVAAS
39 39 A G S S- 0 0 54 90 0 GGG.GGGGGGGGGGGGGGGG
40 40 A D S S- 0 0 119 90 29 EEE.SQVQQQDPPQEDEAAD
41 41 A R S S- 0 0 81 91 10 RRRARRRRRRQRRRRQRRRQ
42 42 A c S S+ 0 0 0 91 0 CCCCCCCCCCCCCCCCCCCC
43 43 A Q + 0 0 120 91 25 QQQKQQGLLLQEEEDQDEEQ
44 44 A T B S+C 38 0C 90 91 75 SFHTFNHEEETYYYYTSRRT
45 45 A R - 0 0 95 91 72 QSRQIRSKKKQKKKKQKVVL
46 46 A D - 0 0 101 91 22 DDDKNDNLLLdDDdDdtDDq
47 47 A L > - 0 0 108 69 29 FLL.L.....e..g.eg..e
48 48 A R T 3 S+ 0 0 168 78 75 TEK. .....E..S.EV..K
49 49 A W T 3 S+ 0 0 148 79 30 WWW. .....W..Y.WYFFS
50 50 A W S < S+ 0 0 51 81 20 WWW. .F...W..L.WVFFW
51 51 A E S S- 0 0 118 81 41 DEE. .EPPP ..P. PYY
52 52 A L 0 0 129 79 3 LLLL .LLLL LLLL LLL
53 53 A R 0 0 255 75 24 RQR R RRR R RRR
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 2 0 0 11 0 0 0 0 0 0 0 81 6 47 0 0 0.668 22 0.66
2 2 A 4 0 0 0 0 0 0 30 2 0 55 6 0 0 0 0 0 0 0 2 47 0 0 1.162 38 0.55
3 3 A 10 0 6 0 18 0 52 0 2 0 0 0 6 0 0 0 0 0 6 0 50 0 0 1.464 48 0.48
4 4 A 3 3 0 3 0 0 0 0 0 17 32 7 0 0 14 0 15 6 0 0 71 0 0 1.881 62 0.21
5 5 A 0 0 0 0 0 0 0 39 0 3 13 0 0 0 0 1 0 40 0 4 75 0 0 1.286 42 0.47
6 6 A 0 0 0 0 0 0 0 0 0 1 0 0 99 0 0 0 0 0 0 0 91 0 0 0.060 2 0.97
7 7 A 0 0 0 0 0 0 0 0 0 88 4 1 0 0 0 1 0 0 5 0 91 0 0 0.509 16 0.74
8 8 A 0 14 0 0 0 0 0 1 2 41 26 1 0 0 0 2 3 4 1 3 91 0 0 1.674 55 0.26
9 9 A 0 0 0 0 0 0 1 0 7 3 73 9 0 0 0 2 0 3 0 2 91 0 0 1.068 35 0.53
10 10 A 0 0 0 1 0 0 42 0 3 0 0 0 4 49 0 0 0 0 0 0 91 0 0 1.012 33 0.45
11 11 A 0 0 1 0 0 0 0 0 10 0 7 1 0 1 0 5 2 12 2 58 91 0 0 1.454 48 0.49
12 12 A 0 0 0 0 0 0 0 60 5 0 19 4 0 2 0 1 0 0 4 3 91 0 0 1.298 43 0.49
13 13 A 0 0 0 0 3 2 91 0 0 0 0 0 0 0 0 0 0 0 3 0 91 0 9 0.393 13 0.87
14 14 A 0 0 0 0 0 0 0 0 0 0 0 0 99 0 1 0 0 0 0 0 91 0 0 0.060 2 0.97
15 15 A 3 89 2 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 3 91 0 0 0.512 17 0.75
16 16 A 0 0 0 0 0 0 22 0 0 0 0 0 3 45 0 0 0 0 30 0 91 0 0 1.165 38 0.39
17 17 A 0 0 0 0 0 0 0 59 0 0 0 0 0 3 0 0 2 1 13 21 91 0 0 1.150 38 0.61
18 18 A 0 0 0 0 0 0 0 92 3 0 0 0 0 0 0 2 0 0 1 1 91 3 4 0.369 12 0.85
19 19 A 77 3 2 3 0 0 0 0 2 0 0 7 0 0 0 3 0 0 1 0 88 0 0 0.951 31 0.63
20 20 A 0 1 0 0 0 0 0 0 1 0 0 0 98 0 0 0 0 0 0 0 91 0 0 0.121 4 0.93
21 21 A 4 2 10 53 20 0 4 0 0 0 1 1 0 0 3 0 0 0 1 0 91 0 0 1.507 50 0.43
22 22 A 0 1 0 0 2 0 77 0 1 0 0 3 0 9 2 0 0 0 4 0 91 0 0 0.932 31 0.53
23 23 A 21 7 52 0 13 1 2 1 0 0 0 2 0 0 0 0 0 0 1 0 91 0 0 1.431 47 0.55
24 24 A 5 0 0 0 0 0 0 7 2 18 8 0 0 0 0 4 1 55 0 0 91 0 0 1.441 48 0.33
25 25 A 1 1 8 0 0 0 0 3 31 1 15 9 0 0 1 0 5 19 1 4 91 0 0 2.032 67 0.24
26 26 A 38 38 8 9 0 0 0 4 1 0 1 0 0 0 0 0 0 0 0 0 91 0 0 1.383 46 0.56
27 27 A 0 0 0 1 0 0 0 3 0 0 1 0 0 0 0 4 7 12 9 63 91 0 0 1.290 43 0.61
28 28 A 0 0 4 0 0 0 0 0 1 1 26 12 0 1 19 15 4 2 5 8 91 0 0 2.072 69 0.20
29 29 A 0 7 1 0 3 0 78 0 0 2 1 0 0 4 0 0 0 0 3 0 91 2 8 0.918 30 0.55
30 30 A 6 0 0 0 0 0 1 4 69 0 8 8 0 0 0 0 1 0 3 0 89 0 0 1.175 39 0.50
31 31 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 90 0 0 0.000 0 1.00
32 32 A 1 2 0 0 1 0 0 0 1 0 3 6 0 0 4 8 0 3 69 1 90 0 0 1.266 42 0.42
33 33 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 90 0 0 0.000 0 1.00
34 34 A 71 1 4 3 0 0 1 0 7 7 0 2 0 1 0 0 0 0 0 2 90 0 0 1.174 39 0.48
35 35 A 37 4 13 0 12 0 0 0 6 1 7 6 0 0 1 2 0 3 4 3 90 0 0 2.083 69 0.18
36 36 A 0 3 0 0 0 0 0 96 0 0 0 0 0 0 0 0 0 0 1 0 90 0 0 0.207 6 0.84
37 37 A 0 0 0 0 10 0 90 0 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0.325 10 0.99
38 38 A 43 6 23 12 4 0 0 0 2 0 9 0 0 0 0 0 0 0 0 0 90 0 0 1.558 51 0.46
39 39 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0.000 0 1.00
40 40 A 1 0 0 0 0 0 0 0 2 2 1 0 0 0 0 0 6 79 0 9 90 0 0 0.832 27 0.70
41 41 A 0 0 0 0 0 0 0 0 1 0 0 0 0 0 96 0 3 0 0 0 91 0 0 0.205 6 0.90
42 42 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 91 0 0 0.000 0 1.00
43 43 A 0 3 0 0 0 0 0 1 0 0 0 0 0 0 0 1 86 7 0 2 91 0 0 0.607 20 0.75
44 44 A 0 0 0 0 23 0 22 0 0 0 2 9 0 37 2 0 0 3 1 0 91 0 0 1.583 52 0.24
45 45 A 2 2 1 0 0 0 0 0 0 0 23 0 0 0 45 9 14 0 0 3 91 0 0 1.519 50 0.28
46 46 A 0 3 0 0 0 0 0 0 0 0 0 1 0 0 0 1 1 1 2 90 91 22 5 0.489 16 0.78
47 47 A 0 91 0 0 1 0 0 3 0 0 0 0 0 0 0 0 0 4 0 0 69 0 0 0.383 12 0.71
48 48 A 1 13 0 0 0 0 0 0 0 0 1 1 0 0 22 33 0 28 0 0 78 0 0 1.486 49 0.25
49 49 A 0 0 0 0 3 77 3 0 0 0 5 1 0 0 3 9 0 0 0 0 79 0 0 0.900 30 0.70
50 50 A 1 6 0 0 4 88 0 1 0 0 0 0 0 0 0 0 0 0 0 0 81 0 0 0.518 17 0.80
51 51 A 0 0 0 0 0 0 2 0 0 6 0 0 0 0 0 7 1 77 0 6 81 0 0 0.887 29 0.59
52 52 A 0 95 0 4 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 79 0 0 0.229 7 0.96
53 53 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 84 0 16 0 0 0 75 0 0 0.440 14 0.76
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
75 14 50 4 gTCFTv
76 9 1091 1 fPc
81 14 20 1 nLc
81 19 26 1 gKl
81 30 38 2 nTCt
81 47 57 4 dCPGKe
82 9 143 1 wYc
82 25 160 2 lLYn
83 9 121 1 wYc
83 25 138 2 lLYn
84 9 62 1 yYc
84 25 79 2 iLYn
84 42 98 2 dLDg
85 9 101 1 yYc
85 25 118 2 lIPa
86 14 20 1 nLc
86 19 26 1 gKl
86 30 38 2 nTCt
86 47 57 4 dCPGEe
87 9 209 1 yFc
87 25 226 1 fSy
87 42 244 2 tENg
90 14 243 1 nLc
90 19 249 1 gKi
90 30 261 2 nTCt
90 47 280 4 qCTKTe
//