Complet list of 3egf hssp file
Complete list of 3egf.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 3EGF
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2013-09-28
HEADER GROWTH FACTOR 1994-01-31 3EGF
COMPND EPIDERMAL GROWTH FACTOR
SOURCE Mus musculus
AUTHOR Montelione, G.T.; Wuthrich, K.; Scheraga, H.A.
SEQLENGTH 53
NCHAIN 1 chain(s) in 3EGF data set
NALIGN 79
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : EGF_MOUSE 1GK5 1.00 1.00 1 53 977 1029 53 0 0 1217 P01132 Pro-epidermal growth factor OS=Mus musculus GN=Egf PE=1 SV=2
2 : Q3UWD7_MOUSE 1.00 1.00 1 53 476 528 53 0 0 716 Q3UWD7 Putative uncharacterized protein OS=Mus musculus GN=Egf PE=2 SV=1
3 : Q8VD07_MOUSE 1.00 1.00 1 53 960 1012 53 0 0 1200 Q8VD07 Egf protein OS=Mus musculus GN=Egf PE=2 SV=1
4 : I3MW38_SPETR 0.79 0.91 1 53 920 972 53 0 0 1118 I3MW38 Uncharacterized protein (Fragment) OS=Spermophilus tridecemlineatus GN=EGF PE=4 SV=1
5 : EGF_RAT 0.77 0.85 1 53 974 1026 53 0 0 1133 P07522 Pro-epidermal growth factor OS=Rattus norvegicus GN=Egf PE=1 SV=2
6 : F1M959_RAT 0.77 0.85 1 53 974 1026 53 0 0 1085 F1M959 Pro-epidermal growth factor OS=Rattus norvegicus GN=Egf PE=4 SV=2
7 : J9SG97_RAT 0.77 0.85 1 53 974 1026 53 0 0 1061 J9SG97 Epidermal growth factor OS=Rattus norvegicus GN=EGF PE=2 SV=1
8 : M0RAK7_RAT 0.77 0.85 1 53 973 1025 53 0 0 1130 M0RAK7 Uncharacterized protein OS=Rattus norvegicus PE=4 SV=1
9 : Q6P6T8_RAT 0.77 0.85 1 53 973 1025 53 0 0 1132 Q6P6T8 Egf protein OS=Rattus norvegicus GN=Egf PE=2 SV=1
10 : EGF_PIG 0.74 0.91 1 53 970 1022 53 0 0 1214 Q00968 Pro-epidermal growth factor OS=Sus scrofa GN=EGF PE=1 SV=2
11 : G1TBJ6_RABIT 0.74 0.92 1 53 936 988 53 0 0 1176 G1TBJ6 Uncharacterized protein (Fragment) OS=Oryctolagus cuniculus GN=EGF PE=4 SV=1
12 : G3ID03_CRIGR 0.74 0.91 1 53 612 664 53 0 0 699 G3ID03 Pro-epidermal growth factor (Fragment) OS=Cricetulus griseus GN=I79_021564 PE=4 SV=1
13 : H0UUP9_CAVPO 0.74 0.93 4 49 974 1019 46 0 0 1189 H0UUP9 Uncharacterized protein (Fragment) OS=Cavia porcellus GN=LOC100714471 PE=4 SV=1
14 : F6TFM0_MACMU 0.72 0.87 1 53 929 981 53 0 0 1166 F6TFM0 Uncharacterized protein (Fragment) OS=Macaca mulatta GN=EGF PE=4 SV=1
15 : G7MTM7_MACMU 0.72 0.87 1 53 971 1023 53 0 0 1207 G7MTM7 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_16015 PE=4 SV=1
16 : G7P635_MACFA 0.72 0.87 1 53 971 1023 53 0 0 1207 G7P635 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_14614 PE=4 SV=1
17 : H2PE43_PONAB 0.72 0.87 1 53 971 1023 53 0 0 1177 H2PE43 Uncharacterized protein OS=Pongo abelii GN=EGF PE=4 SV=2
18 : EGF_HUMAN 1P9J 0.70 0.85 1 53 971 1023 53 0 0 1207 P01133 Pro-epidermal growth factor OS=Homo sapiens GN=EGF PE=1 SV=2
19 : G1S3J7_NOMLE 0.70 0.87 1 53 971 1023 53 0 0 1207 G1S3J7 Uncharacterized protein OS=Nomascus leucogenys GN=EGF PE=4 SV=1
20 : G3QSQ3_GORGO 0.70 0.85 1 53 971 1023 53 0 0 1165 G3QSQ3 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153700 PE=4 SV=1
21 : G3S6T2_GORGO 0.70 0.85 1 53 974 1026 53 0 0 1205 G3S6T2 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153700 PE=4 SV=1
22 : H0WQH5_OTOGA 0.70 0.91 1 53 931 983 53 0 0 1143 H0WQH5 Uncharacterized protein (Fragment) OS=Otolemur garnettii GN=EGF PE=4 SV=1
23 : H2QQ13_PANTR 0.70 0.85 1 53 971 1023 53 0 0 1207 H2QQ13 Uncharacterized protein OS=Pan troglodytes GN=EGF PE=4 SV=1
24 : K9IVS0_DESRO 0.70 0.87 1 53 906 958 53 0 0 1077 K9IVS0 Putative pro-epidermal growth factor (Fragment) OS=Desmodus rotundus PE=2 SV=1
25 : K9J0A7_DESRO 0.70 0.87 1 53 965 1017 53 0 0 1136 K9J0A7 Putative pro-epidermal growth factor OS=Desmodus rotundus PE=2 SV=1
26 : L5K6M6_PTEAL 0.70 0.89 1 53 754 806 53 0 0 991 L5K6M6 Pro-epidermal growth factor OS=Pteropus alecto GN=PAL_GLEAN10022612 PE=4 SV=1
27 : Q6QBS2_HUMAN3NJP 0.70 0.86 4 53 4 53 50 0 0 53 Q6QBS2 Epidermal growth factor (Fragment) OS=Homo sapiens GN=EGF PE=2 SV=1
28 : EGF_CANFA 0.69 0.84 1 49 973 1021 49 0 0 1216 Q9BEA0 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=2 SV=1
29 : EGF_FELCA 0.69 0.84 1 49 969 1017 49 0 0 1210 Q95ND4 Pro-epidermal growth factor OS=Felis catus GN=EGF PE=2 SV=1
30 : F1PWS8_CANFA 0.69 0.84 1 49 974 1022 49 0 0 1216 F1PWS8 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=4 SV=2
31 : J9NZ75_CANFA 0.69 0.84 1 49 973 1021 49 0 0 1211 J9NZ75 Pro-epidermal growth factor OS=Canis familiaris GN=EGF PE=4 SV=1
32 : M3VUW6_FELCA 0.69 0.84 1 49 930 978 49 0 0 1171 M3VUW6 Pro-epidermal growth factor (Fragment) OS=Felis catus GN=EGF PE=4 SV=1
33 : M3WWG8_FELCA 0.69 0.84 1 49 971 1019 49 0 0 1122 M3WWG8 Pro-epidermal growth factor OS=Felis catus GN=EGF PE=4 SV=1
34 : M3YAV8_MUSPF 0.67 0.86 1 49 967 1015 49 0 0 1207 M3YAV8 Uncharacterized protein OS=Mustela putorius furo GN=EGF PE=4 SV=1
35 : F6T5D5_CALJA 0.66 0.87 1 53 929 981 53 0 0 1165 F6T5D5 Uncharacterized protein OS=Callithrix jacchus GN=EGF PE=4 SV=1
36 : F7GXF4_CALJA 0.66 0.87 1 53 971 1023 53 0 0 1207 F7GXF4 Uncharacterized protein OS=Callithrix jacchus GN=EGF PE=4 SV=1
37 : F7HIN6_CALJA 0.66 0.88 4 53 291 340 50 0 0 374 F7HIN6 Uncharacterized protein (Fragment) OS=Callithrix jacchus GN=EGF PE=4 SV=1
38 : G1PSH1_MYOLU 0.66 0.85 1 53 935 987 53 0 0 1178 G1PSH1 Uncharacterized protein (Fragment) OS=Myotis lucifugus PE=4 SV=1
39 : L5MDV4_MYODS 0.66 0.87 1 53 1595 1647 53 0 0 1838 L5MDV4 Pro-epidermal growth factor OS=Myotis davidii GN=MDA_GLEAN10008435 PE=4 SV=1
40 : G3TC14_LOXAF 0.64 0.85 1 53 932 984 53 0 0 1168 G3TC14 Uncharacterized protein (Fragment) OS=Loxodonta africana PE=4 SV=1
41 : G1NEX1_MELGA 0.63 0.78 6 51 1037 1082 46 0 0 1251 G1NEX1 Uncharacterized protein OS=Meleagris gallopavo PE=4 SV=1
42 : R0JVC3_ANAPL 0.63 0.78 6 51 1031 1076 46 0 0 1253 R0JVC3 Pro-epidermal growth factor (Fragment) OS=Anas platyrhynchos GN=Anapl_04798 PE=4 SV=1
43 : F7B726_HORSE 0.62 0.81 1 53 972 1024 53 0 0 1210 F7B726 Uncharacterized protein OS=Equus caballus GN=EGF PE=4 SV=1
44 : F7B762_HORSE 0.62 0.81 1 53 973 1025 53 0 0 1211 F7B762 Uncharacterized protein OS=Equus caballus GN=EGF PE=4 SV=1
45 : F1NEP5_CHICK 0.61 0.78 6 51 1031 1076 46 0 0 1245 F1NEP5 Uncharacterized protein OS=Gallus gallus GN=EGF PE=4 SV=1
46 : Q6PPB4_CHICK 0.61 0.78 6 51 1031 1076 46 0 0 1245 Q6PPB4 Epidermal growth factor OS=Gallus gallus GN=EGF PE=2 SV=1
47 : A8KBG4_XENTR 0.60 0.71 4 51 928 975 48 0 0 1051 A8KBG4 LOC100127663 protein OS=Xenopus tropicalis GN=egf PE=2 SV=1
48 : F7BYJ3_XENTR 0.60 0.71 4 51 930 977 48 0 0 1053 F7BYJ3 Uncharacterized protein OS=Xenopus tropicalis GN=egf PE=4 SV=1
49 : F7BYK2_XENTR 0.60 0.71 4 51 928 975 48 0 0 1051 F7BYK2 Uncharacterized protein OS=Xenopus tropicalis GN=egf PE=4 SV=1
50 : H0Z1B5_TAEGU 0.58 0.75 2 53 1026 1077 52 0 0 1253 H0Z1B5 Uncharacterized protein OS=Taeniopygia guttata GN=EGF PE=4 SV=1
51 : H3D665_TETNG 0.58 0.75 6 53 883 930 48 0 0 967 H3D665 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis GN=EGF PE=4 SV=1
52 : Q28867_HORSE 0.58 0.79 1 48 42 89 48 0 0 89 Q28867 Epidermal growth factor (Fragment) OS=Equus caballus GN=epidermal growth factor/ EGF PE=2 SV=1
53 : G3VFM5_SARHA 0.57 0.84 5 53 932 980 49 0 0 1013 G3VFM5 Uncharacterized protein OS=Sarcophilus harrisii GN=EGF PE=4 SV=1
54 : G3VFM6_SARHA 0.57 0.84 5 53 926 974 49 0 0 1129 G3VFM6 Uncharacterized protein OS=Sarcophilus harrisii GN=EGF PE=4 SV=1
55 : L8YGV5_TUPCH 0.57 0.78 1 51 657 707 51 0 0 852 L8YGV5 Pro-epidermal growth factor (Fragment) OS=Tupaia chinensis GN=TREES_T100005710 PE=4 SV=1
56 : M7CIJ5_CHEMY 0.57 0.78 1 51 2047 2097 51 0 0 2276 M7CIJ5 Pro-epidermal growth factor OS=Chelonia mydas GN=UY3_01923 PE=3 SV=1
57 : H2N242_ORYLA 0.56 0.71 6 53 949 996 48 0 0 1038 H2N242 Uncharacterized protein OS=Oryzias latipes PE=4 SV=1
58 : H2USM7_TAKRU 0.56 0.73 6 53 927 974 48 0 0 997 H2USM7 Uncharacterized protein OS=Takifugu rubripes PE=4 SV=1
59 : B3DH82_DANRE 0.55 0.77 1 53 905 957 53 0 0 1113 B3DH82 Egf protein OS=Danio rerio GN=egf PE=2 SV=1
60 : F1QGQ2_DANRE 0.55 0.77 1 53 969 1021 53 0 0 1177 F1QGQ2 Uncharacterized protein OS=Danio rerio GN=egf PE=4 SV=1
61 : F1QKU8_DANRE 0.55 0.77 1 53 906 958 53 0 0 1114 F1QKU8 Uncharacterized protein OS=Danio rerio GN=egf PE=4 SV=1
62 : K7F4H3_PELSI 0.55 0.76 1 51 979 1029 51 0 0 1199 K7F4H3 Uncharacterized protein OS=Pelodiscus sinensis GN=EGF PE=4 SV=1
63 : K7F4H4_PELSI 0.55 0.76 1 51 1029 1079 51 0 0 1184 K7F4H4 Uncharacterized protein OS=Pelodiscus sinensis GN=EGF PE=4 SV=1
64 : Q6VQA2_DANRE 0.55 0.77 1 53 906 958 53 0 0 1114 Q6VQA2 Epidermal growth factor OS=Danio rerio GN=egf PE=2 SV=1
65 : F6R392_MONDO 0.54 0.83 6 53 928 975 48 0 0 1143 F6R392 Uncharacterized protein OS=Monodelphis domestica GN=EGF PE=4 SV=2
66 : M3ZJA3_XIPMA 0.54 0.71 6 53 885 932 48 0 0 997 M3ZJA3 Uncharacterized protein OS=Xiphophorus maculatus GN=EGF PE=4 SV=1
67 : G3PVY7_GASAC 0.53 0.68 1 53 888 940 53 0 0 981 G3PVY7 Uncharacterized protein OS=Gasterosteus aculeatus GN=EGF PE=4 SV=1
68 : I3IXR8_ORENI 0.53 0.70 1 53 980 1032 53 0 0 1065 I3IXR8 Uncharacterized protein (Fragment) OS=Oreochromis niloticus PE=4 SV=1
69 : I3LVG5_PIG 0.48 0.71 6 53 399 446 48 0 0 481 I3LVG5 Pro-epidermal growth factor (Fragment) OS=Sus scrofa GN=EGF PE=4 SV=1
70 : K9INK7_DESRO 0.47 0.64 1 47 943 978 47 1 11 1057 K9INK7 Putative pro-epidermal growth factor OS=Desmodus rotundus PE=2 SV=1
71 : K9INM6_DESRO 0.47 0.64 1 47 965 1000 47 1 11 1079 K9INM6 Putative pro-epidermal growth factor OS=Desmodus rotundus PE=2 SV=1
72 : A7STC0_NEMVE 0.43 0.59 1 46 198 240 46 2 3 386 A7STC0 Predicted protein OS=Nematostella vectensis GN=v1g130899 PE=4 SV=1
73 : Q91MZ0_LSDV 0.43 0.50 14 47 46 85 40 1 6 89 Q91MZ0 LSDV016 EGF-like growth factor OS=Lumpy skin disease virus NI-2490 GN=LSDV016 PE=4 SV=1
74 : R4WQY7_9HEMI 0.41 0.57 1 47 89 137 49 1 2 240 R4WQY7 Uncharacterized protein OS=Riptortus pedestris PE=2 SV=1
75 : C3Z2S3_BRAFL 0.40 0.57 3 46 31 77 47 1 3 288 C3Z2S3 Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_118182 PE=4 SV=1
76 : G1KW09_ANOCA 0.40 0.56 1 48 53 99 48 1 1 153 G1KW09 Uncharacterized protein OS=Anolis carolinensis PE=4 SV=1
77 : K1PJ78_CRAGI 0.40 0.55 6 52 214 260 47 0 0 585 K1PJ78 Pro-neuregulin-2, membrane-bound isoform OS=Crassostrea gigas GN=CGI_10026269 PE=4 SV=1
78 : A7SHT9_NEMVE 0.39 0.52 3 48 116 161 46 0 0 388 A7SHT9 Predicted protein OS=Nematostella vectensis GN=v1g212493 PE=4 SV=1
79 : B2CWB6_9POXV 0.39 0.50 6 47 37 81 46 2 5 85 B2CWB6 MGF OS=Myxoma virus GN=m010L PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A N 0 0 155 56 25 NNNNNNNNNNNN NNNNNNNNNNNNN NNNNNNNNN NNH NN N SD NNNDDN NN N
2 2 A S E -A 23 0A 82 57 44 SSSSSSSSSSSS SSSSSSSSSSTTS GSGGSSGSS VGI SS D S SG GGGGGG SA T
3 3 A Y E -A 22 0A 126 59 72 YYYYNIIINYFN DDDDDDDDYDYYY YYYYYYFDD YYY YY F Y YF VVVFFV AV Y
4 4 A P + 0 0 94 65 80 PPPPTTTTTSPPPSSSSSSSSPSRRPSRQRRQQLSSSSSP QQ RRRV Q PV QQQMMQ QE R
5 5 A G + 0 0 23 67 57 GGGGGGGGGEGGGGGGGEGEEGEEEEEEEEEEEEGGGEEG EE EEEG EEEEG SSSGGS SS E
6 6 A a - 0 0 62 79 7 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
7 7 A P > - 0 0 51 79 30 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPSSPPPPPPPSPPPPPPPPPPPPPPPPNP
8 8 A S T 3 S+ 0 0 115 79 75 SSSEPPPPPPPPPLLLLLLLLPLPPPLSPSSPPPPPPPPPPPQQPPLLLPSQPPPSSSSSSSSSPPSAGP
9 9 A S T 3 S+ 0 0 96 79 43 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSAAAAASTSSSESSTTTTSSTPSSTPS
10 10 A Y S < S- 0 0 91 79 51 YYYHYYYYYHHHHHHHHHHHHHHHHHHYYYYYYYHHHHHHYYYYYYYYYYHYYYYYHHHHHHHHYHHHHH
11 11 A D S S+ 0 0 162 79 34 DDDDDDDDDDDDDDDDDDDDDDDAAADDDDDDDDSSSSADDDDDDDDDDEQDDDEEEDDDDDDDDDEEDA
12 12 A G + 0 0 57 79 45 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGSSGGSSGGGSNGSSGSANSSSSSSSASTGG
13 13 A Y + 0 0 51 78 7 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFY
14 14 A b S S- 0 0 28 80 4 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCRC
15 15 A L + 0 0 77 80 1 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
16 16 A N S S- 0 0 107 80 53 NNNHNNNNNHHHHHHHHHHHHHHHHHHYYYYYYHHHHHHNHHHHHHNNNHYHHHHHYYYYYHHYHYYYHH
17 17 A G S S+ 0 0 46 80 39 GGGGGGGGGGGGGDDDDDDDDGDGGGDNNNNNNNDDDGGDGGGGGGGGGGGGGGGGQEDDDGGDGHQHHG
18 18 A G - 0 0 30 80 5 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNG
19 19 A V E - B 0 32A 94 79 37 VVVVVVVVVVVVVVVVVVVVVAVVVVVVVVVVVVVVVMMLVVKKVVVVVVVKVVVVVIVVVVVVTVVVTV
20 20 A a E + B 0 31A 30 80 5 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCLC
21 21 A M E - B 0 30A 56 80 64 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMNNVVNNIIINFVIIRIFFFFFIIFIVFYMM
22 22 A H E -AB 3 29A 14 80 45 HHHYYYYYYYYYHYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYHHHYYYYYYYYYYYYYYYYYYYLY
23 23 A I E -A 2 0A 39 80 47 IIIVVVVVVIIVIIIIIIIIIVIIIIIIIIIIIIIIIIIIVVLLVVFFFVFLVVIVYFFFFVVFVFYFLI
24 24 A E S S+ 0 0 104 80 71 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEESSVVSSPPPSPVSSASPPPPPSSPSPPPQE
25 25 A S S S+ 0 0 101 79 76 SSSASSSSSAAASAPAAAAAATASSAAAAAAAAATTTAAQDDQQDDEEEDEQEEVIEEDDDIIEEEEERS
26 26 A L S S- 0 0 154 80 39 LLLVVVVVVVVVLLLLLLLLLVLVVVLVVVVVVVVVVVVVLLVVLLLLLLMVLLLLMVMMMLLMLLMLIV
27 27 A D S S+ 0 0 122 80 45 DDDDDDDDDDDDNDDDDDDDDDDDDDDDDDDDDDDDDDDDQQDDQQKKKQDDEEGQEDEEEQQEEKEDMD
28 28 A S - 0 0 53 80 80 SSSSRRRRRSNSTKKKKKKKKTKTTNKRRRRRRRKKKRRQDDTTDDDDDDSTNNSDASSSSDDSNISSET
29 29 A Y E +B 22 0A 41 80 30 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYHHYYYYYYYHYYYYYYYYYYYYYYYYYY
30 30 A T E -B 21 0A 25 80 42 TTTAVVVVVAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGAAAAAAAAAAAAAAAAAAASA
31 31 A b E -B 20 0A 3 77 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC.
32 32 A N E +B 19 0A 86 77 46 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKNNNNNNRRRNNNKKNTNNNNNTTNKNNNF.
33 33 A c - 0 0 21 78 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC.
34 34 A V > - 0 0 78 78 41 VVVVVVVVVVVIVVVVVVVVVVVVVIVVVVVVVVVVVVVIVVVVVVVVVVVVMMAVVVVVVVVVMIAVV.
35 35 A I T 3 S+ 0 0 82 78 73 IIIIIIIIIFVVIVVVVVVVVVVVVVVFFFFFFFVVVVVVTTVVTTAAATAVVVVTSALLLTTLVRSSF.
36 36 A G T 3 S+ 0 0 14 78 10 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG.
37 37 A Y < + 0 0 80 78 6 YYYYYYYYYYYYYYYYYYYYYFYFFYYYYYYYYYYYYYYFYYYYYYYYYYYYYYYYYYYYYYYYYYYYY.
38 38 A S + 0 0 25 78 48 SSSVIIIIIVVVVIIIIIIIIVIVVVIVVVVVVVLLLVVIVVVVVVVVVVIVVVVVMIMMMVVMVLIMV.
39 39 A G S S- 0 0 42 78 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG.
40 40 A D S S- 0 0 67 78 22 DDDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEE.
41 41 A R S S- 0 0 65 78 4 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR.
42 42 A c S S+ 0 0 1 80 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
43 43 A Q + 0 0 131 80 20 QQQQQQQQQQQQEQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQK
44 44 A T - 0 0 57 80 71 TTTHHHHHHHHHHYYYYYYYYHYYYHYHHHHHHHYYYHHHFFHHFFFFFFFHHHHFFFFFFFFFHFFFHT
45 45 A R + 0 0 193 80 72 RRRRRRRRRRRRQRRRRRQRRRRRRRRRRRRRRRQQQRRRSSQQSSDDDSSQSSLSSSSSSSSSSSSSRQ
46 46 A D S S- 0 0 74 80 21 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDK
47 47 A L + 0 0 112 78 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
48 48 A R - 0 0 169 73 57 RRRRRRRRRKKRKKKKKKKKKKKRRKKKKKKKKRKKKKKREERREEKKKEEREEQEEEEEEEEEEEEEK
49 49 A W + 0 0 201 70 17 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWSSSWW WWLWWWWWWWWWWWWWW
50 50 A W + 0 0 82 62 14 WWWWWWWWWWWW WWWWWWWWWWWWWW WWWWWWWWWWWWWWWWW WWRWLWWWWWWWWLGLW
51 51 A E - 0 0 77 62 21 EEEEKKKKKEEE EEEEEEEEEEEEEE EEEEEKEEEEEEEEEED EEHEEDEEEDDEEEEEE
52 52 A L 0 0 137 51 8 LLLLLLLLLLLL LLLLLLLLLLLLLL LLLLLL LL QL MM LLLLL LMLLLL
53 53 A R 0 0 289 50 29 RRRRRRRRRRRR RRRRRRRRRRRRRR RRRRRR RR RQ RR RQQQQ QRQQQR
## ALIGNMENTS 71 - 79
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A N 0 0 155 56 25 NE H K
2 2 A S E -A 23 0A 82 57 44 TS T S
3 3 A Y E -A 22 0A 126 59 72 YF YYL Y
4 4 A P + 0 0 94 65 80 RD PDQ S
5 5 A G + 0 0 23 67 57 EY CDA I
6 6 A a - 0 0 62 79 7 CC TCCCAC
7 7 A P > - 0 0 51 79 30 PK APLKTN
8 8 A S T 3 S+ 0 0 115 79 75 PA EEERRD
9 9 A S T 3 S+ 0 0 96 79 43 SS HSEECD
10 10 A Y S < S- 0 0 91 79 51 HG TYHDYY
11 11 A D S S+ 0 0 162 79 34 AD ASNQDK
12 12 A G + 0 0 57 79 45 GP YGNGMN
13 13 A Y + 0 0 51 78 7 Y. YYYYPY
14 14 A b S S- 0 0 28 80 4 CCCCCCCCC
15 15 A L + 0 0 77 80 1 LLLLLILLL
16 16 A N S S- 0 0 107 80 53 HNNNNNNNN
17 17 A G S S+ 0 0 46 80 39 GGGGGGGGN
18 18 A G - 0 0 30 80 5 GGGGGNGGG
19 19 A V E - B 0 32A 94 79 37 VSTTT.VTT
20 20 A a E + B 0 31A 30 80 5 CCCCCCCCC
21 21 A M E - B 0 30A 56 80 64 MTYFTIRQF
22 22 A H E -AB 3 29A 14 80 45 YNKAIFIAT
23 23 A I E -A 2 0A 39 80 47 IGTILHISV
24 24 A E S S+ 0 0 104 80 71 EATKPTKYA
25 25 A S S S+ 0 0 101 79 76 SSFIGEDH.
26 26 A L S S- 0 0 154 80 39 VNIGLLLDL
27 27 A D S S+ 0 0 122 80 45 DYLDPKDDN
28 28 A S - 0 0 53 80 80 TISTEMINN
29 29 A Y E +B 22 0A 41 80 30 YCYLDPKYV
30 30 A T E -B 21 0A 25 80 42 ATniqTSSs
31 31 A b E -B 20 0A 3 77 0 ..cccCCCc
32 32 A N E +B 19 0A 86 77 46 ..RDSRATA
33 33 A c - 0 0 21 78 0 .CCCCCCCC
34 34 A V > - 0 0 78 78 41 .RKAPIRRH
35 35 A I T 3 S+ 0 0 82 78 73 .PLDSTRPI
36 36 A G T 3 S+ 0 0 14 78 10 .GGGDGNEN
37 37 A Y < + 0 0 80 78 6 .YYYKYFYY
38 38 A S + 0 0 25 78 48 .REIVSTPV
39 39 A G S S- 0 0 42 78 0 .GGGGGGGG
40 40 A D S S- 0 0 67 78 22 .NVEEETRS
41 41 A R S S- 0 0 65 78 4 .RRRRRRNR
42 42 A c S S+ 0 0 1 80 0 CCCCCCCCC
43 43 A Q + 0 0 131 80 20 KEHDEEADQ
44 44 A T - 0 0 57 80 71 TALYNNIEF
45 45 A R + 0 0 193 80 72 QRRKSLPAI
46 46 A D S S- 0 0 74 80 21 KDSDNMNGN
47 47 A L + 0 0 112 78 5 L LL LPLL
48 48 A R - 0 0 169 73 57 RNR
49 49 A W + 0 0 201 70 17 I
50 50 A W + 0 0 82 62 14 R
51 51 A E - 0 0 77 62 21 E
52 52 A L 0 0 137 51 8 F
53 53 A R 0 0 289 50 29
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 0 0 0 2 0 0 4 0 2 0 2 86 5 56 0 0 0.624 20 0.74
2 2 A 2 0 2 0 0 0 0 21 2 0 63 9 0 0 0 0 0 0 0 2 57 0 0 1.115 37 0.55
3 3 A 8 2 5 0 12 0 47 0 2 0 0 0 0 0 0 0 0 0 5 19 59 0 0 1.570 52 0.28
4 4 A 3 2 0 3 0 0 0 0 0 20 26 8 0 0 15 0 18 2 0 3 65 0 0 1.920 64 0.20
5 5 A 0 0 1 0 0 0 1 40 1 0 9 0 1 0 0 0 0 43 0 1 67 0 0 1.259 42 0.42
6 6 A 0 0 0 0 0 0 0 0 1 0 0 1 97 0 0 0 0 0 0 0 79 0 0 0.136 4 0.93
7 7 A 0 1 0 0 0 0 0 0 1 87 4 1 0 0 0 3 0 0 3 0 79 0 0 0.594 19 0.70
8 8 A 0 16 0 0 0 0 0 1 3 44 23 0 0 0 3 0 4 5 0 1 79 0 0 1.567 52 0.25
9 9 A 0 0 0 0 0 0 0 0 6 3 75 9 1 1 0 0 0 4 0 1 79 0 0 0.991 33 0.57
10 10 A 0 0 0 0 0 0 44 1 0 0 0 1 0 52 0 0 0 0 0 1 79 0 0 0.867 28 0.48
11 11 A 0 0 0 0 0 0 0 0 9 0 6 0 0 0 0 1 3 8 1 72 79 0 0 1.024 34 0.65
12 12 A 0 0 0 1 0 0 1 67 3 1 20 1 0 0 0 0 0 0 5 0 79 1 0 1.057 35 0.54
13 13 A 0 0 0 0 1 0 97 0 0 1 0 0 0 0 0 0 0 0 0 0 78 0 0 0.137 4 0.93
14 14 A 0 0 0 0 0 0 0 0 0 0 0 0 99 0 1 0 0 0 0 0 80 0 0 0.067 2 0.96
15 15 A 0 99 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 80 0 0 0.067 2 0.98
16 16 A 0 0 0 0 0 0 20 0 0 0 0 0 0 54 0 0 0 0 26 0 80 0 0 1.007 33 0.47
17 17 A 0 0 0 0 0 0 0 60 0 0 0 0 0 4 0 0 3 1 10 22 80 0 0 1.143 38 0.60
18 18 A 0 0 0 0 0 0 0 98 0 0 0 0 0 0 0 0 0 0 3 0 80 1 0 0.117 3 0.95
19 19 A 80 1 1 3 0 0 0 0 1 0 1 9 0 0 0 4 0 0 0 0 79 0 0 0.834 27 0.62
20 20 A 0 1 0 0 0 0 0 0 0 0 0 0 99 0 0 0 0 0 0 0 80 0 0 0.067 2 0.95
21 21 A 5 0 13 55 13 0 3 0 0 0 0 3 0 0 3 0 1 0 6 0 80 0 0 1.503 50 0.36
22 22 A 0 1 3 0 1 0 79 0 3 0 0 1 0 10 0 1 0 0 1 0 80 0 0 0.877 29 0.55
23 23 A 25 6 47 0 14 0 3 1 0 0 1 1 0 1 0 0 0 0 0 0 80 0 0 1.458 48 0.53
24 24 A 4 0 0 0 0 0 1 0 4 17 14 3 0 0 0 3 1 54 0 0 80 1 0 1.452 48 0.28
25 25 A 1 0 5 0 1 0 0 1 29 1 19 5 0 1 1 0 5 18 0 11 79 0 0 2.014 67 0.23
26 26 A 40 45 3 9 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 1 80 0 0 1.196 39 0.60
27 27 A 0 1 0 1 0 0 1 1 0 1 0 0 0 0 0 6 10 11 3 64 80 0 0 1.302 43 0.54
28 28 A 0 0 4 1 0 0 0 0 1 0 21 13 0 0 17 16 1 3 9 14 80 0 0 2.055 68 0.20
29 29 A 1 1 0 0 0 0 89 0 0 1 0 0 1 4 0 1 0 0 0 1 80 0 0 0.558 18 0.69
30 30 A 6 0 1 0 0 0 0 4 74 0 5 8 0 0 0 0 1 0 1 0 80 3 4 1.029 34 0.57
31 31 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 77 0 0 0.000 0 1.00
32 32 A 0 0 0 0 1 0 0 0 3 0 1 5 0 0 6 5 0 0 77 1 77 0 0 0.953 31 0.54
33 33 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
34 34 A 78 0 6 4 0 0 0 0 4 1 0 0 0 1 4 1 0 0 0 0 78 0 0 0.912 30 0.59
35 35 A 37 6 15 0 12 0 0 0 6 3 5 12 0 0 3 0 0 0 0 1 78 0 0 1.902 63 0.26
36 36 A 0 0 0 0 0 0 0 95 0 0 0 0 0 0 0 0 0 1 3 1 78 0 0 0.256 8 0.89
37 37 A 0 0 0 0 6 0 92 0 0 0 0 0 0 0 0 1 0 0 0 0 78 0 0 0.306 10 0.94
38 38 A 50 5 26 8 0 0 0 0 0 1 6 1 0 0 1 0 0 1 0 0 78 0 0 1.445 48 0.52
39 39 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 78 0 0 0.000 0 1.00
40 40 A 1 0 0 0 0 0 0 0 0 0 1 1 0 0 1 0 0 87 1 6 78 0 0 0.575 19 0.78
41 41 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 99 0 0 0 1 0 78 0 0 0.069 2 0.96
42 42 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 80 0 0 0.000 0 1.00
43 43 A 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 3 88 5 0 3 80 0 0 0.561 18 0.80
44 44 A 0 1 1 0 28 0 20 0 1 0 0 8 0 38 0 0 0 1 3 0 80 0 0 1.550 51 0.28
45 45 A 0 3 1 0 0 0 0 0 1 1 28 0 0 0 49 1 13 0 0 4 80 0 0 1.400 46 0.28
46 46 A 0 0 0 1 0 0 0 1 0 0 1 0 0 0 0 3 0 1 4 89 80 0 0 0.540 18 0.78
47 47 A 0 99 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 78 0 0 0.069 2 0.95
48 48 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 27 41 1 29 1 0 73 0 0 1.196 39 0.43
49 49 A 0 1 1 0 0 93 0 0 0 0 4 0 0 0 0 0 0 0 0 0 70 0 0 0.325 10 0.82
50 50 A 0 5 0 0 0 90 0 2 0 0 0 0 0 0 3 0 0 0 0 0 62 0 0 0.416 13 0.86
51 51 A 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 10 0 82 0 6 62 0 0 0.630 21 0.78
52 52 A 0 90 0 6 2 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 51 0 0 0.414 13 0.91
53 53 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 82 0 18 0 0 0 50 0 0 0.471 15 0.70
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
73 18 63 6 nKKPLMFc
74 31 119 2 iPAc
75 29 59 3 qAIRc
79 25 61 4 sLNPFc
//