Complete list of 2evn.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2EVN
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-18
HEADER STRUCTURAL GENOMICS, UNKNOWN FUNCTION 31-OCT-05 2EVN
COMPND MOL_ID: 1; MOLECULE: PROTEIN AT1G77540; CHAIN: A; ENGINEERED: YES
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: ARABIDOPSIS THALIANA; ORGANISM_COMMON:
AUTHOR R.C.TYLER,S.SINGH,M.TONELLI,M.S.MIN,J.L.MARKLEY,CENTER FOR EUKARYOTIC
DBREF 2EVN A 1 103 UNP Q9CAQ2 Y1754_ARATH 12 114
SEQLENGTH 103
NCHAIN 1 chain(s) in 2EVN data set
NALIGN 41
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the aligned protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637 (1983))
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the aligned sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the aligned sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
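
The THRESHOLD line in the header gives the identity cutoff as a function of L. A minimal Python sketch of that formula follows. The function name `hssp_threshold` is hypothetical, and treating L as the alignment length (the LALI column) is an assumption not stated in this file.

```python
def hssp_threshold(length: int) -> float:
    """Evaluate t(L) = 290.15 * L**-0.562 + 5 from the THRESHOLD header line.

    Assumption: `length` is the alignment length (LALI) and the result is a
    percentage of identical residues.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    return 290.15 * length ** -0.562 + 5.0


# Cutoff for an alignment spanning all 103 residues of 2EVN chain A.
cutoff = hssp_threshold(103)
print(f"t(103) = {cutoff:.2f} % identity")
```

Note that the %IDE column in the PROTEINS table is given as a fraction (for example 0.31), so it would need to be multiplied by 100 before comparison with this cutoff.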
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : Y1754_ARATH 1XMT 1.00 1.00 1 103 12 114 103 0 0 114 Q9CAQ2 Acetyltransferase At1g77540 OS=Arabidopsis thaliana GN=At1g77540 PE=1 SV=2
2 : D7KUK3_ARALL 0.96 0.99 1 103 1 103 103 0 0 103 D7KUK3 Nmr solution structures Of At1g77540 OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_339915 PE=4 SV=1
3 : V4JQ70_THESL 0.93 0.96 1 97 13 109 97 0 0 116 V4JQ70 Uncharacterized protein OS=Thellungiella salsuginea GN=EUTSA_v10019334mg PE=4 SV=1
4 : M4CHJ6_BRARP 0.86 0.95 2 103 11 112 102 0 0 112 M4CHJ6 Uncharacterized protein OS=Brassica rapa subsp. pekinensis GN=BRA003679 PE=4 SV=1
5 : R0ICP1_9BRAS 0.85 0.94 1 103 1 103 103 0 0 103 R0ICP1 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10021197mg PE=4 SV=1
6 : V4KTK7_THESL 0.84 0.96 1 97 1 97 97 0 0 106 V4KTK7 Uncharacterized protein OS=Thellungiella salsuginea GN=EUTSA_v10009164mg PE=4 SV=1
7 : Q9XHZ9_ARATH 0.82 0.95 2 80 6 84 79 0 0 95 Q9XHZ9 F8K7.21 protein OS=Arabidopsis thaliana GN=F8K7.21 PE=4 SV=1
8 : D7KKU3_ARALL 0.78 0.91 2 98 6 102 97 0 0 111 D7KKU3 Predicted protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_679950 PE=4 SV=1
9 : Q8H0Y9_ARATH 0.78 0.93 2 98 6 102 97 0 0 111 Q8H0Y9 Acyltransferase-like protein OS=Arabidopsis thaliana GN=At1g21770 PE=4 SV=1
10 : M4DN63_BRARP 0.77 0.89 2 100 3 101 99 0 0 107 M4DN63 Uncharacterized protein OS=Brassica rapa subsp. pekinensis GN=BRA017950 PE=4 SV=1
11 : M5XYY9_PRUPE 0.77 0.94 2 96 9 103 95 0 0 112 M5XYY9 Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa013628mg PE=4 SV=1
12 : Q8L8R8_ARATH 0.77 0.92 2 98 6 102 97 0 0 111 Q8L8R8 Putative uncharacterized protein OS=Arabidopsis thaliana PE=2 SV=1
13 : R0GRX4_9BRAS 0.77 0.90 2 101 6 105 100 0 0 110 R0GRX4 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10010702mg PE=4 SV=1
14 : V4T2S6_9ROSI 0.76 0.90 3 103 11 111 101 0 0 111 V4T2S6 Uncharacterized protein OS=Citrus clementina GN=CICLE_v10002899mg PE=4 SV=1
15 : B9RJ31_RICCO 0.71 0.90 2 103 11 112 102 0 0 112 B9RJ31 Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1030600 PE=4 SV=1
16 : K4CM04_SOLLC 0.70 0.87 1 103 1 103 103 0 0 103 K4CM04 Uncharacterized protein OS=Solanum lycopersicum GN=Solyc08g067800.1 PE=4 SV=1
17 : W9SD96_9ROSA 0.70 0.90 2 80 15 93 79 0 0 96 W9SD96 Uncharacterized protein OS=Morus notabilis GN=L484_025477 PE=4 SV=1
18 : V7B606_PHAVU 0.68 0.88 2 103 13 114 102 0 0 114 V7B606 Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G160300g PE=4 SV=1
19 : C6TD41_SOYBN 0.67 0.88 4 103 16 117 102 1 2 117 C6TD41 Uncharacterized protein OS=Glycine max PE=4 SV=1
20 : W1PLG7_AMBTC 0.67 0.87 2 94 4 96 93 0 0 143 W1PLG7 Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00017p00173900 PE=4 SV=1
21 : G7J502_MEDTR 0.62 0.87 1 103 5 107 103 0 0 107 G7J502 Uncharacterized protein OS=Medicago truncatula GN=MTR_3g088570 PE=4 SV=1
22 : A9NK54_PICSI 0.59 0.82 5 97 12 102 93 1 2 112 A9NK54 Putative uncharacterized protein OS=Picea sitchensis PE=4 SV=1
23 : S8CC93_9LAMI 0.58 0.78 3 97 6 99 95 1 1 100 S8CC93 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_10270 PE=4 SV=1
24 : G3MHC8_9ACAR 0.54 0.72 2 103 29 141 114 2 13 141 G3MHC8 Putative uncharacterized protein (Fragment) OS=Amblyomma maculatum PE=2 SV=1
25 : A9SLK2_PHYPA 0.53 0.66 8 90 1 95 95 2 12 95 A9SLK2 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_19709 PE=4 SV=1
26 : K3YAW2_SETIT 0.53 0.76 2 98 17 112 97 1 1 121 K3YAW2 Uncharacterized protein OS=Setaria italica GN=Si011354m.g PE=4 SV=1
27 : B6U6V0_MAIZE 0.50 0.74 2 98 16 117 102 1 5 126 B6U6V0 Putative uncharacterized protein OS=Zea mays PE=2 SV=1
28 : M0T7Z1_MUSAM 0.50 0.66 5 103 17 136 120 2 21 136 M0T7Z1 Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1
29 : W5BDG4_WHEAT 0.47 0.66 8 98 17 118 103 2 13 129 W5BDG4 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
30 : I1IXT4_BRADI 0.46 0.65 3 98 12 118 108 2 13 129 I1IXT4 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI5G10270 PE=4 SV=1
31 : C5Y8N3_SORBI 0.45 0.69 2 89 17 112 96 2 8 151 C5Y8N3 Putative uncharacterized protein Sb06g016940 OS=Sorghum bicolor GN=Sb06g016940 PE=4 SV=1
32 : M7ZA99_TRIUA 0.45 0.64 2 98 13 120 109 2 13 131 M7ZA99 Uncharacterized protein OS=Triticum urartu GN=TRIUR3_16292 PE=4 SV=1
33 : W5AXL0_WHEAT 0.45 0.64 2 98 13 120 109 2 13 131 W5AXL0 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
34 : F2E5D4_HORVD 0.43 0.62 2 98 12 119 109 2 13 130 F2E5D4 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
35 : M8C3N3_AEGTA 0.43 0.63 2 98 12 119 109 2 13 130 M8C3N3 Uncharacterized protein OS=Aegilops tauschii GN=F775_15752 PE=4 SV=1
36 : W5BRC4_WHEAT 0.43 0.63 2 98 49 156 109 2 13 167 W5BRC4 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
37 : J3LXZ3_ORYBR 0.41 0.65 3 97 12 122 111 1 16 130 J3LXZ3 Uncharacterized protein OS=Oryza brachyantha GN=OB04G20180 PE=4 SV=1
38 : Q01JR5_ORYSA 0.38 0.60 2 80 15 113 99 2 20 116 Q01JR5 OSIGBa0160I14.8 protein OS=Oryza sativa GN=OSIGBa0160I14.8 PE=4 SV=1
39 : Q7XQP7_ORYSJ 0.38 0.60 2 80 15 113 99 2 20 116 Q7XQP7 OSJNBa0084A10.8 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0084A10.8 PE=4 SV=1
40 : B4RG69_PHEZH 0.31 0.61 3 97 2 95 97 3 5 104 B4RG69 Acetyltransferase OS=Phenylobacterium zucineum (strain HLK1) GN=PHZ_c0779 PE=4 SV=1
41 : C1E233_MICSR 0.31 0.51 23 97 50 148 99 3 24 149 C1E233 Predicted protein OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_57093 PE=4 SV=1
## ALIGNMENTS 1 - 41
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A M 0 0 230 8 25 MMM MM M V
2 2 A A + 0 0 76 31 44 AAAATAAAATAAA AASG GG A PP PGGGGG AA
3 3 A T - 0 0 124 36 72 TTVTRTTMTEATKKATGG RS TV PP APAAAAAAAAS
4 4 A E - 0 0 76 37 48 EEEEEEEEENEEEEEEGKDEG TA EE AEAAAAAAEED
5 5 A P S S+ 0 0 118 39 77 PPPPAKKKKKAKKIAAASGNSRTS ATP ETEEQEEAEEA
6 6 A P S S- 0 0 19 39 69 PPPPPAPPPPPPPPPPPNNSPATP EEP DEDDDDDEEEI
7 7 A K - 0 0 115 39 74 KKKKKKKKKKKKKKKKKNKKMMTV SSV SSSSSSSEVVQ
8 8 A I - 0 0 47 41 10 IIIIIIIIIIIIIIIIIIIIIIIIVIIIVIIIIVVVIVVV
9 9 A V E -A 18 0A 60 41 20 VVVVEVVVVEVVVVVVAVVVVVVVVVVVVVVVVVVVLLLV
10 10 A W E -A 17 0A 119 41 7 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWK
11 11 A N E >> -A 16 0A 47 41 56 NNSNNNNNNNNNNNKSNNNNNNNNNRRNRRSRRRRRRRRN
12 12 A E T 45S+ 0 0 155 41 22 EEEAEEEEEEEEEEQEEEEEEPDEAEEKEEEEEEEEEEED
13 13 A G T 45S+ 0 0 60 41 56 GGGGGGGGGGAGGSNRAAAAGGEKEDDEEEDEEEEEEDDE
14 14 A K T 45S- 0 0 122 41 80 KKKKKRRRRRKRRKNVQLQETCDEKKKRAAKAAAAAAAAA
15 15 A R T <5 + 0 0 104 41 78 RRRHGHHHHHRHHRSGRQQKKKKGSEGEGRGGGGGGRRRG
16 16 A R E < -AB 11 27A 50 41 21 RRRRRRRRRRRRRRRRRRRRRRKKRRRRKRRKKKKKRRRQ
17 17 A F E -AB 10 26A 12 41 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
18 18 A E E -AB 9 25A 20 41 6 EEEEEEEEEEEEEEEEEEEEEEEEAEQEEEEEEEEEEEEE
19 19 A T > - 0 0 0 41 14 TTTTTTTTTTSTTTTTTTTSTTTTTTTTTTTTTTTTTTTV
20 20 A E T 3 S+ 0 0 81 41 61 EEEEEEDEDEEDEEEEEEQEEEAEEPPEPPPPPPPPPPPr
21 21 A D T 3 S- 0 0 49 41 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDd
22 22 A H S < S+ 0 0 81 41 82 HHHHHHHHHHQHHKKKKKKKKKKKGGGKGGGGGGGGGGGE
23 23 A E S S+ 0 0 125 42 32 EEEEEEEEEEKEEEEEKEEKKEEELEEEEREEEEEEGEETE
24 24 A A S S+ 0 0 0 42 0 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
25 25 A F E -BC 18 43A 52 42 3 FFFFYFFFFFYFFYYYYFFFYFYFYFFFFFFFFFFFYYYFY
26 26 A I E -BC 17 42A 0 42 34 IIIIIIIIIIIIIVLLLLVILLLLLLLLLLLLLLLLLLLAV
27 27 A E E +BC 16 40A 53 42 40 EEEDEEEEEEEEEEEEQEEQEEEQDEQQQQQQQQRRQQQEA
28 28 A Y E - C 0 39A 6 42 7 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYDYYFY
29 29 A K E - C 0 38A 64 42 75 KKKKKKKKKKVKKVVEAVAHVEEYVRRHRRRRRRRRRRRRE
30 30 A M E - C 0 37A 109 42 22 MMTLMMMMMMLMIVLLLLLLLLALMLLLLLLLLLLLLLLMI
31 31 A R E >>> + C 0 36A 114 42 79 RRRRRKKKKMRKERRRRRKRRRRRLPLRVVLVVVVVLLLVR
32 32 A N T 345S- 0 0 119 42 74 NNNNNNNNNNENNENDDEENDGGenSddvasvvaaaaaaQf
33 33 A N T 345S- 0 0 159 42 75 NNNNKNDDDNNDDNDGDKKSNEErePakpsppppppattGe
34 34 A G T <45S+ 0 0 12 42 51 GGGGGGGGGGGGGGGGGGgGGVKTpAAgAAaAAAAAAttGr
35 35 A K T <5S+ 0 0 120 30 70 KKKQKRKKKTKKKKKRKKmKK...e.Av..a.....Aaa.g
36 36 A V E - 0 0 21 42 0 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
46 46 A S G > S+ 0 0 93 42 71 SSSSSSPPPSSPPSRRPPPSPPPRVRRRGGRGGGGGGGGEE
47 47 A F G 3 S+ 0 0 151 42 18 FSSSSSSSSSSSSSSTSSSSSSSSSSSSSSSSSSSSSSSAS
48 48 A K G X S+ 0 0 31 42 16 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKFQ
49 49 A R T < S+ 0 0 130 42 41 RRRRRAAAAGRAARRRRRRRRRRRRRRRRRRRRRRRRRRER
50 50 A G T 3 S+ 0 0 78 42 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
51 51 A L S < S- 0 0 125 42 72 LLLLLLLLLLLLLLLLLLLQLMLLLQQMQRQQQQQQRRRKK
52 52 A G S >> S+ 0 0 21 42 3 GGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGG
53 53 A L H 3> S+ 0 0 11 42 7 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVI
54 54 A A H 3> S+ 0 0 1 42 0 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
55 55 A S H <> S+ 0 0 5 42 49 SSSSSSSSSSSSSSSSSSSSSAGNGAAAASAAAAAAAAASE
56 56 A H H X S+ 0 0 64 42 63 HHHLHHHHHHHHYHHHHHHHHQHHERRLRRRRRRRRRRRAI
57 57 A L H X S+ 0 0 0 42 3 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLI
58 58 A C H X S+ 0 0 0 42 20 CCCCCCCCCCCCCCCCCCCCTCCTCCCCCCCCCCCCCCCAV
59 59 A V H X S+ 0 0 23 42 80 VVVIVVVVVVVVVRVIVVVILSVVKDDDDDDDDDDDDDDRE
60 60 A A H X S+ 0 0 24 42 14 AAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAATE
61 61 A A H X S+ 0 0 0 42 0 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
62 62 A F H X S+ 0 0 0 42 6 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFLM
63 63 A E H X S+ 0 0 107 42 64 EEEEEEEEEENEENNSDHQNNQTNAAAAAADAAAAAAAAGE
64 64 A H H X S+ 0 0 60 42 14 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHYW
65 65 A A H <>S+ 0 0 0 42 0 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
66 66 A S H ><5S+ 0 0 39 42 69 SSSSSSSSSSKSSKKQKQQKTQRQRRRQRRRQQQRRRRRRS
67 67 A S H 3<5S+ 0 0 110 42 76 SSSSSSEEESSEESSSSSSLSQNAKGACRRGRRRRRREEEA
68 68 A H T 3<5S- 0 0 139 42 29 HHHHHRHHHRNHHHHHHHHHHHHHHRRHHHRHHHHHRRRHN
69 69 A S T < 5 + 0 0 115 42 37 SSSSSSSSSSSSSSSSSSSSSSSSKGGSGGGGGGGGGGGGS
70 70 A I < - 0 0 17 42 31 IIILFFFFFFIFFMMLLFLMLLIMLMMMMLMMMMMMMMMLL
71 71 A S E -d 36 0A 51 42 70 SSSSSSSSSSSSSSSSSSSSSSAVLRRLRRRRRRRRLRRKS
72 72 A I E -d 37 0A 1 42 11 IVVVVIIIIIVIIIVVVVIVVVVVVVVVVVVVVVVVVVVVV
73 73 A I E -d 38 0A 33 42 28 IIIIIVIIIVIIVIIIIIIIIIIIQVLIIILIIIIILLLID
74 74 A P E > +d 39 0A 7 42 5 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPS
75 75 A S T 3 S+ 0 0 79 42 33 SSSSSTTTTTTTTTSSTTTTTSTTTTTTTTTTATTTTTTTS
76 76 A C T >> + 0 0 4 42 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
77 77 A S H <> + 0 0 36 42 6 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTS
78 78 A Y H 3> S+ 0 0 118 42 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFY
79 79 A V H <>>S+ 0 0 0 42 19 VVVVVVVVVVVVVVVVIIVVIVIIIIIIIIIIIIIIIIIMV
80 80 A S H <5S+ 0 0 0 42 11 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSAN
81 81 A D H <5S+ 0 0 82 38 27 DEDDEE EEEDEDDDD DDEEEGDDDDDDDVDDDDDE GE
82 82 A T H X5S+ 0 0 51 37 5 TTTTTT TTTTTTTTT TTTTTTTTTTTTTSTTTTTT .T
83 83 A F H X5S+ 0 0 33 38 10 FFFFFF FFFFFFYFF FFFFFFFFYYYYYTYYYYYY YF
84 84 A L H 4< - 0 0 36 38 31 NNNNNN YNNNYNNNN NNNNNNNHNNNNNRNNNNNN HP
88 88 A P G > + 0 0 65 38 15 PPPPQP PPPPPPPEP PPPPPPPPPPPPPPPPPPPP PH
89 89 A S G 3 S+ 0 0 73 38 61 SSSTSS STSSTSTSS SSSSAESEAAAAAPAAAAAA En
90 90 A W G X> S+ 0 0 27 37 17 WWWWWW WWWWWWWWW WWWWMWWWLLWWW WWLWWW Ww
91 91 A K T <4 S+ 0 0 96 36 54 KKKKKK EQKNQQNNN NNNNKNS EKNNN NNHNNN HK
92 92 A P T 34 S+ 0 0 110 36 80 PPPPPH HRHTNHSSS SSTSASA EEFDE EEDDDE DH
93 93 A L T <4 S+ 0 0 17 36 23 LLLLLL LLLVLLIVI VVVVLLV LLLLL LLLLLL II
94 94 A I S < S- 0 0 20 36 14 IVVVVV VVVLVVIVV VVVVVIV VVVVL VVVVVV VM
95 95 A H S S- 0 0 64 35 62 HHYHHH HHYYHHYFH YY YNYY YDCYC YYYYYL HI
96 96 A S S S- 0 0 7 35 67 SSSSSS SSTSSSSSK ST KPKT KKNKK KKKKKT DR
97 97 A E S S- 0 0 30 34 49 EEEEEE EEE EEEEQ KE EQEE DDGAD AAAAAD AD
98 98 A V S S- 0 0 43 27 46 VV DD DDE DDDDD GG S E QQEDD DDDDD
99 99 A F S S+ 0 0 125 15 93 FF PL S DPIL GG G L P
100 100 A K + 0 0 95 15 16 KK KK K KRKK KK Q K K
101 101 A S S S+ 0 0 71 14 12 SS SS TSSS SS S S S
102 102 A S 0 0 54 13 57 SS SS SNH NN H S S
103 103 A I 0 0 203 13 11 II II III II I I M
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 13 0 0 88 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0.377 12 0.75
2 2 A 0 0 0 0 0 0 0 26 55 10 3 6 0 0 0 0 0 0 0 0 31 0 0 1.193 39 0.55
3 3 A 6 0 0 3 0 0 0 6 31 8 6 28 0 0 6 6 0 3 0 0 36 0 0 1.927 64 0.27
4 4 A 0 0 0 0 0 0 0 5 22 0 0 3 0 0 0 3 0 59 3 5 37 0 0 1.248 41 0.52
5 5 A 0 0 3 0 0 0 0 3 21 15 8 8 0 0 3 18 3 18 3 0 39 0 0 2.094 69 0.22
6 6 A 0 0 3 0 0 0 0 0 5 51 3 3 0 0 0 0 0 15 5 15 39 0 0 1.505 50 0.30
7 7 A 10 0 0 5 0 0 0 0 0 0 23 3 0 0 0 51 3 3 3 0 39 0 0 1.443 48 0.26
8 8 A 20 0 80 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 41 0 0 0.494 16 0.90
9 9 A 85 7 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 5 0 0 41 0 0 0.564 18 0.79
10 10 A 0 0 0 0 0 98 0 0 0 0 0 0 0 0 0 2 0 0 0 0 41 0 0 0.115 3 0.92
11 11 A 0 0 0 0 0 0 0 0 0 0 7 0 0 0 29 2 0 0 61 0 41 0 0 0.943 31 0.44
12 12 A 0 0 0 0 0 0 0 0 5 2 0 0 0 0 0 2 2 83 0 5 41 0 0 0.722 24 0.78
13 13 A 0 0 0 0 0 0 0 37 12 0 2 0 0 0 2 2 0 29 2 12 41 0 0 1.603 53 0.43
14 14 A 2 2 0 0 0 0 0 0 27 0 0 2 2 0 20 29 5 5 2 2 41 0 0 1.870 62 0.19
15 15 A 0 0 0 0 0 0 0 29 0 0 5 0 0 20 27 10 5 5 0 0 41 0 0 1.701 56 0.22
16 16 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 78 20 2 0 0 0 41 0 0 0.603 20 0.78
17 17 A 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 41 0 0 0.000 0 1.00
18 18 A 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 2 95 0 0 41 0 0 0.229 7 0.93
19 19 A 2 0 0 0 0 0 0 0 0 0 5 93 0 0 0 0 0 0 0 0 41 0 0 0.308 10 0.86
20 20 A 0 0 0 0 0 0 0 0 2 32 0 0 0 0 2 0 2 54 0 7 41 0 1 1.161 38 0.38
21 21 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 41 0 0 0.000 0 1.00
22 22 A 0 0 0 0 0 0 0 34 0 0 0 0 0 32 0 29 2 2 0 0 41 0 0 1.272 42 0.17
23 23 A 0 2 0 0 0 0 0 2 0 0 0 2 0 0 2 10 0 81 0 0 42 0 0 0.751 25 0.68
24 24 A 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
25 25 A 0 0 0 0 69 0 31 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.619 20 0.97
26 26 A 7 55 36 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.975 32 0.66
27 27 A 0 0 0 0 0 0 0 0 2 0 0 0 0 0 5 0 33 55 0 5 42 0 0 1.075 35 0.59
28 28 A 0 0 0 0 2 0 95 0 0 0 0 0 0 0 0 0 0 0 0 2 42 0 0 0.224 7 0.93
29 29 A 14 0 0 0 0 0 2 0 5 0 0 0 0 5 33 31 0 10 0 0 42 0 0 1.610 53 0.25
30 30 A 2 60 5 29 0 0 0 0 2 0 0 2 0 0 0 0 0 0 0 0 42 0 0 1.079 36 0.78
31 31 A 19 14 0 2 0 0 0 0 0 2 0 0 0 0 45 14 0 2 0 0 42 0 0 1.498 49 0.20
32 32 A 7 0 0 0 2 0 0 5 17 0 5 0 0 0 0 0 2 12 38 12 42 0 16 1.829 61 0.26
33 33 A 0 0 0 0 0 0 0 5 5 19 5 5 0 0 2 10 0 10 24 17 42 0 0 2.073 69 0.25
34 34 A 2 0 0 0 0 0 0 57 26 2 0 7 0 0 2 2 0 0 0 0 42 12 7 1.215 40 0.49
35 35 A 3 0 0 3 0 0 0 3 17 0 0 3 0 0 7 57 3 3 0 0 30 0 0 1.481 49 0.30
36 36 A 77 0 3 0 0 0 0 0 17 0 0 3 0 0 0 0 0 0 0 0 40 0 0 0.687 22 0.64
37 37 A 0 2 0 95 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.224 7 0.96
38 38 A 2 0 2 2 0 0 0 0 0 0 0 0 0 0 0 0 0 7 5 81 42 0 0 0.772 25 0.68
39 39 A 0 60 10 31 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.896 29 0.85
40 40 A 76 5 2 0 0 2 0 0 7 2 0 5 0 0 0 0 0 0 0 0 42 0 0 0.953 31 0.58
41 41 A 0 0 0 0 0 0 0 0 0 0 0 0 0 83 17 0 0 0 0 0 42 0 0 0.451 15 0.80
42 42 A 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
43 43 A 2 0 0 0 24 0 74 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.655 21 0.90
44 44 A 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
45 45 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
46 46 A 2 0 0 0 0 0 0 24 0 26 26 0 0 0 17 0 0 5 0 0 42 0 0 1.576 52 0.28
47 47 A 0 0 0 0 5 0 0 0 2 0 90 2 0 0 0 0 0 0 0 0 42 0 0 0.414 13 0.81
48 48 A 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 95 2 0 0 0 42 0 0 0.224 7 0.83
49 49 A 0 0 0 0 0 0 0 2 14 0 0 0 0 0 81 0 0 2 0 0 42 0 0 0.627 20 0.59
50 50 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
51 51 A 0 57 0 5 0 0 0 0 0 0 0 0 0 0 10 5 24 0 0 0 42 0 0 1.175 39 0.27
52 52 A 0 0 0 0 0 0 0 98 0 0 0 0 0 0 0 0 0 0 2 0 42 0 0 0.113 3 0.96
53 53 A 2 95 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.224 7 0.93
54 54 A 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
55 55 A 0 0 0 0 0 0 0 5 33 0 57 0 0 0 0 0 0 2 2 0 42 0 0 1.009 33 0.51
56 56 A 0 5 2 0 0 0 2 0 2 0 0 0 0 52 31 0 2 2 0 0 42 0 0 1.292 43 0.36
57 57 A 0 98 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.113 3 0.96
58 58 A 2 0 0 0 0 0 0 0 2 0 0 5 90 0 0 0 0 0 0 0 42 0 0 0.414 13 0.80
59 59 A 45 2 7 0 0 0 0 0 0 0 2 0 0 0 5 2 0 2 0 33 42 0 0 1.414 47 0.19
60 60 A 0 0 0 0 0 0 0 0 93 0 2 2 0 0 0 0 0 2 0 0 42 0 0 0.336 11 0.85
61 61 A 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
62 62 A 0 2 0 2 95 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.224 7 0.94
63 63 A 0 0 0 0 0 0 0 2 33 0 2 2 0 2 0 0 5 33 14 5 42 0 0 1.656 55 0.36
64 64 A 0 0 0 0 0 2 2 0 0 0 0 0 0 95 0 0 0 0 0 0 42 0 0 0.224 7 0.86
65 65 A 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
66 66 A 0 0 0 0 0 0 0 0 0 0 33 2 0 0 31 12 21 0 0 0 42 0 0 1.402 46 0.30
67 67 A 0 2 0 0 0 0 0 5 7 0 38 0 2 0 19 2 2 19 2 0 42 0 0 1.778 59 0.23
68 68 A 0 0 0 0 0 0 0 0 0 0 0 0 0 76 19 0 0 0 5 0 42 0 0 0.668 22 0.71
69 69 A 0 0 0 0 0 0 0 33 0 0 64 0 0 0 0 2 0 0 0 0 42 0 0 0.739 24 0.63
70 70 A 0 24 14 40 21 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 1.316 43 0.68
71 71 A 2 7 0 0 0 0 0 0 2 0 57 0 0 0 29 2 0 0 0 0 42 0 0 1.133 37 0.30
72 72 A 74 0 26 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.575 19 0.89
73 73 A 10 12 74 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 2 42 0 0 0.879 29 0.71
74 74 A 0 0 0 0 0 0 0 0 0 98 2 0 0 0 0 0 0 0 0 0 42 0 0 0.113 3 0.94
75 75 A 0 0 0 0 0 0 0 0 2 0 24 74 0 0 0 0 0 0 0 0 42 0 0 0.655 21 0.67
76 76 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 42 0 0 0.000 0 1.00
77 77 A 0 0 0 0 0 0 0 0 0 0 98 2 0 0 0 0 0 0 0 0 42 0 0 0.113 3 0.94
78 78 A 0 0 0 0 2 0 98 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.113 3 0.99
79 79 A 50 0 48 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 0 0 0.789 26 0.81
80 80 A 0 0 0 0 0 0 0 0 2 0 95 0 0 0 0 0 0 0 2 0 42 0 0 0.224 7 0.88
81 81 A 3 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 32 0 61 38 1 0 0.919 30 0.72
82 82 A 0 0 0 0 0 0 0 0 0 0 3 97 0 0 0 0 0 0 0 0 37 0 0 0.124 4 0.95
83 83 A 0 0 0 0 63 0 34 0 0 0 0 3 0 0 0 0 0 0 0 0 38 0 0 0.753 25 0.89
84 84 A 0 87 8 3 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 38 0 0 0.514 17 0.85
85 85 A 3 0 0 0 0 0 0 3 0 82 3 0 0 5 0 5 0 0 0 0 38 0 0 0.763 25 0.65
86 86 A 0 0 0 0 0 0 0 0 0 3 0 0 0 0 89 8 0 0 0 0 38 0 0 0.396 13 0.87
87 87 A 0 0 0 0 0 0 5 0 0 3 0 0 0 5 3 0 0 0 84 0 38 0 0 0.646 21 0.69
88 88 A 0 0 0 0 0 0 0 0 0 92 0 0 0 3 0 0 3 3 0 0 38 0 0 0.363 12 0.85
89 89 A 0 0 0 0 0 0 0 0 32 3 45 11 0 0 0 0 0 8 3 0 38 0 1 1.353 45 0.39
90 90 A 0 8 0 3 0 89 0 0 0 0 0 0 0 0 0 0 0 0 0 0 37 0 0 0.403 13 0.83
91 91 A 0 0 0 0 0 0 0 0 0 0 3 0 0 6 0 31 8 6 47 0 36 0 0 1.344 44 0.45
92 92 A 0 0 0 0 3 0 0 0 6 17 19 6 0 14 3 0 0 17 3 14 36 0 0 2.084 69 0.19
93 93 A 19 69 11 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 36 0 0 0.816 27 0.76
94 94 A 81 6 11 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 36 0 0 0.678 22 0.86
95 95 A 0 3 3 0 3 0 46 0 0 0 0 0 6 34 0 0 0 0 3 3 35 0 0 1.396 46 0.38
96 96 A 0 0 0 0 0 0 0 0 0 3 43 11 0 0 3 34 0 0 3 3 35 0 0 1.384 46 0.32
97 97 A 0 0 0 0 0 0 0 3 21 0 0 0 0 0 0 3 6 53 0 15 34 0 0 1.318 43 0.50
98 98 A 11 0 0 0 0 0 0 7 0 0 4 0 0 0 0 0 7 11 0 59 27 0 0 1.306 43 0.53
99 99 A 0 20 7 0 20 0 0 20 0 20 7 0 0 0 0 0 0 0 0 7 15 0 0 1.829 61 0.06
100 100 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 87 7 0 0 0 15 0 0 0.485 16 0.83
101 101 A 0 0 0 0 0 0 0 0 0 0 93 7 0 0 0 0 0 0 0 0 14 0 0 0.257 8 0.88
102 102 A 0 0 0 0 0 0 0 0 0 0 62 0 0 15 0 0 0 0 23 0 13 0 0 0.925 30 0.42
103 103 A 0 0 92 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0.271 9 0.88
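
The ENTROPY and RELENT columns above can be approximated from a single alignment column. The sketch below assumes a Shannon entropy with natural logarithm over observed residue frequencies, and a RELENT obtained by dividing by ln(20) and truncating to an integer. Both choices are inferred from the values in this file (they reproduce 0.377 and 12 at position 1), not taken from HSSP documentation, and the function name `column_entropy` is hypothetical.

```python
import math
from collections import Counter


def column_entropy(residues: str) -> tuple[float, int]:
    """Return (entropy, relative entropy 0-100) for one alignment column.

    Assumptions: natural-log Shannon entropy over observed frequencies;
    relative entropy is entropy / ln(20), scaled to 0-100 and truncated.
    """
    counts = Counter(residues)
    total = sum(counts.values())
    # p * ln(1/p) summed over the residue types present in the column.
    entropy = sum((n / total) * math.log(total / n) for n in counts.values())
    relent = int(100 * entropy / math.log(20))
    return entropy, relent


# Position 1: the test sequence plus seven aligned residues (7 x M, 1 x V).
entropy, relent = column_entropy("MMMMMMMV")
print(f"ENTROPY={entropy:.3f} RELENT={relent}")
```

Gaps and unaligned positions are simply left out of the input string here, which matches NOCC counting only the sequences that span the position.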
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
19 32 47 2 gKVm
24 32 60 12 eLVVADGEGNKKKr
25 26 26 10 nPAVGAGQSAAe
25 28 38 2 pAKe
27 32 47 5 dGRGAPa
28 29 45 19 dVGLVAAAANSSDNPEQEKRk
28 31 66 2 gAAv
29 26 42 12 vQPRASSGGGGASp
30 31 42 12 aLPRASSGGATPAs
31 32 48 7 sGHGGGAAp
31 34 57 1 aVa
32 32 44 12 vQPRASSVRGGASp
33 32 44 12 vQPRASSVRGGASp
34 32 43 12 aQPRASSGGGGASp
35 32 43 12 aQPRASSGRGGASp
36 32 80 12 aQPRASSGRGGASp
37 31 42 16 aQPRSASPSCGGGGGATa
38 32 46 18 aAAQPRSSSSGDGGGGGGAt
38 34 66 2 tPAa
39 32 46 18 aAAQPRSSSSGDGGGGGGAt
39 34 66 2 tPAa
40 19 20 2 rLGd
41 11 60 19 fFIDPYELEPPKRKANTVGVe
41 13 81 2 rSNg
41 68 138 3 nRQRw
//
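
Each record in the INSERTION LIST can be read with a short parser. The sketch below assumes one whitespace-separated record per line, with the inserted residues bracketed by one lower-case flanking residue on each side, as described in the NOTATION block. The names `Insertion` and `parse_insertion` are hypothetical, and continuation lines for very long insertions are not handled.

```python
from typing import NamedTuple


class Insertion(NamedTuple):
    ali_no: int    # AliNo: index of the aligned protein in the PROTEINS table
    ipos: int      # IPOS: position in the test sequence
    jpos: int      # JPOS: position in the aligned sequence
    length: int    # Len: number of inserted residues
    inserted: str  # inserted residues without the lower-case flanks


def parse_insertion(line: str) -> Insertion:
    """Parse one 'AliNo IPOS JPOS Len Sequence' record."""
    ali_no, ipos, jpos, length, seq = line.split()
    n = int(length)
    # The sequence field carries one flanking residue on each side.
    if len(seq) != n + 2:
        raise ValueError(f"expected {n + 2} characters, got {len(seq)}: {seq!r}")
    return Insertion(int(ali_no), int(ipos), int(jpos), n, seq[1:-1])


ins = parse_insertion("19 32 47 2 gKVm")
print(ins)
```

The length check is a cheap guard against records that were truncated or merged during extraction.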