Complet list of 2eyc hssp file
Complete list of 2eyc.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2EYC
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-19
HEADER PLANT PROTEIN 09-NOV-05 2EYC
COMPND MOL_ID: 1; MOLECULE: CRAMBIN; CHAIN: A; ENGINEERED: YES
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: CRAMBE HISPANICA SUBSP. ABYSSINICA; OR
AUTHOR H.C.AHN,J.L.MARKLEY
DBREF 2EYC A 1 46 UNP P01542 CRAM_CRAAB 1 46
SEQLENGTH 46
NCHAIN 1 chain(s) in 2EYC data set
NALIGN 88
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : CRAM_CRAAB 1CNR 1.00 1.00 1 46 1 46 46 0 0 46 P01542 Crambin OS=Crambe hispanica subsp. abyssinica GN=THI2 PE=1 SV=2
2 : Q9S979_CRAAB 0.84 0.93 3 46 9 52 44 0 0 118 Q9S979 Crambin=THIONIN variant THI2CA5 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
3 : Q9S976_CRAAB 0.57 0.82 2 45 26 69 44 0 0 134 Q9S976 Crambin=THIONIN variant THI2CA10 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
4 : Q43227_TULGE 0.56 0.78 2 46 14 58 45 0 0 112 Q43227 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-4 PE=2 SV=1
5 : R0GYM0_9BRAS 0.55 0.76 2 43 108 149 42 0 0 215 R0GYM0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10005712mg PE=4 SV=1
6 : Q8LT03_AVESA 0.53 0.76 2 46 30 74 45 0 0 137 Q8LT03 Leaf thionin Asthi1 OS=Avena sativa GN=Asthi1 PE=2 SV=1
7 : THN_DENCL 0.53 0.84 2 44 2 44 43 0 0 46 P01541 Denclatoxin-B OS=Dendrophthora clavata PE=1 SV=1
8 : Q9S975_CRAAB 0.52 0.82 2 45 22 65 44 0 0 130 Q9S975 Crambin=THIONIN variant THI2CA11 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
9 : Q8LT02_AVESA 0.51 0.71 2 46 30 74 45 0 0 137 Q8LT02 Leaf thionin Asthi2 OS=Avena sativa GN=Asthi2 PE=2 SV=1
10 : R0H6L0_9BRAS 0.50 0.72 2 46 26 71 46 1 1 135 R0H6L0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10018220mg PE=4 SV=1
11 : Q8LSZ9_AVESA 0.49 0.76 2 46 30 73 45 1 1 136 Q8LSZ9 Thionin Asthi5 OS=Avena sativa GN=Asthi5 PE=2 SV=1
12 : Q8LT01_AVESA 0.49 0.64 2 46 30 74 45 0 0 137 Q8LT01 Leaf thionin Asthi3 OS=Avena sativa GN=Asthi3 PE=2 SV=1
13 : Q9S9A2_VISAL 0.49 0.78 2 46 28 72 45 0 0 111 Q9S9A2 Viscotoxin A OS=Viscum album PE=4 SV=1
14 : R0F7R7_9BRAS 0.49 0.78 2 46 26 70 45 0 0 134 R0F7R7 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006002mg PE=4 SV=1
15 : R0I5Q0_9BRAS 0.49 0.76 2 38 26 62 37 0 0 133 R0I5Q0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014899mg PE=4 SV=1
16 : THN3_VISAL 1OKH 0.49 0.78 2 46 28 72 45 0 0 111 P01538 Viscotoxin-A3 OS=Viscum album GN=THI2.1 PE=1 SV=2
17 : Q41609_TULGE 0.48 0.70 2 44 20 63 44 1 1 124 Q41609 Thionin class 4 (Precursor) OS=Tulipa gesneriana GN=Thi4-1 PE=2 SV=1
18 : THN_PYRPU 0.48 0.74 2 46 2 47 46 1 1 47 P07504 Thionin OS=Pyrularia pubera GN=THI1 PE=1 SV=1
19 : Q38L62_SALMI 0.47 0.67 2 46 20 63 45 1 1 121 Q38L62 Thionin OS=Salvia miltiorrhiza PE=2 SV=1
20 : Q43225_TULGE 0.47 0.76 2 46 20 64 45 0 0 114 Q43225 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-2 PE=4 SV=1
21 : Q43226_TULGE 0.47 0.76 2 46 20 64 45 0 0 107 Q43226 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-3 PE=4 SV=1
22 : Q9S980_CRAAB 0.47 0.72 2 44 26 68 43 0 0 133 Q9S980 Crambin=THIONIN variant THI2CA3 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
23 : Q9S981_CRAAB 0.47 0.71 2 46 26 70 45 0 0 134 Q9S981 Crambin=THIONIN variant THI2CA2 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
24 : R0FRU0_9BRAS 0.47 0.80 2 46 26 70 45 0 0 134 R0FRU0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10018222mg PE=4 SV=1
25 : R0G868_9BRAS 0.47 0.78 2 46 24 68 45 0 0 132 R0G868 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014907mg PE=4 SV=1
26 : R0I1C8_9BRAS 0.47 0.78 2 46 24 68 45 0 0 132 R0I1C8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014906mg PE=4 SV=1
27 : THNA_PHOLI 0.47 0.76 2 46 2 46 45 0 0 46 P01540 Ligatoxin-A OS=Phoradendron liga PE=1 SV=1
28 : K3YJV8_SETIT 0.46 0.66 2 42 66 106 41 0 0 173 K3YJV8 Uncharacterized protein OS=Setaria italica GN=Si014527m.g PE=4 SV=1
29 : K3YK37_SETIT 0.46 0.66 2 42 35 75 41 0 0 142 K3YK37 Uncharacterized protein OS=Setaria italica GN=Si014606m.g PE=4 SV=1
30 : K3YK38_SETIT 0.46 0.66 2 42 35 75 41 0 0 142 K3YK38 Uncharacterized protein OS=Setaria italica GN=Si014607m.g PE=4 SV=1
31 : K3YK39_SETIT 0.46 0.68 2 42 35 75 41 0 0 142 K3YK39 Uncharacterized protein OS=Setaria italica GN=Si014608m.g PE=4 SV=1
32 : K3YK46_SETIT 0.46 0.68 2 42 35 75 41 0 0 140 K3YK46 Uncharacterized protein OS=Setaria italica GN=Si014608m.g PE=4 SV=1
33 : K3YK47_SETIT 0.46 0.66 2 42 35 75 41 0 0 140 K3YK47 Uncharacterized protein OS=Setaria italica GN=Si014607m.g PE=4 SV=1
34 : M0URZ0_HORVD 0.46 0.77 8 46 1 39 39 0 0 102 M0URZ0 Uncharacterized protein (Fragment) OS=Hordeum vulgare var. distichum PE=4 SV=1
35 : C9W327_HORVD 0.44 0.76 2 46 30 74 45 0 0 137 C9W327 Lemma thionin OS=Hordeum vulgare var. distichum GN=LemThio1 PE=2 SV=1
36 : D0VWT3_VISAL3C8P 0.44 0.76 2 46 2 46 45 0 0 46 D0VWT3 Viscotoxin A1 OS=Viscum album PE=1 SV=1
37 : F2D5I9_HORVD 0.44 0.76 2 46 30 74 45 0 0 137 F2D5I9 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
38 : F2D7C5_HORVD 0.44 0.76 2 46 30 74 45 0 0 137 F2D7C5 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
39 : I1Q2L0_ORYGL 0.44 0.69 2 37 29 64 36 0 0 150 I1Q2L0 Uncharacterized protein OS=Oryza glaberrima PE=4 SV=1
40 : Q43224_TULGE 0.44 0.67 2 46 20 64 45 0 0 121 Q43224 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-1 PE=2 SV=1
41 : Q9S9A0_VISAL 0.44 0.76 2 46 28 72 45 0 0 114 Q9S9A0 Thionin OS=Viscum album PE=4 SV=1
42 : R0FHE8_9BRAS 0.44 0.71 2 46 26 70 45 0 0 136 R0FHE8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10002233mg PE=4 SV=1
43 : R0GLD4_9BRAS 0.44 0.67 2 46 26 70 45 0 0 109 R0GLD4 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006071mg PE=4 SV=1
44 : R0GP19_9BRAS 0.44 0.73 2 46 26 70 45 0 0 134 R0GP19 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10012371mg PE=4 SV=1
45 : R0HES6_9BRAS 0.44 0.78 2 46 24 68 45 0 0 132 R0HES6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10024304mg PE=4 SV=1
46 : R0HUF3_9BRAS 0.44 0.72 2 37 26 61 36 0 0 133 R0HUF3 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10021136mg PE=4 SV=1
47 : THN5_HORVU 0.44 0.76 2 46 30 74 45 0 0 137 P09617 Leaf-specific thionin OS=Hordeum vulgare GN=THI1.5 PE=2 SV=2
48 : THN7_HORVU 0.44 0.76 2 46 30 74 45 0 0 137 Q42838 Thionin BTH7 OS=Hordeum vulgare PE=2 SV=1
49 : THNB_PHOLI 0.44 0.76 2 46 2 46 45 0 0 46 P59358 Ligatoxin-B OS=Phoradendron liga PE=1 SV=1
50 : THNB_VISAL 1JMP 0.44 0.76 2 46 8 52 45 0 0 103 P08943 Viscotoxin-B (Fragment) OS=Viscum album GN=THI2.2 PE=1 SV=2
51 : A2YDH3_ORYSI 0.43 0.65 2 38 29 65 37 0 0 135 A2YDH3 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_23164 PE=4 SV=1
52 : A2YDH5_ORYSI 0.43 0.65 2 38 29 65 37 0 0 135 A2YDH5 Thionin OS=Oryza sativa subsp. indica GN=OsI_23165 PE=2 SV=1
53 : B9FTI4_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 110 B9FTI4 Putative uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_21501 PE=4 SV=1
54 : B9FTI5_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 110 B9FTI5 Putative uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_21502 PE=4 SV=1
55 : C7J3B8_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 104 C7J3B8 Os06g0515301 protein OS=Oryza sativa subsp. japonica GN=Os06g0515301 PE=4 SV=1
56 : D2CVR6_BRANA 0.43 0.73 3 46 27 70 44 0 0 134 D2CVR6 Thionin OS=Brassica napus PE=2 SV=1
57 : D7L3I4_ARALL 0.43 0.73 3 42 30 69 40 0 0 89 D7L3I4 Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_899723 PE=4 SV=1
58 : I1H3Q0_BRADI 0.43 0.73 2 45 30 73 44 0 0 139 I1H3Q0 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57337 PE=4 SV=1
59 : M4C9D6_BRARP 0.43 0.73 3 46 27 70 44 0 0 134 M4C9D6 Thionin OS=Brassica rapa subsp. pekinensis GN=THI2 PE=4 SV=1
60 : Q5Z434_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 135 Q5Z434 Os06g0512700 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0061G23.50 PE=2 SV=1
61 : Q5Z4K0_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 135 Q5Z4K0 Os06g0513781 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0022O06.36 PE=2 SV=1
62 : Q5Z4S4_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 135 Q5Z4S4 Os06g0514100 protein OS=Oryza sativa subsp. japonica GN=P0597A07.39 PE=2 SV=1
63 : Q8LT04_ORYSJ 0.43 0.65 2 38 29 65 37 0 0 135 Q8LT04 Os06g0514800 protein OS=Oryza sativa subsp. japonica GN=Osthi1 PE=2 SV=1
64 : Q9S974_CRAAB 0.43 0.75 2 45 27 70 44 0 0 135 Q9S974 Crambin=THIONIN variant THI2CA12 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
65 : R0H2A6_9BRAS 0.43 0.73 2 38 26 62 37 0 0 136 R0H2A6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10005988mg PE=4 SV=1
66 : R0HVZ8_9BRAS 0.43 0.70 2 45 26 69 44 0 0 134 R0HVZ8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10016370mg PE=4 SV=1
67 : THN_BRARP 0.43 0.73 3 46 27 70 44 0 0 133 Q9SBK8 Thionin OS=Brassica rapa subsp. pekinensis GN=THI2 PE=2 SV=1
68 : F2D1X7_HORVD 0.42 0.73 2 46 30 74 45 0 0 137 F2D1X7 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
69 : I1H3Q6_BRADI 0.42 0.64 2 46 30 74 45 0 0 139 I1H3Q6 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57400 PE=4 SV=1
70 : M0WAI3_HORVD 0.42 0.73 2 46 30 74 45 0 0 94 M0WAI3 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
71 : M0WAI4_HORVD 0.42 0.73 2 46 30 74 45 0 0 85 M0WAI4 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
72 : Q8LT00_AVESA 0.42 0.71 2 46 30 74 45 0 0 142 Q8LT00 Thionin Asthi4 OS=Avena sativa GN=Asthi4 PE=2 SV=1
73 : R0F343_9BRAS 0.42 0.69 2 46 26 69 45 1 1 77 R0F343 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006928mg PE=4 SV=1
74 : R0F705_9BRAS 0.42 0.76 2 46 26 70 45 0 0 134 R0F705 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006003mg PE=4 SV=1
75 : R0G7T6_9BRAS 0.42 0.71 2 46 24 68 45 0 0 132 R0G7T6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014903mg PE=4 SV=1
76 : R0H0H7_9BRAS 0.42 0.69 2 46 26 69 45 1 1 133 R0H0H7 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006551mg PE=4 SV=1
77 : THNC_VISAL 3C8P 0.42 0.76 2 46 2 46 45 0 0 46 P83554 Viscotoxin-C1 OS=Viscum album PE=1 SV=1
78 : THNX_HORVU 0.42 0.76 2 46 30 74 45 0 0 137 Q8H0Q5 Probable leaf thionin OS=Hordeum vulgare PE=2 SV=1
79 : I1H3P2_BRADI 0.41 0.61 2 45 30 73 44 0 0 135 I1H3P2 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57285 PE=4 SV=1
80 : I1H3P3_BRADI 0.41 0.66 2 45 56 99 44 0 0 161 I1H3P3 Uncharacterized protein (Fragment) OS=Brachypodium distachyon GN=BRADI1G57290 PE=4 SV=1
81 : I1H3P4_BRADI 0.41 0.66 2 45 56 99 44 0 0 127 I1H3P4 Uncharacterized protein (Fragment) OS=Brachypodium distachyon GN=BRADI1G57290 PE=4 SV=1
82 : I1H3P5_BRADI 0.41 0.59 2 45 30 73 44 0 0 135 I1H3P5 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57296 PE=4 SV=1
83 : Q9S977_CRAAB 0.41 0.73 2 45 27 70 44 0 0 135 Q9S977 Crambin=THIONIN variant THI2CA9 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
84 : THN2_VISAL 1JMN 0.41 0.76 2 42 2 42 41 0 0 46 P32880 Viscotoxin-A2 OS=Viscum album GN=THI2.3 PE=1 SV=1
85 : I1H3P1_BRADI 0.40 0.67 2 46 30 74 45 0 0 135 I1H3P1 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57280 PE=4 SV=1
86 : M0WAF4_HORVD 0.40 0.71 2 46 30 74 45 0 0 137 M0WAF4 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
87 : Q9S9D7_HORVU 0.40 0.71 2 46 30 74 45 0 0 137 Q9S9D7 Thionin OS=Hordeum vulgare PE=4 SV=1
88 : THN6_HORVU 0.40 0.71 2 46 30 74 45 0 0 137 P09618 Leaf-specific thionin BTH6 OS=Hordeum vulgare PE=2 SV=3
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A T 0 0 80 2 0 T
2 2 A T E -A 34 0A 53 83 7 T SSSSSSSSSTSSSSSSSSSSSSSSSSSSSSS SSSSSSSSSSSSSSSSSSSSS S SSSSSSS SSS
3 3 A a E +A 33 0A 11 88 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
4 4 A b B -b 45 0B 4 88 3 CCCCCCCCCCCCCCCCFCCCCCCCCCCCCCCCC CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
5 5 A P S S+ 0 0 83 88 45 PPPPPKPPKPPKPPPPPRPRRPPPPPPPPPPPP KPKKPRRPPPPPKKPPPPPPPPPKPPPPPPPPPKKK
6 6 A S > - 0 0 42 88 60 SNSSNDTSDNSDNNSNSNSNNTTNSTSSSSSSS NSNNSTNSDNSSNNSNSSSSSRSTRSSSSTTSRTTT
7 7 A I H > S+ 0 0 119 88 34 ITITTITITRTDTTTTTTTTTMPTTTTTTTTTT TTTTTTTIKTTTTTTTTTTTTTETTTTTTKTSTTTT
8 8 A V H > S+ 0 0 110 89 58 VTTATMATTTSITTNTAWTTTAITTTTTTTTTTTTTTTIATTTATSTTTTTTTTTISMITTTTSTTIFTF
9 9 A A H > S+ 0 0 17 89 32 AAAAAAAAAAAAGAAGAAAAAAAAAAAAAAAAAGGGGGAAGSSPAAGGAGAAAAADSADAAAAAAADAAA
10 10 A R H X S+ 0 0 39 89 4 RRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
11 11 A S H X S+ 0 0 49 89 32 SSNNNNNNNNNNNNNNYNNNNIKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNENNNNNNNNNNNNN
12 12 A N H X S+ 0 0 64 89 68 NNTCCCQTCVCCIIIICCICCQTCIIIVVIVVICCICCVCCFCCIICCIIVVVVVINCIVVVVTVIICCC
13 13 A F H X S+ 0 0 14 89 1 FFYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFYYYYYYFYYYYYY
14 14 A N H X S+ 0 0 82 89 10 NNNNNNNNNNNNNNNNNNNNNNVNNNNNNNNNNNNNNNNNNKNNNNNNNNNNNNNNDNNNNNNDNNNNNN
15 15 A V H >< S+ 0 0 78 89 66 VVIVVVIVVVVVATAAAVTVVAVATTTTTTTTTVTTAASVASAGTTAATTSSSSSALVASSSSVTTATVT
16 16 A c H 3<>S+ 0 0 33 89 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
17 17 A R H 3<5S+ 0 0 122 89 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRCRRRRRRRRRRRRRRIRRRRRRRRRRRRR
18 18 A L T <<5S+ 0 0 130 89 41 LLLFLILLILLILLLLLLLLLALFFLLLLILLIFFLFFFLVILIFFFFLLFFFFFLSFLFFFFLLFLFFF
19 19 A P T 5S- 0 0 105 89 66 PPPPPPPPPGTPTPMTPPRPPLTPATTTTTTTTGATAAAGPVIPATAATGAAAAATYGTAAAATAGTATA
20 20 A G T 5S+ 0 0 55 89 5 GGGGGGGGGpGGGGGGggLGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
21 21 A T < - 0 0 44 88 57 TTTTTTTTTpTTATTAri.TTTSATSTAATAATGGSGGGTTFTTSGGGAGGGGGGAAGAGGGGTLVAGAG
22 22 A P >> - 0 0 57 89 44 PAPPPPPPPSSPPPSPPSPPPPTPPPSSSSSSSSSSSSSPPPYAPSSSSSSSSSSSSASSSSSSSPSSSS
23 23 A E H 3> S+ 0 0 64 89 49 EERRRRRRRPRTCRRRERARRRIRRRRRRRRRRRRRRRRQRITKRRRRRRRRRRRMQRMRRRRMRSMRRR
24 24 A A H 3> S+ 0 0 54 89 60 APPPPPPPPAPFPPPPTESPPPAPPPPPPPPPPPPEPPDTPGEGPPPPSEDDNDDTPPTDDNDGASTPLP
25 25 A L H <> S+ 0 0 108 89 70 LIVVTVVVVVRITTITIISVVVSVVVTTTTMMTVVTVVTLVKATVFVVVRTTTTTNAVNTTTTLVINVAV
26 26 A c H >X S+ 0 0 6 89 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
27 27 A A H 3X S+ 0 0 7 89 10 AAAAAAAAAAAAAAAAAAAAAAIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAAAAAAAVAAAA
28 28 A T H 3< S+ 0 0 122 89 71 TTTASTASTRSNKSSKAKDAAAKSSSSKKKKKKSTKTTKRSKKISSTTSSKKKKKNSRNKKKKANGNGNG
29 29 A Y H << S+ 0 0 200 89 73 YDLTLTLITLLMLLLLRKLTTLYLLLLLLLLLLAALAALTLLALLLAALLLLLLLLISLLLLLIRLLAIA
30 30 A T H < S- 0 0 36 89 36 TTSCCCSSCSCCSSSSCCSCCSSCSSSSSSSSSCCSCCSCCSCCSSCCSSSSSSSSTCSSSSSSSSSCCC
31 31 A G < + 0 0 54 89 25 GGGGGRGGRGGRGGGGGDGDDGGGGGGGGGGGGGGGGGGDDGGGGGGGGGGGGGGGGGGGGGGEGGGGGG
32 32 A b - 0 0 1 89 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
33 33 A I E -A 3 0A 92 88 48 IIIKKKKKKK.IKKKKKKKKKKIQKIKQQKQQKKKKKKKIKKKKKKKKKKKKKKKKVKKKKKKKRLKKTK
34 34 A I E +A 2 0A 75 89 37 IIIIIIIIIIKIIIIIIIVIIITINNIIIIIIIIIIIIIHIIILNIIIIIIIIIIIRIIIIIIILLIIII
35 35 A I - 0 0 42 89 23 IIQIVIILIIITIIVIIIIIILIVIIIIIIIIIIIIIIVIIIVIIVIIIIVVVVVVFIVVVVVLVIVITI
36 36 A P S S- 0 0 129 89 56 PPSTESSSSNVRSSNSSSDSSDSGSSSSSSSSSSSSSSDTSTGESDSSSSDDDDDSLKSDDDDSNGSSSS
37 37 A G S S+ 0 0 40 88 28 GGDGGGGVGGDRGSGGSGGSSVGGGGGGGGGGGGGAGGGTGGGGGGGGGAGGGGGGGGGGGGGVGGGGGG
38 38 A A - 0 0 73 87 68 AASTPNTTNSGNSGSSGTGGGTTSTTSNNDNNDPPSPP GSTSTT PPSSNNNNNTSSTNNNNTSSTSTS
39 39 A T - 0 0 143 75 56 TTTKTKGKKTTETR TNTTKKKQTKKTKKKKKKTTTTT NKTTTK TTTT TTTT K KTKTK
40 40 A a - 0 0 32 76 0 CCCCCCCCCCCCCC CCCCCCCCCCCCCCCCCCCCCCC CCCCCC CCCC CCCC C CCCCC
41 41 A P >> - 0 0 73 76 17 PPKPPPPPPPPPPP PPPPPPPPPPPBKKKKKKPPPPP PPPPPP PPDP PPPP P PPPPP
42 42 A G T 34 S+ 0 0 76 76 69 GGPPPKPSKNDNSN SPSTPPPNSNNSPPPPPPRRSRR RANSPN RRSS PPNP S NPSNS
43 43 A D T 34 S+ 0 0 82 68 36 DDPDGDGNDGGDDG DGDGGGDGGGGG DDNDD SDGGGG DDGD G GG N GGDDD
44 44 A Y T <4 S+ 0 0 109 67 16 YYYY YYLYYYYYY YYYWYYYYYYYW YYYYY HYYYYY YYWY Y FY L YYYLY
45 45 A A B < b 4 0B 65 64 47 AAPP P PPTSPPT P PTEE PTTPB PPPPP PPSTTT PPNP T PT P TTPRP
46 46 A N 0 0 87 54 54 NN K K KNKKKH K KNKK HQHHH KKKKK KRKHHH KKHK H H HKKK
## ALIGNMENTS 71 - 88
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A T 0 0 80 2 0
2 2 A T E -A 34 0A 53 83 7 SSSSSSSSSSSSSSSSSS
3 3 A a E +A 33 0A 11 88 0 CCCCCCCCCCCCCCCCCC
4 4 A b B -b 45 0B 4 88 3 CCCCCCCCCCCCYCCCCC
5 5 A P S S+ 0 0 83 88 45 KKPPPPPKKKKKPPKKKK
6 6 A S > - 0 0 42 88 60 TSTNSSNNNNNNTNNDDD
7 7 A I H > S+ 0 0 119 88 34 TTSTTTTTTTTTKTTTTT
8 8 A V H > S+ 0 0 110 89 58 FTTTTTTTSSSSSTSLLL
9 9 A A H > S+ 0 0 17 89 32 AAAGAAGGSSSSAGSAAA
10 10 A R H X S+ 0 0 39 89 4 RIRRRRRRRRRRRRRRRR
11 11 A S H X S+ 0 0 49 89 32 NNNNNNNNFLLLNNLNNN
12 12 A N H X S+ 0 0 64 89 68 CCICIVICCCCCTICCCC
13 13 A F H X S+ 0 0 14 89 1 YYYYYYYYYYYYFYYYYY
14 14 A N H X S+ 0 0 82 89 10 NNNNNNNNNNNNDNNNNN
15 15 A V H >< S+ 0 0 78 89 66 TVSVTVTAAAAAVTATTT
16 16 A c H 3<>S+ 0 0 33 89 0 CCCCCCCCCCCCCCCCCC
17 17 A R H 3<5S+ 0 0 122 89 8 RRRRRRRHRRRRRRRRRR
18 18 A L T <<5S+ 0 0 130 89 41 FLVLFAFFKKKKLFKFFF
19 19 A P T 5S- 0 0 105 89 66 AAVTGGAALAALTGAAAA
20 20 A G T 5S+ 0 0 55 89 5 GGGGGGGGGGGGGGGGGG
21 21 A T < - 0 0 44 88 57 GAGAGGGGGGGGTGGGGG
22 22 A P >> - 0 0 57 89 44 SPSPSSSSTSSTSSSSSS
23 23 A E H 3> S+ 0 0 64 89 49 RRRRRRRRRQQRMRQRRR
24 24 A A H 3> S+ 0 0 54 89 60 PPPPPPEPPGGLGEGPPP
25 25 A L H <> S+ 0 0 108 89 70 VVTIVTRVEAAELVAVVV
26 26 A c H >X S+ 0 0 6 89 0 CCCCCCCCCCCCCCCCCC
27 27 A A H 3X S+ 0 0 7 89 10 AAAASTAAAAAAAAAAAA
28 28 A T H 3< S+ 0 0 122 89 71 GGGNGSKTSSSSASSGGG
29 29 A Y H << S+ 0 0 200 89 73 APLGLVLARTTRILTAAA
30 30 A T H < S- 0 0 36 89 36 CCSCSSSCCCCCSSCCCC
31 31 A G < + 0 0 54 89 25 GGGGGGGGGGGGEGGRRR
32 32 A b - 0 0 1 89 0 CCCCCCCCCCCCCCCCCC
33 33 A I E -A 3 0A 92 88 48 KKKKKKKKIIIIKKIKKK
34 34 A I E +A 2 0A 75 89 37 ILIIIIIIHHHHIIHIII
35 35 A I - 0 0 42 89 23 ILVVIVIIIIIILIIIII
36 36 A P S S- 0 0 129 89 56 SDDNGDSSSDDSSSDSSS
37 37 A G S S+ 0 0 40 88 28 GVGSG.AGGGGGVAGGGG
38 38 A A - 0 0 73 87 68 STSGTKSPNKKNTSKPPP
39 39 A T - 0 0 143 75 56 KT.SKTTTRRRKKTRKKK
40 40 A a - 0 0 32 76 0 CCCCCCCCCCCCCCCCCC
41 41 A P >> - 0 0 73 76 17 PPPPPPPPPPPPPPPPPP
42 42 A G T 34 S+ 0 0 76 76 69 SSNNNPSRAAAASSASSS
43 43 A D T 34 S+ 0 0 82 68 36 DDGGGGDDDDDDN DDDD
44 44 A Y T <4 S+ 0 0 109 67 16 YWYYYYYYYYYYL YYYY
45 45 A A B < b 4 0B 65 64 47 PPPTTPPPPPPPP PPPP
46 46 A N 0 0 87 54 54 KKNHHHKK KKKK
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 2 0 0 0.000 0 1.00
2 2 A 0 0 0 0 0 0 0 0 0 0 96 4 0 0 0 0 0 0 0 0 83 0 0 0.155 5 0.92
3 3 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 88 0 0 0.000 0 1.00
4 4 A 0 0 0 0 1 0 1 0 0 0 0 0 98 0 0 0 0 0 0 0 88 0 0 0.124 4 0.96
5 5 A 0 0 0 0 0 0 0 0 0 68 0 0 0 0 6 26 0 0 0 0 88 0 0 0.775 25 0.54
6 6 A 0 0 0 0 0 0 0 0 0 0 42 16 0 0 3 0 0 0 31 8 88 0 0 1.336 44 0.39
7 7 A 0 0 7 1 0 0 0 0 0 1 2 82 0 0 1 3 0 1 0 1 88 0 0 0.803 26 0.65
8 8 A 2 3 7 2 3 1 0 0 7 0 11 62 0 0 0 0 0 0 1 0 89 0 0 1.407 46 0.42
9 9 A 0 0 0 0 0 0 0 17 70 1 9 0 0 0 0 0 0 0 0 3 89 0 0 0.933 31 0.67
10 10 A 0 0 1 0 0 0 0 0 0 0 0 0 0 0 98 1 0 0 0 0 89 0 0 0.123 4 0.96
11 11 A 0 4 1 0 1 0 1 0 0 0 3 0 0 0 0 1 0 1 87 0 89 0 0 0.631 21 0.68
12 12 A 19 0 26 0 1 0 0 0 0 0 0 6 42 0 0 0 2 0 4 0 89 0 0 1.468 48 0.31
13 13 A 0 0 0 0 7 0 93 0 0 0 0 0 0 0 0 0 0 0 0 0 89 0 0 0.247 8 0.99
14 14 A 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 94 3 89 0 0 0.270 9 0.90
15 15 A 27 1 2 0 0 0 0 1 24 0 13 31 0 0 0 0 0 0 0 0 89 0 0 1.514 50 0.33
16 16 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 89 0 0 0.000 0 1.00
17 17 A 0 0 1 0 0 0 0 0 0 0 0 0 1 1 97 0 0 0 0 0 89 0 0 0.184 6 0.91
18 18 A 2 43 8 0 38 0 0 0 2 0 1 0 0 0 0 6 0 0 0 0 89 0 0 1.314 43 0.59
19 19 A 2 3 1 1 0 0 1 10 34 21 0 25 0 0 1 0 0 0 0 0 89 0 0 1.675 55 0.34
20 20 A 0 1 0 0 0 0 0 98 0 1 0 0 0 0 0 0 0 0 0 0 89 1 3 0.123 4 0.95
21 21 A 1 1 1 0 1 0 0 41 17 1 5 31 0 0 1 0 0 0 0 0 88 0 0 1.476 49 0.43
22 22 A 0 0 0 0 0 0 1 0 3 31 61 3 0 0 0 0 0 0 0 0 89 0 0 0.946 31 0.55
23 23 A 0 0 2 6 0 0 0 0 1 1 1 2 1 0 74 1 6 4 0 0 89 0 0 1.107 36 0.51
24 24 A 0 2 0 0 1 0 0 8 6 56 3 6 0 0 0 0 0 7 2 9 89 0 0 1.581 52 0.40
25 25 A 38 6 8 2 1 0 0 0 7 0 2 26 0 0 3 1 0 2 3 0 89 0 0 1.846 61 0.30
26 26 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 89 0 0 0.000 0 1.00
27 27 A 1 0 1 0 0 0 0 0 94 0 2 1 0 0 0 0 0 0 0 0 89 0 0 0.291 9 0.89
28 28 A 0 0 1 0 0 0 0 11 9 0 26 13 0 0 3 27 0 0 8 1 89 0 0 1.851 61 0.29
29 29 A 1 53 6 1 0 0 3 1 16 1 1 10 0 0 4 1 0 0 0 1 89 0 0 1.628 54 0.26
30 30 A 0 0 0 0 0 0 0 0 0 0 54 4 42 0 0 0 0 0 0 0 89 0 0 0.837 27 0.64
31 31 A 0 0 0 0 0 0 0 85 0 0 0 0 0 0 7 0 0 2 0 6 89 0 0 0.564 18 0.75
32 32 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 89 1 0 0.000 0 1.00
33 33 A 1 1 15 0 0 0 0 0 0 0 0 1 0 0 1 75 6 0 0 0 88 0 0 0.865 28 0.52
34 34 A 1 4 81 0 0 0 0 0 0 0 0 1 0 7 1 1 0 0 3 0 89 0 0 0.809 26 0.63
35 35 A 25 6 65 0 1 0 0 0 0 0 0 2 0 0 0 0 1 0 0 0 89 0 0 0.972 32 0.76
36 36 A 1 1 0 0 0 0 0 4 0 3 56 3 0 0 1 1 0 2 4 21 89 1 0 1.448 48 0.43
37 37 A 6 0 0 0 0 0 0 80 5 0 6 1 0 0 1 0 0 0 0 2 88 0 0 0.836 27 0.71
38 38 A 0 0 0 0 0 0 0 9 3 13 25 22 0 0 0 5 0 0 21 2 87 1 0 1.831 61 0.32
39 39 A 0 0 0 0 0 0 0 1 0 0 1 48 0 0 7 37 1 1 3 0 75 0 0 1.228 40 0.43
40 40 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 76 0 0 0.000 0 1.00
41 41 A 0 0 0 0 0 0 0 0 0 88 0 0 0 0 0 9 0 0 0 1 76 0 0 0.388 12 0.83
42 42 A 0 0 0 0 0 0 0 4 8 26 28 1 0 0 11 3 0 0 18 1 76 0 0 1.793 59 0.31
43 43 A 0 0 0 0 0 0 0 41 0 1 1 0 0 0 0 0 0 0 6 50 68 0 0 1.003 33 0.63
44 44 A 0 6 0 0 1 6 85 0 0 0 0 0 0 1 0 0 0 0 0 0 67 0 0 0.600 20 0.84
45 45 A 0 0 0 0 0 0 0 0 5 63 3 22 0 0 2 0 0 3 2 0 64 0 0 1.116 37 0.52
46 46 A 0 0 0 0 0 0 0 0 0 0 0 0 0 28 2 57 2 0 11 0 54 0 0 1.066 35 0.46
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
10 20 45 1 pAp
17 20 39 1 gCr
18 20 21 1 gTi
//