Complet list of 1cxr hssp file
Complete list of 1cxr.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1CXR
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-10
HEADER PLANT PROTEIN 30-AUG-99 1CXR
COMPND MOL_ID: 1; MOLECULE: CRAMBIN; CHAIN: A; FRAGMENT: ISOFORM OF CRAMBIN (
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: CRAMBE HISPANICA SUBSP. ABYSSINICA; OR
AUTHOR Y.XU,J.WU,D.GORENSTEIN,W.BRAUN
DBREF 1CXR A 1 46 UNP P01542 CRAM_CRAAB 1 46
SEQLENGTH 46
NCHAIN 1 chain(s) in 1CXR data set
NALIGN 98
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : CRAM_CRAAB 1YV8 0.96 1.00 1 46 1 46 46 0 0 46 P01542 Crambin OS=Crambe hispanica subsp. abyssinica GN=THI2 PE=1 SV=2
2 : Q9S979_CRAAB 0.86 0.93 3 46 9 52 44 0 0 118 Q9S979 Crambin=THIONIN variant THI2CA5 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
3 : Q9S976_CRAAB 0.55 0.82 2 45 26 69 44 0 0 134 Q9S976 Crambin=THIONIN variant THI2CA10 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
4 : Q8LSZ9_AVESA 0.54 0.73 2 38 30 66 37 0 0 136 Q8LSZ9 Thionin Asthi5 OS=Avena sativa GN=Asthi5 PE=2 SV=1
5 : Q43227_TULGE 0.53 0.78 2 46 14 58 45 0 0 112 Q43227 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-4 PE=2 SV=1
6 : R0GYM0_9BRAS 0.52 0.76 2 43 108 149 42 0 0 215 R0GYM0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10005712mg PE=4 SV=1
7 : R0H6L0_9BRAS 0.52 0.72 2 46 26 71 46 1 1 135 R0H6L0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10018220mg PE=4 SV=1
8 : THN_PYRPU 0.52 0.74 2 46 2 47 46 1 1 47 P07504 Thionin OS=Pyrularia pubera GN=THI1 PE=1 SV=1
9 : Q8LT03_AVESA 0.51 0.76 2 46 30 74 45 0 0 137 Q8LT03 Leaf thionin Asthi1 OS=Avena sativa GN=Asthi1 PE=2 SV=1
10 : R0I5Q0_9BRAS 0.51 0.76 2 46 26 69 45 1 1 133 R0I5Q0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014899mg PE=4 SV=1
11 : THN_DENCL 0.51 0.84 2 44 2 44 43 0 0 46 P01541 Denclatoxin-B OS=Dendrophthora clavata PE=1 SV=1
12 : Q9S975_CRAAB 0.50 0.82 2 45 22 65 44 0 0 130 Q9S975 Crambin=THIONIN variant THI2CA11 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
13 : K3YJV8_SETIT 0.49 0.66 2 42 66 106 41 0 0 173 K3YJV8 Uncharacterized protein OS=Setaria italica GN=Si014527m.g PE=4 SV=1
14 : K3YK37_SETIT 0.49 0.66 2 42 35 75 41 0 0 142 K3YK37 Uncharacterized protein OS=Setaria italica GN=Si014606m.g PE=4 SV=1
15 : K3YK38_SETIT 0.49 0.66 2 42 35 75 41 0 0 142 K3YK38 Uncharacterized protein OS=Setaria italica GN=Si014607m.g PE=4 SV=1
16 : K3YK39_SETIT 0.49 0.68 2 42 35 75 41 0 0 142 K3YK39 Uncharacterized protein OS=Setaria italica GN=Si014608m.g PE=4 SV=1
17 : K3YK46_SETIT 0.49 0.68 2 42 35 75 41 0 0 140 K3YK46 Uncharacterized protein OS=Setaria italica GN=Si014608m.g PE=4 SV=1
18 : K3YK47_SETIT 0.49 0.66 2 42 35 75 41 0 0 140 K3YK47 Uncharacterized protein OS=Setaria italica GN=Si014607m.g PE=4 SV=1
19 : M0URZ0_HORVD 0.49 0.77 8 46 1 39 39 0 0 102 M0URZ0 Uncharacterized protein (Fragment) OS=Hordeum vulgare var. distichum PE=4 SV=1
20 : Q8LT01_AVESA 0.49 0.64 2 46 30 74 45 0 0 137 Q8LT01 Leaf thionin Asthi3 OS=Avena sativa GN=Asthi3 PE=2 SV=1
21 : Q8LT02_AVESA 0.49 0.71 2 46 30 74 45 0 0 137 Q8LT02 Leaf thionin Asthi2 OS=Avena sativa GN=Asthi2 PE=2 SV=1
22 : THNA_PHOLI 0.49 0.76 2 46 2 46 45 0 0 46 P01540 Ligatoxin-A OS=Phoradendron liga PE=1 SV=1
23 : Q41609_TULGE 0.48 0.70 2 44 20 63 44 1 1 124 Q41609 Thionin class 4 (Precursor) OS=Tulipa gesneriana GN=Thi4-1 PE=2 SV=1
24 : C9W327_HORVD 0.47 0.76 2 46 30 74 45 0 0 137 C9W327 Lemma thionin OS=Hordeum vulgare var. distichum GN=LemThio1 PE=2 SV=1
25 : D0VWT3_VISAL3C8P 0.47 0.76 2 46 2 46 45 0 0 46 D0VWT3 Viscotoxin A1 OS=Viscum album PE=1 SV=1
26 : F2D5I9_HORVD 0.47 0.76 2 46 30 74 45 0 0 137 F2D5I9 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
27 : F2D7C5_HORVD 0.47 0.76 2 46 30 74 45 0 0 137 F2D7C5 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
28 : I1Q2L0_ORYGL 0.47 0.69 2 37 29 64 36 0 0 150 I1Q2L0 Uncharacterized protein OS=Oryza glaberrima PE=4 SV=1
29 : Q9S981_CRAAB 0.47 0.73 2 46 26 70 45 0 0 134 Q9S981 Crambin=THIONIN variant THI2CA2 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
30 : Q9S9A2_VISAL 0.47 0.78 2 46 28 72 45 0 0 111 Q9S9A2 Viscotoxin A OS=Viscum album PE=4 SV=1
31 : R0F7R7_9BRAS 0.47 0.78 2 46 26 70 45 0 0 134 R0F7R7 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006002mg PE=4 SV=1
32 : R0HUF3_9BRAS 0.47 0.72 2 37 26 61 36 0 0 133 R0HUF3 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10021136mg PE=4 SV=1
33 : THN3_VISAL 1OKH 0.47 0.78 2 46 28 72 45 0 0 111 P01538 Viscotoxin-A3 OS=Viscum album GN=THI2.1 PE=1 SV=2
34 : THN5_HORVU 0.47 0.76 2 46 30 74 45 0 0 137 P09617 Leaf-specific thionin OS=Hordeum vulgare GN=THI1.5 PE=2 SV=2
35 : THN7_HORVU 0.47 0.76 2 46 30 74 45 0 0 137 Q42838 Thionin BTH7 OS=Hordeum vulgare PE=2 SV=1
36 : THNB_PHOLI 0.47 0.76 2 46 2 46 45 0 0 46 P59358 Ligatoxin-B OS=Phoradendron liga PE=1 SV=1
37 : THNB_VISAL 2V9B 0.47 0.76 2 46 8 52 45 0 0 103 P08943 Viscotoxin-B (Fragment) OS=Viscum album GN=THI2.2 PE=1 SV=2
38 : A2YDH3_ORYSI 0.46 0.65 2 38 29 65 37 0 0 135 A2YDH3 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_23164 PE=4 SV=1
39 : A2YDH5_ORYSI 0.46 0.65 2 38 29 65 37 0 0 135 A2YDH5 Thionin OS=Oryza sativa subsp. indica GN=OsI_23165 PE=2 SV=1
40 : B9FTI4_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 110 B9FTI4 Putative uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_21501 PE=4 SV=1
41 : B9FTI5_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 110 B9FTI5 Putative uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_21502 PE=2 SV=1
42 : C7J3B8_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 104 C7J3B8 Os06g0515301 protein OS=Oryza sativa subsp. japonica GN=Os06g0515301 PE=4 SV=1
43 : Q5Z434_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 135 Q5Z434 Os06g0512700 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0061G23.50 PE=2 SV=1
44 : Q5Z4K0_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 135 Q5Z4K0 Os06g0513781 protein OS=Oryza sativa subsp. japonica GN=OSJNBa0022O06.36 PE=2 SV=1
45 : Q5Z4S4_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 135 Q5Z4S4 Os06g0514100 protein OS=Oryza sativa subsp. japonica GN=P0597A07.39 PE=2 SV=1
46 : Q8LT04_ORYSJ 0.46 0.65 2 38 29 65 37 0 0 135 Q8LT04 Os06g0514800 protein OS=Oryza sativa subsp. japonica GN=Osthi1 PE=2 SV=1
47 : R0H2A6_9BRAS 0.46 0.73 2 38 26 62 37 0 0 136 R0H2A6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10005988mg PE=4 SV=1
48 : D2CVR6_BRANA 0.45 0.73 3 46 27 70 44 0 0 134 D2CVR6 Thionin OS=Brassica napus PE=2 SV=1
49 : D7L3I4_ARALL 0.45 0.73 3 42 30 69 40 0 0 89 D7L3I4 Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_899723 PE=4 SV=1
50 : M4C9D6_BRARP 0.45 0.73 3 46 27 70 44 0 0 134 M4C9D6 Thionin OS=Brassica rapa subsp. pekinensis GN=THI2 PE=4 SV=1
51 : THN_BRARP 0.45 0.73 3 46 27 70 44 0 0 133 Q9SBK8 Thionin OS=Brassica rapa subsp. pekinensis GN=THI2 PE=2 SV=1
52 : A2YDG9_ORYSI 0.44 0.64 2 37 29 64 36 0 0 135 A2YDG9 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_23158 PE=4 SV=1
53 : B8B341_ORYSI 0.44 0.64 2 37 29 64 36 0 0 127 B8B341 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_23169 PE=4 SV=1
54 : F2D1X7_HORVD 0.44 0.73 2 46 30 74 45 0 0 137 F2D1X7 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
55 : I1H3Q6_BRADI 0.44 0.64 2 46 30 74 45 0 0 139 I1H3Q6 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57400 PE=4 SV=1
56 : I1Q2K9_ORYGL 0.44 0.64 2 37 29 64 36 0 0 135 I1Q2K9 Uncharacterized protein OS=Oryza glaberrima PE=4 SV=1
57 : M0WAI3_HORVD 0.44 0.73 2 46 30 74 45 0 0 94 M0WAI3 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
58 : M0WAI4_HORVD 0.44 0.73 2 46 30 74 45 0 0 85 M0WAI4 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
59 : Q38L62_SALMI 0.44 0.67 2 46 20 63 45 1 1 121 Q38L62 Thionin OS=Salvia miltiorrhiza PE=2 SV=1
60 : Q43225_TULGE 0.44 0.76 2 46 20 64 45 0 0 114 Q43225 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-2 PE=4 SV=1
61 : Q43226_TULGE 0.44 0.76 2 46 20 64 45 0 0 107 Q43226 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-3 PE=4 SV=1
62 : Q5Z4W6_ORYSJ 0.44 0.64 2 37 29 64 36 0 0 135 Q5Z4W6 Putative thionin Osthi1 OS=Oryza sativa subsp. japonica GN=OSJNBb0071G09.18 PE=4 SV=1
63 : Q9S980_CRAAB 0.44 0.72 2 44 26 68 43 0 0 133 Q9S980 Crambin=THIONIN variant THI2CA3 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
64 : R0F343_9BRAS 0.44 0.69 2 46 26 69 45 1 1 77 R0F343 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006928mg PE=4 SV=1
65 : R0FRU0_9BRAS 0.44 0.80 2 46 26 70 45 0 0 134 R0FRU0 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10018222mg PE=4 SV=1
66 : R0G7T6_9BRAS 0.44 0.71 2 46 24 68 45 0 0 132 R0G7T6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014903mg PE=4 SV=1
67 : R0G868_9BRAS 0.44 0.78 2 46 24 68 45 0 0 132 R0G868 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014907mg PE=4 SV=1
68 : R0GLD4_9BRAS 0.44 0.67 2 46 26 70 45 0 0 109 R0GLD4 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006071mg PE=4 SV=1
69 : R0GP19_9BRAS 0.44 0.73 2 46 26 70 45 0 0 134 R0GP19 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10012371mg PE=4 SV=1
70 : R0H0H7_9BRAS 0.44 0.69 2 46 26 69 45 1 1 133 R0H0H7 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006551mg PE=4 SV=1
71 : R0I1C8_9BRAS 0.44 0.78 2 46 24 68 45 0 0 132 R0I1C8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10014906mg PE=4 SV=1
72 : THN2_VISAL 1JMN 0.44 0.76 2 42 2 42 41 0 0 46 P32880 Viscotoxin-A2 OS=Viscum album GN=THI2.3 PE=1 SV=1
73 : THNC_VISAL 1ORL 0.44 0.76 2 46 2 46 45 0 0 46 P83554 Viscotoxin-C1 OS=Viscum album PE=1 SV=1
74 : THNX_HORVU 0.44 0.76 2 46 30 74 45 0 0 137 Q8H0Q5 Probable leaf thionin OS=Hordeum vulgare PE=2 SV=1
75 : I1H3P3_BRADI 0.43 0.66 2 45 56 99 44 0 0 161 I1H3P3 Uncharacterized protein (Fragment) OS=Brachypodium distachyon GN=BRADI1G57290 PE=4 SV=1
76 : I1H3Q0_BRADI 0.43 0.73 2 45 30 73 44 0 0 139 I1H3Q0 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57337 PE=4 SV=1
77 : Q9S974_CRAAB 0.43 0.75 2 45 27 70 44 0 0 135 Q9S974 Crambin=THIONIN variant THI2CA12 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
78 : R0F708_9BRAS 0.43 0.68 2 38 26 62 37 0 0 133 R0F708 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006008mg PE=4 SV=1
79 : R0HVZ8_9BRAS 0.43 0.70 2 45 26 69 44 0 0 134 R0HVZ8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10016370mg PE=4 SV=1
80 : R0IA88_9BRAS 0.43 0.68 2 38 26 62 37 0 0 133 R0IA88 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10021135mg PE=4 SV=1
81 : I1H3P1_BRADI 0.42 0.67 2 46 30 74 45 0 0 135 I1H3P1 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57280 PE=4 SV=1
82 : I1H3P4_BRADI 0.42 0.67 2 46 56 100 45 0 0 127 I1H3P4 Uncharacterized protein (Fragment) OS=Brachypodium distachyon GN=BRADI1G57290 PE=4 SV=1
83 : M0WAF4_HORVD 0.42 0.71 2 46 30 74 45 0 0 137 M0WAF4 Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
84 : Q9S9A0_VISAL 0.42 0.76 2 46 28 72 45 0 0 114 Q9S9A0 Thionin OS=Viscum album PE=4 SV=1
85 : Q9S9D7_HORVU 0.42 0.71 2 46 30 74 45 0 0 137 Q9S9D7 Thionin OS=Hordeum vulgare PE=4 SV=1
86 : R0F705_9BRAS 0.42 0.76 2 46 26 70 45 0 0 134 R0F705 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006003mg PE=4 SV=1
87 : R0FHE8_9BRAS 0.42 0.71 2 46 26 70 45 0 0 136 R0FHE8 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10002233mg PE=4 SV=1
88 : R0HES6_9BRAS 0.42 0.78 2 46 24 68 45 0 0 132 R0HES6 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10024304mg PE=4 SV=1
89 : THN6_HORVU 0.42 0.71 2 46 30 74 45 0 0 137 P09618 Leaf-specific thionin BTH6 OS=Hordeum vulgare PE=2 SV=3
90 : I1H3P2_BRADI 0.41 0.64 2 45 30 73 44 0 0 135 I1H3P2 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57285 PE=4 SV=1
91 : I1H3P5_BRADI 0.41 0.61 2 45 30 73 44 0 0 135 I1H3P5 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI1G57296 PE=4 SV=1
92 : Q9S977_CRAAB 0.41 0.73 2 45 27 70 44 0 0 135 Q9S977 Crambin=THIONIN variant THI2CA9 (Precursor) OS=Crambe hispanica subsp. abyssinica PE=4 SV=1
93 : THN1_VISAL 0.41 0.68 2 42 2 42 41 0 0 46 P01537 Viscotoxin-1-PS OS=Viscum album GN=THI2.4 PE=1 SV=1
94 : THN24_ARATH 0.41 0.68 3 46 27 70 44 0 0 134 Q9C8D6 Probable thionin-2.4 OS=Arabidopsis thaliana GN=At1g66100 PE=2 SV=1
95 : Q43224_TULGE 0.40 0.67 2 46 20 64 45 0 0 121 Q43224 Thionin class 1 (Precursor) OS=Tulipa gesneriana GN=Thi1-1 PE=2 SV=1
96 : Q8LT00_AVESA 0.40 0.71 2 46 30 74 45 0 0 142 Q8LT00 Thionin Asthi4 OS=Avena sativa GN=Asthi4 PE=2 SV=1
97 : THN3_HORVU 0.40 0.71 2 46 30 74 45 0 0 137 P08772 Leaf-specific thionin DB4 OS=Hordeum vulgare GN=THI1.3 PE=2 SV=2
98 : THN_PHOTO 0.40 0.69 2 46 2 46 45 0 0 46 P01539 Phoratoxin OS=Phoradendron tomentosum PE=1 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A T 0 0 139 2 0 T
2 2 A T - 0 0 29 92 7 T SSSSSSSSSSSSSSSS TSSSSSSSSSSSSSSSSSSSSSSSSSSS SSSSSSSSSSSSSSSSSSS
3 3 A a + 0 0 3 98 0 CCCCCCCCCCCCCCCCCC CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
4 4 A b - 0 0 0 98 3 CCCCCCCCCCCCCCCCCC CCCFCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
5 5 A P S S+ 0 0 82 98 44 PPPPPPPRKPPPPPPPPP KKPPKPKKPPPPPPKKPPPPPPPPPPPPPPPPPPKKPKKPRRPPPPPPPPP
6 6 A S - 0 0 53 98 61 SNSSSNNNDSTSSSSSSS DDSSNSNNSTNNSNNNSNSSSSSSSSSTRSRRSSTTSTTSNNSTTNSSDNS
7 7 A I S > S+ 0 0 131 98 35 ITITTTRTITTITTTTTT DTTTTTTTTPTTTTTTTTTTTTTTTTTTTETTTTTTTTTTTTTMSTTTKTT
8 8 A V G > S+ 0 0 109 99 58 VTTSATTWMNATTTTTTTTITTATTTTIITTSTTTTTTTTTTTTTTTISIITIFTTFFTTTTATTTTTAT
9 9 A A G > S+ 0 0 21 99 31 AAAAAAAAAAAAAAAAAAGAAAAGGGGAAGAAGGGAGAAAAAAAAAADSDDAAAAAAAAAAAAAAAASPA
10 10 A R G X S+ 0 0 65 99 3 RRRRRRRRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
11 11 A S G X S+ 0 0 64 99 32 SSNNNNNNNNNNNNNNNNNNNNYNNNNNKNNNNNNNNNNNNNNNNNNNENNNNNNNNNNNNNINNNNNNN
12 12 A N G X S+ 0 0 86 99 68 NNTCCCVCCIQTVVIVVICCCICCICCVTIIIICCIIVVVVVVVVVVINIIIVCCICCICCIQICIICCV
13 13 A F G X> S+ 0 0 74 99 1 FFYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFYYYYYYYYYYYYYYYYYYYYY
14 14 A N G <4 S+ 0 0 75 99 10 NNNNNNNNNNNNNNNNNNNNNNNNNNNNVNNNNNNNNNNNNNNNNNNNDNNNNNNNNNNNNNNNNNNNNN
15 15 A V G <4 S+ 0 0 67 99 67 VVIVVVVVVAIVTTTTTTVVVTATTAASVATTAAATTSSSSSSSSSTALAASSTVSTTTVVSASATTAGV
16 16 A c T <4 S+ 0 0 24 99 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
17 17 A R S < S+ 0 0 182 99 12 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRIRRRRRRRRRRRRRRRRRRRRR
18 18 A L S > S- 0 0 89 99 40 LLLLFLLLILLLLLILLIFIILLFLFFFLLLFLFFLLFFFFFFFFFLLSLLFFFFFFFLLLFAVFFFLIA
19 19 A P T 3 S+ 0 0 136 99 66 PPPTPPGPPMPPTTTTTTGPPTPATAAATTPTTAATGAAAAAAAAAATYTTTVATTAARPPTLVPGAIPG
20 20 A G T 3 - 0 0 51 99 7 GGGGGGpgGGGGGGGGGGGGGGgGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGLGGGGGGGGGGG
21 21 A T < + 0 0 79 98 55 TTTTTTpiTTTTAATAATGTTTrGSGGGSATGAGGAGGGGGGGGGGLAAAAAGGAAGG.TTATGAGTTTG
22 22 A S S S+ 0 0 88 99 41 PAPSPPSSPSPPSSSSSSSPPSPSSSSSTPPSPSSSSSSSSSSSSSSSSSSSSSSSSSPPPSPSPSPYAS
23 23 A E S S- 0 0 75 99 46 EERRRRPRRRRRRRRRRRRTRRERRRRRICRRRRRRRRRRRRRRRRRMQMMRRRRRRRARRRRRRRRTKR
24 24 A A S S+ 0 0 54 99 61 APPPPPAEPPPPPPPPPPPFPPTPEPPDAPPPPPPSEDDNDDDDNDATPTTDDPLDPPSPPDPPPPPEGP
25 25 A I >> + 0 0 101 99 71 LIVRVTVIVIVVTTTMMTVIVTIVTVVTSTTFTVVVRTTTTTTTTTVNANNKTVAKVVSVVKVTVVVATT
26 26 A c T 34 + 0 0 2 99 1 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCSCCCCCCCCCCCCCCCCC
27 27 A A T 34 S+ 0 0 17 99 16 AAAAAAAAAAAAAAAAAAAAAAAAAAAAIAAAAAAAAAAAAAAAAAAASAACAAACAAAAACAAASAAAT
28 28 A T T X4 S+ 0 0 82 99 72 TTTSASRKTSASKKKKKKSNTSATKTTKKKSSKTTSSKKKKKKKKKNNSNNKKGNKGGDAAKAGSGSKIS
29 29 A Y T 3< S+ 0 0 161 99 73 YDLLTLLKTLLILLLLLLAMTLRALAALYLLLLAALLLLLLLLLLLRLILLILAIIAALTTILLLLLALV
30 30 A T T 3 S- 0 0 4 99 34 TTSCCCSCCSSSSSSSSSCCCSCCSCCSSSSSSCCSSSSSSSSSSSSSTSSSSCCSCCSCCSSSCSSCCS
31 31 A G S < S+ 0 0 41 99 26 GGGGGGGDRGGGGGGGGGGRRGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDDGGGGGGGGG
32 32 A b - 0 0 2 99 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
33 33 A I - 0 0 45 98 47 IIIKKKKKK.KKQQKQQKKIKKKKKKKKIKKKKKKKKKKKKKKKKKRKVKKKKKTKKKKKKKKKQKKKKK
34 34 A I - 0 0 119 99 36 IIIIIIIIIKIIIIIIIIIIIIIIIIIITIIIIIIIIIIIIIIIIILIRIIIIIIIIIVIIIIIIINILI
35 35 A I - 0 0 16 99 23 IIQVIVIIIIILIIIIIIITIIIIIIIVIIIVIIIIIVVVVVVVVVVVFVVVVITVIIIIIVLVVIIVIV
36 36 A P S S- 0 0 121 99 56 PPSDTENSSVSSSSSSSSSRSSSSSSSDSSSDSSSSSDDDDDDDDDNSLSSDDSSDSSDSSDDDGGSGED
37 37 A G S S- 0 0 48 98 29 GGDGGGGGGNGVGGGGGGGRGGSGAGGGGGSGGGGGAGGGGGGGGGGGGGGGGGGGGGGSSGVGGGGGG.
38 38 A A S S+ 0 0 89 93 67 AASTTPSTNGTTNNDNNDPNNSGPSPP TSG SPPSSNNNNNNNNNSTSTT ST SSGGG TSSTTSTK
39 39 A T - 0 0 83 79 56 TTT KTTTKSGKKKKKKKTEKTNTTTT QTR TTTTT TTTT KT KKTKK K.TKKTTT
40 40 A a - 0 0 42 80 0 CCC CCCCCCCCCCCCCCCCCCCCCCC CCC CCCCC CCCC CC CCCCC CCCCCCCC
41 41 A P - 0 0 77 80 18 PPK PPPPPPPPKKKKKKPPPBPPPPP PPP PPPDP PPPP PP PPPPP PPPPPPPP
42 42 A G S S+ 0 0 50 80 68 GGP PPNSKGPSPPPPPPRNKSPRSRR NSN SRRSS PPPP SN SSTPP PNSNNSPP
43 43 A D S S+ 0 0 129 71 36 DDP DGGDDGGN DDDGGDNDD GDG DDDGD G GG DD DDGGG DGGGGGGG
44 44 A Y S S- 0 0 78 70 16 YYY Y YYYYYL YYYWYYYYY YYY YYYWY Y YY YL YYWYY YYYYYYYY
45 45 A A 0 0 75 67 47 AAP P TPPP P PPPB PPPP PPT PPPNP T TT PR PPTEE PTTTTTP
46 46 A N 0 0 105 58 56 NN K NKKK KKKH KKKK HKH KKKHK H HH KK KKNKK NQHHHHH
## ALIGNMENTS 71 - 98
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A T 0 0 139 2 0
2 2 A T - 0 0 29 92 7 SSSSSSSSSSSSSSSSSSSSSSS SSSS
3 3 A a + 0 0 3 98 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCC
4 4 A b - 0 0 0 98 3 CCCCCCCCCCCCCCCCCCCCCYCCCCCC
5 5 A P S S+ 0 0 82 98 44 PPPKKKPPPPKKKRKPPPKKKPPPRKKP
6 6 A S - 0 0 53 98 61 TNNNNTTTSNNNDNDNSSDNNTBSTSDT
7 7 A I S > S+ 0 0 131 98 35 TTTTTTKSSSTTTTTTITTTTKTITTTT
8 8 A V G > S+ 0 0 109 99 58 TTTTSMSTTTSSLTLTTTLSSSTQATLT
9 9 A A G > S+ 0 0 21 99 31 AGGGSAAAAASSAGAGSAASSAGAAAAA
10 10 A R G X S+ 0 0 65 99 3 RRRRRRRRRRRRRRRRRRRRRRRRRIRR
11 11 A S G X S+ 0 0 64 99 32 NNNNLNNNNNLLNNNNNNNFLNBTNNNN
12 12 A N G X S+ 0 0 86 99 68 IIICCCTIIVCCCCCCFICCCTIFCCCI
13 13 A F G X> S+ 0 0 74 99 1 YYYYYYFYYYYYYYYYYYYYYFYYYYYY
14 14 A N G <4 S+ 0 0 75 99 10 NNNNNNDNNNNNNNNNKNNNNDBNNNNN
15 15 A V G <4 S+ 0 0 67 99 67 TTTAAVVTTTAATATVSTTAAVTAVVTT
16 16 A c T <4 S+ 0 0 24 99 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCC
17 17 A R S < S+ 0 0 182 99 12 RRRHRRRRRRRRRRRRCRRRRRRLRRHR
18 18 A L S > S- 0 0 89 99 40 LFFFKFLVFVKKFVFLIFFKKLFFLLFF
19 19 A P T 3 S+ 0 0 136 99 66 TGAAAGTVGVAAAPATVAALLTGAGAAG
20 20 A G T 3 - 0 0 51 99 7 GGGGGGGGGGGGGGGGGGGGGGGVGGGG
21 21 A T < + 0 0 79 98 55 SGGGGGTGVGGGGTGAFSGGGTGGTAGG
22 22 A S S S+ 0 0 88 99 41 PSSSSASSPSSSSPSPPPSTTSSSPPSS
23 23 A E S S- 0 0 75 99 46 RRRRQRMRSRQQRRRRIRRRRMRPQRRR
24 24 A A S S+ 0 0 54 99 61 PEEPGPGPSPGGPPPPGPPPLGZSTPPP
25 25 A I >> + 0 0 101 99 71 VVRVAVLSITAAVVVIKVVEELVSLVVV
26 26 A c T 34 + 0 0 2 99 1 CCCCCCCCCCCCCCCCCCCCCCCCCCCC
27 27 A A T 34 S+ 0 0 17 99 16 AAAAAAAAVAAAAAAAAAAAAAAIAAAA
28 28 A T T X4 S+ 0 0 82 99 72 SSKTSRAGGGSSGSGNKSGSSARRRGGK
29 29 A Y T 3< S+ 0 0 161 99 73 LLLATSILLLTTALAGLLARRIINTPAL
30 30 A T T 3 S- 0 0 4 99 34 SSSCCCSSSSCCCCCCSSCCCSSSCCCS
31 31 A G S < S+ 0 0 41 99 26 GGGGGGEGGGGGRDRGGGRGGEGSDGRG
32 32 A b - 0 0 2 99 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCC
33 33 A I - 0 0 45 98 47 IKKKIKKKLKIIKKKKKKKIIKKLIKKK
34 34 A I - 0 0 119 99 36 NIIIHIIILIHHIIIIINIHHIIDHLII
35 35 A I - 0 0 16 99 23 IIIIIILVIVIIIIIVIIIIILIIILII
36 36 A P S S- 0 0 121 99 56 SSSSDKSDGDDDSSSNTSSSSSSSTDSS
37 37 A G S S- 0 0 48 98 29 GAAGGGVGGGGGGGGSGGGGGVAETVGG
38 38 A A S S+ 0 0 89 93 67 TSSPKSTSSSKKPSPGTTPNNTSSGTPT
39 39 A T - 0 0 83 79 56 KTTTRTK K RRKKKSTKKRKKTTNTKK
40 40 A a - 0 0 42 80 0 CCCCCCC C CCCCCCCCCCCCCCCCCC
41 41 A P - 0 0 77 80 18 PPPPPPP P PPPPPPPPPPPPPPPPPD
42 42 A G S S+ 0 0 50 80 68 NSSRANS N AASASNNNSAASSRRSSS
43 43 A D S S+ 0 0 129 71 36 G DDDGN G DDDDDGGGDDDN GSDDG
44 44 A Y S S- 0 0 78 70 16 Y YYYFL Y YYYYYYYYYYYL YHWYW
45 45 A A 0 0 75 67 47 P PPPPP T PPPPPTSTPPPP TPPPN
46 46 A N 0 0 105 58 56 H KK KSKRKHKHK NKKKH
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 2 0 0 0.000 0 1.00
2 2 A 0 0 0 0 0 0 0 0 0 0 97 3 0 0 0 0 0 0 0 0 92 0 0 0.144 4 0.93
3 3 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 98 0 0 0.000 0 1.00
4 4 A 0 0 0 0 1 0 1 0 0 0 0 0 98 0 0 0 0 0 0 0 98 0 0 0.114 3 0.96
5 5 A 0 0 0 0 0 0 0 0 0 70 0 0 0 0 5 24 0 0 0 0 98 0 0 0.743 24 0.56
6 6 A 0 0 0 0 0 0 0 0 0 0 43 16 0 0 3 0 0 0 29 8 98 0 0 1.328 44 0.39
7 7 A 0 0 7 1 0 0 0 0 0 1 4 81 0 0 1 3 0 1 0 1 98 0 0 0.833 27 0.64
8 8 A 2 4 7 2 3 1 0 0 6 0 10 63 0 0 0 0 1 0 1 0 99 0 0 1.414 47 0.41
9 9 A 0 0 0 0 0 0 0 16 72 1 8 0 0 0 0 0 0 0 0 3 99 0 0 0.889 29 0.69
10 10 A 0 0 1 0 0 0 0 0 0 0 0 0 0 0 98 1 0 0 0 0 99 0 0 0.113 3 0.96
11 11 A 0 4 1 0 1 0 1 0 0 0 3 1 0 0 0 1 0 1 86 0 99 0 0 0.645 21 0.68
12 12 A 19 0 29 0 2 0 0 0 0 0 0 5 38 0 0 0 2 0 4 0 99 0 0 1.482 49 0.32
13 13 A 0 0 0 0 6 0 94 0 0 0 0 0 0 0 0 0 0 0 0 0 99 0 0 0.229 7 0.99
14 14 A 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 94 3 99 0 0 0.258 8 0.90
15 15 A 24 1 2 0 0 0 0 1 22 0 16 33 0 0 0 0 0 0 0 0 99 0 0 1.510 50 0.33
16 16 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 99 0 0 0.000 0 1.00
17 17 A 0 1 1 0 0 0 0 0 0 0 0 0 1 2 95 0 0 0 0 0 99 0 0 0.267 8 0.87
18 18 A 4 38 7 0 42 0 0 0 2 0 1 0 0 0 0 5 0 0 0 0 99 0 0 1.324 44 0.60
19 19 A 5 3 1 1 0 0 1 11 32 19 0 25 0 0 1 0 0 0 0 0 99 0 0 1.716 57 0.33
20 20 A 1 1 0 0 0 0 0 97 0 1 0 0 0 0 0 0 0 0 0 0 99 1 3 0.169 5 0.93
21 21 A 1 1 1 0 1 0 0 44 18 1 4 28 0 0 1 0 0 0 0 0 98 0 0 1.439 48 0.45
22 22 A 0 0 0 0 0 0 1 0 3 27 66 3 0 0 0 0 0 0 0 0 99 0 0 0.889 29 0.58
23 23 A 0 0 2 5 0 0 0 0 1 2 1 2 1 0 76 1 5 4 0 0 99 0 0 1.064 35 0.53
24 24 A 0 2 0 0 1 0 0 7 5 55 4 5 0 0 0 0 0 6 2 12 99 0 0 1.579 52 0.39
25 25 A 37 4 8 2 1 0 0 0 6 0 4 25 0 0 3 4 0 2 3 0 99 0 0 1.893 63 0.28
26 26 A 0 0 0 0 0 0 0 0 0 0 1 0 99 0 0 0 0 0 0 0 99 0 0 0.056 1 0.99
27 27 A 1 0 2 0 0 0 0 0 91 0 2 1 3 0 0 0 0 0 0 0 99 0 0 0.443 14 0.84
28 28 A 0 0 1 0 0 0 0 13 8 0 23 12 0 0 5 29 0 0 7 1 99 0 0 1.855 61 0.28
29 29 A 1 52 9 1 0 0 3 1 15 1 1 9 0 0 4 1 0 0 1 1 99 0 0 1.671 55 0.27
30 30 A 0 0 0 0 0 0 0 0 0 0 58 4 38 0 0 0 0 0 0 0 99 0 0 0.815 27 0.65
31 31 A 0 0 0 0 0 0 0 85 0 0 1 0 0 0 7 0 0 2 0 5 99 0 0 0.603 20 0.73
32 32 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 99 1 0 0.000 0 1.00
33 33 A 1 2 13 0 0 0 0 0 0 0 0 1 0 0 1 77 5 0 0 0 98 0 0 0.844 28 0.53
34 34 A 1 4 82 0 0 0 0 0 0 0 0 1 0 6 1 1 0 0 3 1 99 0 0 0.802 26 0.63
35 35 A 28 5 63 0 1 0 0 0 0 0 0 2 0 0 0 0 1 0 0 0 99 0 0 0.973 32 0.77
36 36 A 1 1 0 0 0 0 0 4 0 3 55 3 0 0 1 1 0 2 3 26 99 1 0 1.394 46 0.43
37 37 A 5 0 0 0 0 0 0 80 5 0 5 1 0 0 1 0 0 1 1 1 98 0 0 0.871 29 0.71
38 38 A 0 0 0 0 0 0 0 9 3 13 27 23 0 0 0 4 0 0 19 2 93 1 0 1.811 60 0.32
39 39 A 0 0 0 0 0 0 0 1 0 0 3 47 0 0 6 38 1 1 3 0 79 0 0 1.250 41 0.43
40 40 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 80 0 0 0.000 0 1.00
41 41 A 0 0 0 0 0 0 0 0 0 88 0 0 0 0 0 9 0 0 0 3 80 0 0 0.422 14 0.82
42 42 A 0 0 0 0 0 0 0 5 8 25 30 1 0 0 11 3 0 0 17 0 80 0 0 1.750 58 0.31
43 43 A 0 0 0 0 0 0 0 42 0 1 1 0 0 0 0 0 0 0 6 49 71 0 0 0.995 33 0.63
44 44 A 0 6 0 0 1 7 84 0 0 0 0 0 0 1 0 0 0 0 0 0 70 0 0 0.618 20 0.84
45 45 A 0 0 0 0 0 0 0 0 4 63 1 22 0 0 1 0 0 3 3 0 67 0 0 1.102 36 0.52
46 46 A 0 0 0 0 0 0 0 0 0 0 2 0 0 28 2 55 2 0 12 0 58 0 0 1.149 38 0.44
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
7 20 45 1 pAp
8 20 21 1 gTi
23 20 39 1 gCr
//