Complet list of 1wj2 hssp file
Complete list of 1wj2.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1WJ2
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-11
HEADER DNA BINDING PROTEIN 28-MAY-04 1WJ2
COMPND MOL_ID: 1; MOLECULE: PROBABLE WRKY TRANSCRIPTION FACTOR 4; CHAIN: A; F
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: ARABIDOPSIS THALIANA; ORGANISM_COMMON:
AUTHOR K.YAMASAKI,M.INOUE,T.KIGAWA,S.YOKOYAMA,RIKEN STRUCTURAL GENOMICS/PROTE
DBREF 1WJ2 A 399 469 UNP Q9XI90 WRKY4_ARATH 399 469
SEQLENGTH 71
NCHAIN 1 chain(s) in 1WJ2 data set
NALIGN 72
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : G3FFC2_9ROSI 0.91 0.98 14 71 1 58 58 0 0 102 G3FFC2 WRKY transcription factor 2-6 (Fragment) OS=Dimocarpus longan PE=2 SV=1
2 : D6MK30_9ASPA 0.89 0.97 1 71 33 103 71 0 0 140 D6MK30 Transcription factor (Fragment) OS=Lycoris longituba PE=2 SV=1
3 : G3FFA0_9ROSI 0.82 0.93 1 71 34 104 71 0 0 107 G3FFA0 WRKY transcription factor 36-3 (Fragment) OS=Dimocarpus longan PE=2 SV=1
4 : B3VKU1_SOLLC 0.80 0.91 16 71 1 56 56 0 0 69 B3VKU1 WRKY (Fragment) OS=Solanum lycopersicum PE=2 SV=1
5 : D8RWE6_SELML 0.80 0.94 1 71 1 71 71 0 0 71 D8RWE6 Putative uncharacterized protein WRKY_15 (Fragment) OS=Selaginella moellendorffii GN=WRKY_15 PE=4 SV=1
6 : G3FFB6_9ROSI 0.78 0.90 12 70 1 59 59 0 0 79 G3FFB6 WRKY transcription factor 29-4 (Fragment) OS=Dimocarpus longan PE=2 SV=1
7 : S8EGG5_9LAMI 0.77 0.93 1 71 10 80 71 0 0 80 S8EGG5 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_02986 PE=4 SV=1
8 : V5KY51_9MYRT 0.76 0.88 13 70 1 58 58 0 0 59 V5KY51 WRKY9 protein (Fragment) OS=Lumnitzera racemosa PE=4 SV=1
9 : D8TZY8_VOLCA 0.70 0.79 15 70 1 56 56 0 0 56 D8TZY8 Putative uncharacterized protein (Fragment) OS=Volvox carteri GN=VOLCADRAFT_48963 PE=4 SV=1
10 : A9SSY9_PHYPA 0.69 0.84 1 67 23 89 67 0 0 89 A9SSY9 Transcription factor WRKY17 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY17 PE=4 SV=1
11 : G3FF92_9ROSI 0.68 0.84 3 71 12 80 69 0 0 84 G3FF92 WRKY transcription factor 29-1 (Fragment) OS=Dimocarpus longan PE=2 SV=1
12 : A9S4M6_PHYPA 0.67 0.81 15 67 1 54 54 1 1 54 A9S4M6 Transcription factor WRKY13 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY13 PE=4 SV=1
13 : G3FF97_9ROSI 0.67 0.83 3 71 15 83 69 0 0 98 G3FF97 WRKY transcription factor 29-2 (Fragment) OS=Dimocarpus longan PE=2 SV=1
14 : A9RF14_PHYPA 0.66 0.81 1 67 18 84 67 0 0 84 A9RF14 Transcription factor WRKY20 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY20 PE=4 SV=1
15 : D8RMJ9_SELML 0.66 0.81 1 67 14 80 67 0 0 80 D8RMJ9 Putative uncharacterized protein WRKY_11 (Fragment) OS=Selaginella moellendorffii GN=WRKY_11 PE=4 SV=1
16 : D8SLN2_SELML 0.66 0.81 1 67 24 90 67 0 0 90 D8SLN2 Putative uncharacterized protein WRKY_29 (Fragment) OS=Selaginella moellendorffii GN=WRKY_29 PE=4 SV=1
17 : A5AYL2_VITVI 0.65 0.78 1 68 19 86 68 0 0 104 A5AYL2 Putative uncharacterized protein (Fragment) OS=Vitis vinifera GN=VITISV_014490 PE=4 SV=1
18 : C1KGD3_9LILI 0.65 0.80 14 66 1 54 54 1 1 54 C1KGD3 Putative WRKY transcription factor WRKY4-like protein (Fragment) OS=Areca catechu PE=4 SV=1
19 : D8R033_SELML 0.65 0.81 10 70 1 62 62 1 1 62 D8R033 Putative uncharacterized protein WRKY_23 (Fragment) OS=Selaginella moellendorffii GN=WRKY_23 PE=4 SV=1
20 : Q00A39_COCNU 0.65 0.78 14 66 1 54 54 1 1 54 Q00A39 WRKY2 (Fragment) OS=Cocos nucifera PE=4 SV=1
21 : S8DJ48_9LAMI 0.64 0.78 1 69 7 75 69 0 0 95 S8DJ48 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_11942 PE=4 SV=1
22 : A9S0E7_PHYPA 0.63 0.81 1 67 6 72 67 0 0 72 A9S0E7 Transcription factor WRKY26 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY26 PE=4 SV=1
23 : A9S0E9_PHYPA 0.63 0.82 1 67 20 86 67 0 0 86 A9S0E9 Transcription factor WRKY6 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY6 PE=4 SV=1
24 : A9SZ55_PHYPA 0.63 0.84 1 67 18 84 67 0 0 84 A9SZ55 Transcription factor WRKY3 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY3 PE=4 SV=1
25 : A9U1V7_PHYPA 0.63 0.81 1 67 6 72 67 0 0 72 A9U1V7 Transcription factor WRKY15 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY15 PE=4 SV=1
26 : B2KJ57_HORVD 0.63 0.78 1 68 2 69 68 0 0 88 B2KJ57 WRKY transcription factor 14 (Fragment) OS=Hordeum vulgare var. distichum GN=WRKY14 PE=2 SV=1
27 : D8QN56_SELML 0.63 0.81 1 67 21 87 67 0 0 87 D8QN56 Putative uncharacterized protein WRKY_1 (Fragment) OS=Selaginella moellendorffii GN=WRKY_1 PE=4 SV=1
28 : S8DI39_9LAMI 0.63 0.79 1 68 22 89 68 0 0 91 S8DI39 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_15732 PE=4 SV=1
29 : S8E871_9LAMI 0.63 0.77 1 70 12 81 70 0 0 91 S8E871 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_06118 PE=4 SV=1
30 : A9RIL7_PHYPA 0.62 0.77 12 71 1 61 61 1 1 61 A9RIL7 Transcription factor WRKY33 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY33 PE=4 SV=1
31 : A9T3C9_PHYPA 0.62 0.80 10 69 1 61 61 1 1 61 A9T3C9 Transcription factor WRKY19 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY19 PE=4 SV=1
32 : A9RK12_PHYPA 0.61 0.79 10 69 1 61 61 1 1 61 A9RK12 Transcription factor WRKY5 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY5 PE=4 SV=1
33 : A9RYG9_PHYPA 0.61 0.84 1 67 10 76 67 0 0 76 A9RYG9 Transcription factor WRKY29 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY29 PE=4 SV=1
34 : A9SYJ7_PHYPA 0.61 0.79 1 67 17 83 67 0 0 83 A9SYJ7 Transcription factor WRKY28 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY28 PE=4 SV=1
35 : A9T511_PHYPA 0.61 0.79 1 67 17 83 67 0 0 83 A9T511 Transcription factor WRKY22 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY22 PE=4 SV=1
36 : A9TJU6_PHYPA 0.61 0.79 10 69 1 61 61 1 1 61 A9TJU6 Transcription factor WRKY10 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY10 PE=4 SV=1
37 : D8R084_SELML 0.61 0.81 1 67 16 82 67 0 0 82 D8R084 Putative uncharacterized protein WRKY_24 (Fragment) OS=Selaginella moellendorffii GN=WRKY_24 PE=4 SV=1
38 : D8SZP8_SELML 0.61 0.81 1 67 15 81 67 0 0 81 D8SZP8 Putative uncharacterized protein WRKY_32 (Fragment) OS=Selaginella moellendorffii GN=WRKY_32 PE=4 SV=1
39 : E7CEY8_CUCSA 0.61 0.80 10 70 1 61 61 0 0 79 E7CEY8 WRKY protein OS=Cucumis sativus GN=WRKY54 PE=4 SV=1
40 : W5CBN1_WHEAT 0.61 0.81 12 68 2 58 57 0 0 62 W5CBN1 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
41 : A9T3J4_PHYPA 0.60 0.84 1 67 6 72 67 0 0 72 A9T3J4 Transcription factor WRKY18 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY18 PE=4 SV=1
42 : A9TBC7_PHYPA 0.60 0.84 1 67 10 76 67 0 0 76 A9TBC7 Transcription factor WRKY31 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY31 PE=4 SV=1
43 : A9U0W6_PHYPA 0.60 0.85 1 67 24 90 67 0 0 90 A9U0W6 Transcription factor WRKY12 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY12 PE=4 SV=1
44 : Q09I57_PHYPA 0.60 0.74 12 67 13 69 57 1 1 71 Q09I57 WRKY transcription factor 7 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY7 PE=2 SV=1
45 : S8CMZ8_9LAMI 0.60 0.76 1 68 5 72 68 0 0 87 S8CMZ8 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_08613 PE=4 SV=1
46 : W5GM66_WHEAT 0.60 0.79 10 66 1 58 58 1 1 58 W5GM66 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
47 : A9SRR4_PHYPA 0.59 0.85 1 71 1 71 71 0 0 71 A9SRR4 Transcription factor WRKY11 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY11 PE=4 SV=1
48 : N1R081_AEGTA 0.59 0.65 1 70 234 334 101 1 31 386 N1R081 Putative WRKY transcription factor 3 OS=Aegilops tauschii GN=F775_20298 PE=4 SV=1
49 : A9TJH5_PHYPA 0.58 0.80 1 71 1 71 71 0 0 71 A9TJH5 Transcription factor WRKY21 (Fragment) OS=Physcomitrella patens subsp. patens GN=WRKY21 PE=4 SV=1
50 : H9MC19_PINRA 0.58 0.71 9 66 11 69 59 1 1 69 H9MC19 Uncharacterized protein (Fragment) OS=Pinus radiata GN=CL1104Contig1_03 PE=4 SV=1
51 : H9VAJ3_PINTA 0.58 0.71 9 66 11 69 59 1 1 69 H9VAJ3 Uncharacterized protein (Fragment) OS=Pinus taeda GN=CL1104Contig1_03 PE=4 SV=1
52 : K7P1T9_PINMU 0.58 0.71 9 66 11 69 59 1 1 69 K7P1T9 Uncharacterized protein (Fragment) OS=Pinus mugo GN=CL1104Contig1_03 PE=4 SV=1
53 : K7P3N8_PINMU 0.58 0.71 9 66 11 69 59 1 1 69 K7P3N8 Uncharacterized protein (Fragment) OS=Pinus mugo GN=CL1104Contig1_03 PE=4 SV=1
54 : W5CKI9_WHEAT 0.58 0.76 1 70 8 78 71 1 1 78 W5CKI9 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
55 : C0IMM6_PINMO 0.57 0.70 16 66 1 56 56 2 5 56 C0IMM6 Putative WRKY transcription factor PmWRKY33 (Fragment) OS=Pinus monticola PE=4 SV=1
56 : C4N0W2_BRANA 0.57 0.67 1 71 233 328 96 1 25 344 C4N0W2 WRKY26-1 transcription factor OS=Brassica napus GN=WRKY26-1 PE=2 SV=1
57 : K7UNB8_MAIZE 0.57 0.64 1 71 476 579 104 1 33 729 K7UNB8 Putative WRKY DNA-binding domain superfamily protein OS=Zea mays GN=ZEAMMB73_355776 PE=4 SV=1
58 : M7ZTD7_TRIUA 0.57 0.65 1 71 246 343 98 1 27 398 M7ZTD7 Putative WRKY transcription factor 4 OS=Triticum urartu GN=TRIUR3_28514 PE=4 SV=1
59 : K7P1T4_ABIAL 0.56 0.69 9 66 11 69 59 1 1 69 K7P1T4 Uncharacterized protein (Fragment) OS=Abies alba GN=CL1104Contig1_03 PE=4 SV=1
60 : G3FF87_9ROSI 0.55 0.76 1 65 15 80 66 1 1 80 G3FF87 WRKY transcription factor 6-3 (Fragment) OS=Dimocarpus longan PE=2 SV=1
61 : K7LFU7_SOYBN 0.54 0.68 1 68 145 222 78 1 10 238 K7LFU7 Uncharacterized protein OS=Glycine max PE=4 SV=1
62 : M1D6V7_SOLTU 0.53 0.79 1 71 12 83 72 1 1 102 M1D6V7 Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400033177 PE=4 SV=1
63 : S8E7D7_9LAMI 0.53 0.66 12 70 10 68 59 0 0 71 S8E7D7 Transcription factor (Fragment) OS=Genlisea aurea GN=M569_02987 PE=4 SV=1
64 : F0ZCP3_DICPU 0.52 0.76 5 70 1 66 66 0 0 66 F0ZCP3 Putative uncharacterized protein wrky2 (Fragment) OS=Dictyostelium purpureum GN=wrky2 PE=4 SV=1
65 : M1D315_SOLTU 0.50 0.74 1 70 43 118 76 1 6 138 M1D315 Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG401031196 PE=4 SV=1
66 : N1QX68_AEGTA 0.50 0.57 1 71 286 399 114 1 43 454 N1QX68 Putative WRKY transcription factor 4 OS=Aegilops tauschii GN=F775_08639 PE=4 SV=1
67 : B2KJ85_HORVD 0.45 0.63 1 71 37 109 73 2 2 144 B2KJ85 WRKY transcription factor 44 (Fragment) OS=Hordeum vulgare var. distichum GN=WRKY44 PE=2 SV=1
68 : C3SA50_BRADI 0.45 0.52 1 70 411 529 119 2 49 584 C3SA50 WRKY-like protein OS=Brachypodium distachyon PE=4 SV=1
69 : C1KGD2_9LILI 0.41 0.62 14 66 1 73 73 1 20 73 C1KGD2 Putative WRKY transcription factor WRKY3-like protein (Fragment) OS=Areca catechu PE=4 SV=1
70 : S8D2A8_9LAMI 0.41 0.70 1 71 11 83 73 2 2 107 S8D2A8 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_03225 PE=4 SV=1
71 : Q014I5_OSTTA 0.38 0.64 1 68 72 143 74 4 8 180 Q014I5 DNA-binding protein WRKY1 (ISS) OS=Ostreococcus tauri GN=Ot07g04210 PE=4 SV=1
72 : K7U6F7_MAIZE 0.32 0.45 16 71 242 341 100 2 44 527 K7U6F7 Putative WRKY DNA-binding domain superfamily protein OS=Zea mays GN=ZEAMMB73_393959 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 399 A V 0 0 185 44 36 VV V V I IIIF FIFIIFLFF III II III F IVI V VVV VFF FVVV V
2 400 A Q + 0 0 181 44 56 HQ Q Q Q QQQK QHKHQKQKM KQQ QQ KKK H KQK R QQQ RQS RQKQ H
3 401 A T + 0 0 136 46 31 TT T T TT TTTTT TTTTTTTTT TTT TT TTT T TTT A TTT ATT FTRT E
4 402 A T + 0 0 142 46 69 TT L V RK KRRRM RNRRRMRMK RRP KK RRR R KTR R TTT RRK RTVT M
5 403 A S - 0 0 95 47 36 SS S S SS SSSSS SSSSTSSSS TSS SS TTT S TSA S SSS SST NSSAS T
6 404 A E - 0 0 183 47 39 ED E E DE EEEEE QDEEEDDDE DID DD DDE E HED E DEE EDQ SEEAE Q
7 405 A V - 0 0 119 47 56 VI I V VV IVVVV VVVIVVVVI VLV KK VMV Q TVT A IVI AVL ILIDV E
8 406 A D - 0 0 151 47 24 DD D D DD DDDDD DDDDDDDDD DDE EE DDD D DDD P DDD PDE NEDGD E
9 407 A L - 0 0 91 52 41 LI I I IH HVIIV IIVVVVIVQ VII II IVV V VLVIIIII IILIMVI IVLTL V
10 408 A L - 0 0 153 58 45 LL L L ML LLMML M LIIMILMLL MMLMMIMMM MLM LMMLLPPPPI LLLPILL ILLSL C
11 409 A D + 0 0 106 58 41 DD D D EE EEDDD N DDDEDDDDD NNDEENDDD DDE DNDDDAAAAA DDDATDD SDDAD G
12 410 A D S S- 0 0 108 63 0 DD DDD DD DDDDD D DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD DDDDDDDDDDDDD D
13 411 A G S S+ 0 0 36 64 18 GG GGGG GG GGGGG G GGGGGGGGGGGGGGGGGGGGGGGEGGGGGEEEEG GGGEGGGGGGGPG S
14 412 A Y S S- 0 0 45 68 14 YYY YFYF YY YYYYYYCYYYYYYYFYYCCCFYYCYYYYFFYYYCYYYYYYYC YYYYCYYYYFYWYYW
15 413 A R - 0 0 166 70 58 RRR RRRRRKRQRKRRKQQQRRKKKKRKRHQQKKKQRRKRKKKSKQKRKSSSLQ RRRSQKKNQKRARQA
16 414 A W B -A 36 0A 15 73 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
17 415 A R - 0 0 161 73 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
18 416 A K + 0 0 80 73 0 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
19 417 A Y - 0 0 120 73 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
20 418 A G E -B 33 0A 25 73 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGgGG
21 419 A Q E -B 32 0A 123 73 12 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQKQQQQQHQQQQQQQQQQQQQQQQKQQKQQqKQ
22 420 A K E -B 31 0A 126 73 1 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKK
23 421 A V - 0 0 94 73 76 VVVVVVLVQAAMAAAAVPMPAPPAPVAVGMMMAAAMAASFAAPPSVPVAPPPPMMVVVPMVTLNMVPELP
24 422 A V > - 0 0 47 73 32 VVVVVVVVVVVAVVVVVIAIVVVVVVVVVAAAVVVAVVVIVVVIVAVVVIIIIATVVVIAVVVVVVIVVI
25 423 A K T 3 S+ 0 0 169 73 4 KKKKKKRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
26 424 A G T 3 S+ 0 0 64 73 45 GGGGGGGGGNNGNNNNNGGGNNSDNNNNNGGGNNNGNNNNNNNGNGSGNGGGGGNGGGGGNSGGNGGLNG
27 425 A N < - 0 0 32 73 49 NNNTNNNNSSSNSSSSTSNSNSSSSTSTSNNNSSSNSSSNSSSSSNSNSSSSSNNNNNSNSnSSSNSHSS
28 426 A P S S+ 0 0 98 73 37 PPPQPPPPPPPPPTPPQPPPKHPPVQPQPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPLnESSPPAPL
29 427 A Y S S- 0 0 140 73 61 YHNHHYNYFYYCYHHHHYCYFHHFHHHHFCCWHHHCYYNHHHHHHCHHHHHHHCLNNHHCHYFHNHYHNY
30 428 A P - 0 0 67 73 6 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPLPPPPPGPP
31 429 A R E -B 22 0A 45 73 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRrRRRRRRRrrsRRsRRRrrRtrr
32 430 A S E -BC 21 47A 11 73 56 SSSSYSSSASSASSSSSGAGSNNSNSSSSAAANSSASSNSNNNGSANsNGGGGASsssGAsNSHnsGsnn
33 431 A Y E +BC 20 46A 32 73 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
34 432 A Y E - C 0 45A 36 73 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
35 433 A K E - C 0 44A 60 73 27 KRKRKRKRKRRRRRRRRKRKRKRRKRRRRRRRRRRRRRKKRRRKCRRKRKKKKRKKKKKRRKKKKKRKRR
36 434 A C E -AC 16 43A 0 73 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
37 435 A T E + C 0 42A 52 73 34 TTTTSTTTTTTtTTTTTstgTTTTTTTTTtttTTTtTTSTTTTsTtTTAsssstaTTTstTSTTSTsTSs
38 436 A T - 0 0 27 72 90 TTHYSNSGHNTaSSNNQlslHTTNTQNASlasTNHmYYSSTTTiHaTFTvvvvaaFYYvaHIYYSYdFSl
39 437 A P S S- 0 0 124 73 78 PAPPSLILAPQPQHTTDRPRQPAQADSDTRPPPPMSTTEAPPARHPPAPRRRRTPTAQRVNRPPGQKAQK
40 438 A G S S+ 0 0 74 73 64 GGAGGKGKGDKGKTKKNGGGGNNTNKKNPGGGLKMGKKGRQLHGAANGNGGGGGGGGGYGNGNGGGAGGA
41 439 A C + 0 0 5 73 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
42 440 A G E +C 37 0A 61 73 66 NNPNANANTPSPTPPPRPPPNLPPPRPRGPPPPPPPHHNSPPPPGPPNPPPPPPPCSDPPREENNDENSS
43 441 A V E -C 36 0A 2 73 31 VVVVVVVVVVVVVVVVVAVAVVVVVVVVVVVVVVVVVVVAVVVAVVVVVAAAAVVVVVAVVVAVVVAVVA
44 442 A R E -CD 35 62A 131 73 24 RRRRRRRRRRKRKRKKKRRRKKRRRKKKKRRRRRRRKKKKRRRRKRRRRRRRRRKRRKRRKKKRKKRRKK
45 443 A K E -CD 34 61A 23 73 0 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKkKKKKKKKKKKKKKKK
46 444 A H E -CD 33 60A 49 73 54 HHHQHHHYHRRQRRRRRHQHQQRRRRRRRQQQRKRQRRKHRRRHQQRHRHHHHQqQHHHQRRIQRHMHRQ
47 445 A V E +CD 32 59A 12 73 7 VVVVVVVVVVVVVIVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVVIVVVVVVVVVIVVVVFVVIVIVV
48 446 A E E - D 0 58A 45 73 19 EEEEEEEEEEEQEEEEEEQEQEEEEEEEEQQQEEEQEEEEEEEEQQEEEEEEEQQEEEEQEEEEEEEEEE
49 447 A R E - D 0 57A 98 73 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRGRRRRRRRRRRRRRRR
50 448 A A E > + D 0 56A 25 73 71 AAAAAAAASKSCSKSSLACALCSKCLSLSCCCSSSCSSDSSSSSLCSACSSSSCCAACSCLDSVDCCADS
51 449 A A T 5S+ 0 0 39 73 79 SASSSSRSAAFAYASSAPAPSTIATACAPAAENAAASSRTSKTMAQTSILLLLAVFSSLASGPSNSRSPP
52 450 A T T 5S+ 0 0 147 73 51 STHTNDNDEDQDQDEEEDEDREEGDEEDEDDEEDEKKKEDEEEENEESEDDDDEEHNQDEEHDNEQDSQN
53 451 A D T 5S- 0 0 94 73 7 DDDDDDDDDDDDDDDDDDDDDNDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDGGDDDDND
54 452 A P T 5S- 0 0 120 73 65 PPLPPPPPEHPIPPQQPPTPEPPAPPPPRLVTASTTSSAPAAPSKMPPPPPPPRTALPPRCSKSSPPPSP
55 453 A K S - 0 0 47 73 32 LI
25 423 A K T 3 S+ 0 0 169 73 4 SK
26 424 A G T 3 S+ 0 0 64 73 45 GG
27 425 A N < - 0 0 32 73 49 QS
28 426 A P S S+ 0 0 98 73 37 PP
29 427 A Y S S- 0 0 140 73 61 HY
30 428 A P - 0 0 67 73 6 PP
31 429 A R E -B 22 0A 45 73 8 Rr
32 430 A S E -BC 21 47A 11 73 56 Ag
33 431 A Y E +BC 20 46A 32 73 0 YY
34 432 A Y E - C 0 45A 36 73 0 YY
35 433 A K E - C 0 44A 60 73 27 KR
36 434 A C E -AC 16 43A 0 73 0 CC
37 435 A T E + C 0 42A 52 73 34 Ts
38 436 A T - 0 0 27 72 90 .s
39 437 A P S S- 0 0 124 73 78 SK
40 438 A G S S+ 0 0 74 73 64 GG
41 439 A C + 0 0 5 73 0 CC
42 440 A G E +C 37 0A 61 73 66 gS
43 441 A V E -C 36 0A 2 73 31 lA
44 442 A R E -CD 35 62A 131 73 24 QR
45 443 A K E -CD 34 61A 23 73 0 kK
46 444 A H E -CD 33 60A 49 73 54 rQ
47 445 A V E +CD 32 59A 12 73 7 IV
48 446 A E E - D 0 58A 45 73 19 AE
49 447 A R E - D 0 57A 98 73 8 SR
50 448 A A E > + D 0 56A 25 73 71 KS
51 449 A A T 5S+ 0 0 39 73 79 GR
52 450 A T T 5S+ 0 0 147 73 51 RN
53 451 A D T 5S- 0 0 94 73 7 ED
54 452 A P T 5S- 0 0 120 73 65 KP
55 453 A K S