Complet list of 1iuf hssp file
Complete list of 1iuf.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1IUF
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-30
HEADER DNA BINDING PROTEIN 04-MAR-02 1IUF
COMPND MOL_ID: 1; MOLECULE: CENTROMERE ABP1 PROTEIN; CHAIN: A; FRAGMENT: N-TE
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: SCHIZOSACCHAROMYCES POMBE; ORGANISM_CO
AUTHOR J.KIKUCHI,J.IWAHARA,T.KIGAWA,Y.MURAKAMI,T.OKAZAKI, S.YOKOYAMA,RIKEN ST
DBREF 1IUF A 1 141 UNP P49777 ABP1_SCHPO 1 141
SEQLENGTH 144
NCHAIN 1 chain(s) in 1IUF data set
NALIGN 64
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : ABP1_SCHPO 1.00 1.00 4 144 1 141 141 0 0 522 P49777 ARS-binding protein 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=abp1 PE=1 SV=2
2 : S9Q4J5_SCHOY 0.58 0.80 4 141 1 139 139 1 1 516 S9Q4J5 ARS-binding protein OS=Schizosaccharomyces octosporus (strain yFS286) GN=SOCG_02041 PE=4 SV=1
3 : S9XA93_SCHCR 0.57 0.81 4 141 1 139 139 1 1 516 S9XA93 Uncharacterized protein OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_05545 PE=4 SV=1
4 : S9PVC1_SCHOY 0.50 0.77 7 143 8 145 138 1 1 516 S9PVC1 ARS-binding protein OS=Schizosaccharomyces octosporus (strain yFS286) GN=SOCG_00803 PE=4 SV=1
5 : S9VXS0_SCHCR 0.49 0.76 4 143 21 161 141 1 1 532 S9VXS0 Uncharacterized protein OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_05273 PE=4 SV=1
6 : S9RGD4_SCHOY 0.47 0.74 4 143 1 141 141 1 1 567 S9RGD4 ARS-binding protein OS=Schizosaccharomyces octosporus (strain yFS286) GN=SOCG_00860 PE=4 SV=1
7 : CBH1_SCHPO 0.46 0.76 9 144 5 141 137 1 1 514 O14423 CENP-B homolog protein 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cbh1 PE=1 SV=2
8 : CBH2_SCHPO 0.46 0.73 4 144 1 142 142 1 1 514 O60108 CENP-B homolog protein 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cbh2 PE=4 SV=1
9 : S9X0E3_SCHCR 0.45 0.73 4 143 1 141 141 1 1 567 S9X0E3 Uncharacterized protein OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_05200 PE=4 SV=1
10 : S9RDS3_SCHOY 0.44 0.68 4 140 1 138 138 1 1 505 S9RDS3 ARS-binding protein OS=Schizosaccharomyces octosporus (strain yFS286) GN=SOCG_04915 PE=4 SV=1
11 : S9VTB2_SCHCR 0.43 0.73 9 141 7 139 134 2 2 527 S9VTB2 Uncharacterized protein OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_05336 PE=4 SV=1
12 : S9X540_SCHCR 0.43 0.69 4 142 1 140 140 1 1 506 S9X540 Uncharacterized protein OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_05660 PE=4 SV=1
13 : S9RHG1_SCHOY 0.41 0.73 9 141 7 139 134 2 2 517 S9RHG1 ARS-binding protein OS=Schizosaccharomyces octosporus (strain yFS286) GN=SOCG_02703 PE=4 SV=1
14 : C5PE34_COCP7 0.37 0.60 8 144 3 138 137 1 1 519 C5PE34 Mariner-Tc1 transposon family protein OS=Coccidioides posadasii (strain C735) GN=CPC735_019520 PE=4 SV=1
15 : S8B290_PENO1 0.37 0.60 12 141 14 142 131 2 3 617 S8B290 Uncharacterized protein OS=Penicillium oxalicum (strain 114-2 / CGMCC 5302) GN=PDE_03473 PE=4 SV=1
16 : W6PT71_PENRO 0.36 0.64 16 144 16 142 129 1 2 520 W6PT71 Probable transposable element OS=Penicillium roqueforti GN=PROQFM164_S01g001199 PE=4 SV=1
17 : A7EBY8_SCLS1 0.34 0.60 7 144 3 140 139 2 2 436 A7EBY8 Putative uncharacterized protein OS=Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) GN=SS1G_02824 PE=4 SV=1
18 : Q5AQD5_EMENI 0.34 0.61 12 144 7 139 134 2 2 513 Q5AQD5 Uncharacterized protein OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=AN9495.2 PE=4 SV=1
19 : T5AAY7_OPHSC 0.34 0.64 6 144 6 142 140 3 4 532 T5AAY7 CENP-B protein 2 OS=Ophiocordyceps sinensis (strain Co18 / CGMCC 3.14243) GN=OCS_05303 PE=4 SV=1
20 : W3XMM6_9PEZI 0.34 0.55 9 140 4 146 143 3 11 188 W3XMM6 Uncharacterized protein OS=Pestalotiopsis fici W106-1 GN=PFICI_01038 PE=4 SV=1
21 : I8U8N7_ASPO3 0.33 0.57 7 144 1 135 138 2 3 476 I8U8N7 DNA-binding centromere protein B OS=Aspergillus oryzae (strain 3.042) GN=Ao3042_11441 PE=4 SV=1
22 : M7UY69_BOTF1 0.33 0.58 9 144 5 140 137 2 2 437 M7UY69 Uncharacterized protein OS=Botryotinia fuckeliana (strain BcDW1) GN=BcDW1_2499 PE=4 SV=1
23 : B2B3A8_PODAN 0.32 0.59 7 143 25 160 138 2 3 471 B2B3A8 Podospora anserina S mat+ genomic DNA chromosome 6, supercontig 2 OS=Podospora anserina (strain S / ATCC MYA-4624 / DSM 980 / FGSC 10383) GN=PODANS_6_560 PE=4 SV=1
24 : C7YJU8_NECH7 0.32 0.55 9 144 5 143 139 2 3 419 C7YJU8 Putative uncharacterized protein OS=Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI) GN=NECHADRAFT_75804 PE=4 SV=1
25 : C7Z6U2_NECH7 0.32 0.60 9 143 3 137 136 2 2 354 C7Z6U2 Putative uncharacterized protein (Fragment) OS=Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI) GN=NECHADRAFT_15043 PE=4 SV=1
26 : C9SFX9_VERA1 0.32 0.58 5 143 76 214 140 2 2 430 C9SFX9 Putative uncharacterized protein OS=Verticillium alfalfae (strain VaMs.102 / ATCC MYA-4576 / FGSC 10136) GN=VDBG_03492 PE=4 SV=1
27 : E9F298_METAR 0.32 0.59 10 143 27 160 135 2 2 380 E9F298 Centromere binding protein Cbh2 OS=Metarhizium anisopliae (strain ARSEF 23 / ATCC MYA-3075) GN=MAA_06296 PE=4 SV=1
28 : F7W287_SORMK 0.32 0.58 9 143 40 173 136 2 3 426 F7W287 WGS project CABT00000000 data, contig 2.21 OS=Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / K-hell) GN=SMAC_04719 PE=4 SV=1
29 : F8MID4_NEUT8 0.32 0.58 9 143 139 272 136 2 3 517 F8MID4 Putative uncharacterized protein OS=Neurospora tetrasperma (strain FGSC 2508 / ATCC MYA-4615 / P0657) GN=NEUTE1DRAFT_120840 PE=4 SV=1
30 : G0RBK8_HYPJQ 0.32 0.60 9 143 23 157 136 2 2 373 G0RBK8 Predicted protein OS=Hypocrea jecorina (strain QM6a) GN=TRIREDRAFT_75196 PE=4 SV=1
31 : G2XIB2_VERDV 0.32 0.58 5 143 73 211 140 2 2 452 G2XIB2 Putative uncharacterized protein OS=Verticillium dahliae (strain VdLs.17 / ATCC MYA-4575 / FGSC 10137) GN=VDAG_09894 PE=4 SV=1
32 : G2XV29_BOTF4 0.32 0.58 9 144 5 140 137 2 2 437 G2XV29 Uncharacterized protein OS=Botryotinia fuckeliana (strain T4) GN=BofuT4_P059290.1 PE=4 SV=1
33 : G4UI05_NEUT9 0.32 0.58 9 143 132 265 136 2 3 510 G4UI05 CenpB-DNA-bind-domain-containing protein OS=Neurospora tetrasperma (strain FGSC 2509 / P0656) GN=NEUTE2DRAFT_156589 PE=4 SV=1
34 : G9N577_HYPVG 0.32 0.60 9 143 23 157 136 2 2 373 G9N577 Uncharacterized protein OS=Hypocrea virens (strain Gv29-8 / FGSC 10586) GN=TRIVIDRAFT_182621 PE=4 SV=1
35 : M1W516_CLAP2 0.32 0.56 8 140 13 145 134 2 2 543 M1W516 Uncharacterized protein OS=Claviceps purpurea (strain 20.1) GN=CPUR_00470 PE=4 SV=1
36 : Q5ASL3_EMENI 0.32 0.61 12 144 7 139 134 2 2 524 Q5ASL3 Uncharacterized protein OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=AN8717.2 PE=4 SV=1
37 : Q7S9S3_NEUCR 0.32 0.58 9 143 37 170 136 2 3 419 Q7S9S3 Uncharacterized protein OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=NCU06592 PE=4 SV=2
38 : W3XB86_9PEZI 0.32 0.61 9 143 27 160 136 2 3 366 W3XB86 Uncharacterized protein OS=Pestalotiopsis fici W106-1 GN=PFICI_04583 PE=4 SV=1
39 : B6QEL4_PENMQ 0.31 0.55 4 144 6 144 141 2 2 520 B6QEL4 Cenp-B, putative OS=Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333) GN=PMAA_089560 PE=4 SV=1
40 : B8MC88_TALSN 0.31 0.56 5 144 6 142 140 2 3 411 B8MC88 Jerky, putative OS=Talaromyces stipitatus (strain ATCC 10500 / CBS 375.48 / QM 6759 / NRRL 1006) GN=TSTA_122660 PE=4 SV=1
41 : B8NCE0_ASPFN 0.31 0.53 7 144 1 135 138 2 3 517 B8NCE0 Cenp-B, putative OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167) GN=AFLA_041150 PE=4 SV=1
42 : E9DUH6_METAQ 0.31 0.57 3 143 20 160 142 2 2 384 E9DUH6 Cenp-B, putative OS=Metarhizium acridum (strain CQMa 102) GN=MAC_01274 PE=4 SV=1
43 : F9FBL2_FUSOF 0.31 0.58 2 143 41 182 143 2 2 371 F9FBL2 Uncharacterized protein OS=Fusarium oxysporum (strain Fo5176) GN=FOXB_03789 PE=4 SV=1
44 : G0S7L7_CHATD 0.31 0.60 5 143 22 159 140 2 3 416 G0S7L7 Putative uncharacterized protein OS=Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719) GN=CTHT_0036780 PE=4 SV=1
45 : G7XZQ3_ASPKW 0.31 0.49 5 144 6 142 140 2 3 523 G7XZQ3 Jerky OS=Aspergillus kawachii (strain NBRC 4308) GN=AKAW_10526 PE=4 SV=1
46 : G9NG56_HYPAI 0.31 0.59 6 143 230 367 139 2 2 584 G9NG56 Putative uncharacterized protein OS=Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) GN=TRIATDRAFT_280163 PE=4 SV=1
47 : I1RMR8_GIBZE 0.31 0.59 2 143 41 182 143 2 2 404 I1RMR8 Uncharacterized protein OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC 9075 / NRRL 31084) GN=FG05264.1 PE=4 SV=1
48 : J4URE0_BEAB2 0.31 0.59 9 143 30 164 136 2 2 381 J4URE0 Centromere binding protein B OS=Beauveria bassiana (strain ARSEF 2860) GN=BBA_02958 PE=4 SV=1
49 : J9MZU8_FUSO4 0.31 0.58 2 143 41 182 143 2 2 401 J9MZU8 Uncharacterized protein OS=Fusarium oxysporum f. sp. lycopersici (strain 4287 / CBS 123668 / FGSC 9935 / NRRL 34936) GN=FOXG_08442 PE=4 SV=1
50 : K3UKL3_FUSPC 0.31 0.58 3 143 42 182 142 2 2 404 K3UKL3 Uncharacterized protein OS=Fusarium pseudograminearum (strain CS3096) GN=FPSE_07228 PE=4 SV=1
51 : N1REG9_FUSC4 0.31 0.58 2 143 41 182 143 2 2 401 N1REG9 Uncharacterized protein OS=Fusarium oxysporum f. sp. cubense (strain race 4) GN=FOC4_g10011815 PE=4 SV=1
52 : N4UCF5_FUSC1 0.31 0.58 2 143 41 182 143 2 2 401 N4UCF5 Uncharacterized protein OS=Fusarium oxysporum f. sp. cubense (strain race 1) GN=FOC1_g10010228 PE=4 SV=1
53 : N4VRD2_COLOR 0.31 0.58 6 143 48 185 139 2 2 413 N4VRD2 Centromere binding protein B OS=Colletotrichum orbiculare (strain 104-T / ATCC 96160 / CBS 514.97 / LARS 414 / MAFF 240422) GN=Cob_08162 PE=4 SV=1
54 : S0DRH1_GIBF5 0.31 0.58 2 143 41 182 143 2 2 400 S0DRH1 Uncharacterized protein OS=Gibberella fujikuroi (strain CBS 195.34 / IMI 58289 / NRRL A-6831) GN=FFUJ_04880 PE=4 SV=1
55 : T0L848_COLGC 0.31 0.58 6 143 47 184 139 2 2 412 T0L848 Centromere binding protein B OS=Colletotrichum gloeosporioides (strain Cg-14) GN=CGLO_13015 PE=4 SV=1
56 : W7MCZ4_GIBM7 0.31 0.58 2 143 41 182 143 2 2 400 W7MCZ4 Uncharacterized protein OS=Gibberella moniliformis (strain M3125 / FGSC 7600) GN=FVEG_06234 PE=4 SV=1
57 : B8MJ39_TALSN 0.30 0.53 5 144 31 167 140 2 3 546 B8MJ39 Jerky, putative OS=Talaromyces stipitatus (strain ATCC 10500 / CBS 375.48 / QM 6759 / NRRL 1006) GN=TSTA_051370 PE=4 SV=1
58 : E3QV27_COLGM 0.30 0.58 8 143 45 180 137 2 2 407 E3QV27 Centromere binding protein B OS=Colletotrichum graminicola (strain M1.001 / M2 / FGSC 10212) GN=GLRG_09861 PE=4 SV=1
59 : F0X8G0_GROCL 0.30 0.58 5 143 81 218 140 2 3 508 F0X8G0 Centromere-binding protein OS=Grosmannia clavigera (strain kw1407 / UAMH 11150) GN=CMQ_3976 PE=4 SV=1
60 : J3NP63_GAGT3 0.30 0.58 1 143 110 251 144 2 3 398 J3NP63 Uncharacterized protein OS=Gaeumannomyces graminis var. tritici (strain R3-111a-1) GN=GGTG_03069 PE=4 SV=1
61 : M7TS18_EUTLA 0.30 0.60 1 143 42 183 144 2 3 402 M7TS18 Putative centromere binding protein b protein OS=Eutypa lata (strain UCR-EL1) GN=UCREL1_3521 PE=4 SV=1
62 : Q2U0W4_ASPOR 0.30 0.53 7 144 1 135 138 2 3 476 Q2U0W4 DNA-binding centromere protein B OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=AO090011000280 PE=4 SV=1
63 : R8BPD7_TOGMI 0.30 0.56 5 143 15 152 140 2 3 394 R8BPD7 Putative-dna-bind-domain-containing protein OS=Togninia minima (strain UCR-PA7) GN=UCRPA7_3425 PE=4 SV=1
64 : S3D5N5_GLAL2 0.30 0.55 9 144 28 163 137 2 2 436 S3D5N5 Homeo OS=Glarea lozoyensis (strain ATCC 20868 / MF5171) GN=GLAREA_04192 PE=4 SV=1
## ALIGNMENTS 1 - 64
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 -2 A G 0 0 121 3 59 AS
2 -1 A I + 0 0 161 10 47 V V V VV V V TL
3 0 A H - 0 0 172 12 91 QG G GGGG G G TS
4 1 A M + 0 0 177 22 78 MMM LM MMM M L QA A AAAA A A TH
5 2 A G - 0 0 75 30 63 GAA KP PPP P P P DA TSPS S SSSS S SA PPT S
6 3 A K + 0 0 74 34 57 KPP AP PPP P K A A KQ TPAKPP PPPPPPPPQ QSP P
7 4 A I + 0 0 27 40 84 IVVIIV LVV V V S M R K K HRMKKKRKK KKKKKKKKR KKKMK
8 5 A K S S- 0 0 136 43 72 KRRRRR RRR R R S R Q E E E K HHQEEETEE EEEEEEEEREEEEQE
9 6 A R S S- 0 0 155 60 0 RRRRRRRRRRRRRR R RRRRRRRR RRRRRRRR RRRRRRRRRRRRRRRRRRRRRRRRRRRR
10 7 A R S S+ 0 0 153 61 75 RRRQQQQQQQIQIT T QKRTHTHHHNNHHTNHV NHNKRHHHCHHHHHHHHHHHKSHHHRHN
11 8 A A + 0 0 4 61 62 AAAGGAAAAVTVTA P AAPPSASSSSSSSPSSP SSAAPSSSCSSSSSSSSSSSASSSSPSS
12 9 A I + 0 0 38 64 30 IIILLIVLIIVIVII ILIIIILVLLLLLLLILLILLLIIILLLILLLLLLLLLLLILLLLILI
13 10 A T - 0 0 15 64 48 TTTTTTSTTTSTSSP SSSTSSTSTTTTTTTSTTTSTTLPSTTTSTTTTTTTTTTTPSTTTSTP
14 11 A E S >> S+ 0 0 19 64 90 EEEQQILLIKGKGIE HDQNNHLHLLLLLLLHLLIDLLNNNLLLNLLLLLLLLLLLNLLLLNLH
15 12 A H H 3> S+ 0 0 121 64 70 HKKDDAAASNRIKEA SVATSSDSDDDDDDDSDDGVDDSESDDDSDDDDDDDDDDDEDDDDSDS
16 13 A E H 3> S+ 0 0 11 65 33 EEEEEEEEEEEEEQQQQQQQQQQQQQQQQQQQQQQQQQQWQQQQQQQQQQQQQQQQWQQQQQQQ
17 14 A K H <> S+ 0 0 16 65 27 KKKKKKKKKKKKKKKKRRRRKRRRRRRRRRRRRRRRRRKKKRRRKRRRRRRRRRRRKRRRRKRR
18 15 A R H X S+ 0 0 130 65 66 RKKKKKKKKKKKKKKQIKKRLVRIRRRRRRRVRRKKRRAALRRRARRRRRRRRRRRARRRRLRC
19 16 A A H X S+ 0 0 36 65 14 AEEAAAAAAEAAAEAAAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
20 17 A L H X S+ 0 0 0 65 9 LLLFFIIIILLLLLLLLLILLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
21 18 A R H X S+ 0 0 61 65 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
22 19 A H H X S+ 0 0 85 65 72 HNNDDNDKNRANASAIRDRKSRRHRRRRRRRRRRKDRRQASRRRTRRRRRRRRRRRARRRRSRR
23 20 A Y H < S+ 0 0 47 65 54 YYYYYFYHFFFYFHWWYWMWQYWYWWWWWWWYWWHWWWHQQWWWQWWWWWWWWWWWQWWWWQWY
24 21 A F H < S+ 0 0 86 65 89 FFFYYYYYYYYYYKHAYVAFHYAYAAAAAAAYAAHVAAHKHAAAHAAAAAAAAAAAHAAAAHAF
25 22 A F H < S+ 0 0 156 65 90 FFYFFSNFNFYFYARQQHRFAKNQNNENNNNKNNAHNHQRAENNQNNSNNNNNNNNRNNNHANQ
26 23 A Q S < S+ 0 0 126 65 79 QNNSSGQEGGNGNLASTSEDLTSTGAASSNATSTESSSRILSSSLNSSSSSSASASIATSSLAS
27 24 A L S S- 0 0 84 65 82 LSSLLFSSFCQCQNQQMQNPKTQAQQQQQQQTQQYQQQNNKQQQKQQQQQQQQQQQNQQQQKQT
28 25 A Q S S- 0 0 185 65 71 QSSTTEAAEVAVEPENNSATPTPNTTTTTTTTTTPSTTPQPTPPPTPVPPPPSPTPHTPVTPTT
29 26 A N S S+ 0 0 156 65 96 NDDEESHTSEREKSKPPRKTYPVPINIVVINPVISRVTTHYIIVQIIMIIIIIIIIHINVTYVP
30 27 A R - 0 0 205 65 56 RKKKKKRKKKKKKLRRKRWKLKRKRRRRRRRKRRLRRRMLLRRRLRRRRRRRRRRRLRRRRLRK
31 28 A S - 0 0 17 65 39 SPPPPPIPPPPPPSPPPPRDTPPPPPPPPPPPPPKPPPQTTPPPTPPPPPPPPPPPTPPPPTPP
32 29 A G >> - 0 0 22 65 69 GTTTTGPSGSTSTNTSSTHgQTSTSSPSSSSTSSQTSSYHQPSSNSSSSSSSSSSSHSSSSQSS
33 30 A Q H 3> S+ 0 0 97 55 34 QQQQQQQQQQQQQ.HQQQ.h.QHQHHHHHHHQHH.QHH...HHH.HHHHHHHHHHH.HHHH.HH
34 31 A Q H 3> S+ 0 0 126 65 67 QSSKKKKQKQVQIKASSKTALSKSKKKKKKKSKKTKKKKKLKKKIKKRKKKKKKKKQKKKKLKA
35 32 A D H <> S+ 0 0 57 65 60 DDDDDAEEAKEKEAAQDADDQDAAATAAAATDAASAAADDQAAADAAAAAAAAAAANAQASQAD
36 33 A L H >X S+ 0 0 3 65 94 LLLLLIVLILLLVLSCLCVALLCLCCCCCCCLCCLCCCLLLCCCLCCCCCCCCCCCLCCCCLCL
37 34 A I H 3X S+ 0 0 7 65 72 IIIIIKTIKMAMAKRSRIASCRIQIIIIIIIRIIAIIIATCIIIIIIIIIIIIIIIRIIIICIR
38 35 A E H 3X S+ 0 0 103 65 64 ESSDDEESDADADQEAAANFQADMEEAEEEEAEEVAEEKKQAEEKEEDEEEEDEDEKDEEDQEA
39 36 A W H >S+ 0 0 37 65 43 SSSTTTTTTSTSTSTTITTTSITMTTTTTTTITTTTTTTSSTTTSTTTTTTTTTTTTTTTTSTS
53 50 A V T 45S+ 0 0 2 65 13 VVVVVIIVIVLVLVLIVVIVVVVVVVVVVVVVVVIVVIIIVVVVVVVVVVVVVVVVVVVVIVVI
54 51 A S T >5S+ 0 0 66 65 6 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSSSSSSSSSSSSSSSSTSSSSSSS
55 52 A Q H >>S+ 0 0 114 65 69 QQQTTTESTQRQRELTRDEERRHRHHHHHHHRHHEDHHRRRHHHRHHHHHHHHHHHRHHHHRHR
56 53 A I H <5S+ 0 0 13 65 68 IIIIIVIIVIIIIIIISITIISSSSSSSSSSSSSSISSTIISSSISSSSSSSSSSSISSSSISS
57 54 A L H 4X5S+ 0 0 75 65 69 SSSPPSPSSSDSDKSKDPNYSKPDPPAPPPPKPPVPPPSYSAPPPPPPPPPPPPPPSPPPPSPD
60 57 A K H 3>X S+ 0 0 131 65 95 YYYYYYYFYFYFYHYHYYYPSYRHRRRRRRRYRREYRRFFSRRRYRRRRRRRRRRRFRRRRSRS
64 61 A L T 3< S+ 0 0 9 65 7 LLLLLLLLLLLLLLLLLLTLLLLLLLLLLLLLLLLLLLLILLLLLLLLLLLLLLLLILLLLLLL
65 62 A D T 34 S+ 0 0 91 65 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
66 63 A N T <4 S+ 0 0 143 65 72 NSHSSHDNHEDEDRLAVSDnGVGiSGGGGGGVGGdSGNEDGGSTAGSGSSSSGSGSEGGSNGGN
67 64 A T S < S- 0 0 44 23 82 TLLQQNGLTT.A.P.....aA..p..........t...MLA...I...........L....A..
68 65 A V - 0 0 78 51 76 VNTDDESDDSTTTV..CE.YSC.TDDN..DDC.DQE..HESND.PDDEDDDDDDDDQD...S.G
69 66 A E S S+ 0 0 108 65 81 EDDFFIVALSDSDNGGVCACSVDSNNNDDNNVDNHCDDGPSNNDDNNHNNNNNNNNENEDDSDG
70 67 A K S S+ 0 0 177 65 84 KIVIIVREVEQEQRSPANKQHAPQPPPPPPPAPPFNPNKHHPPTHPPPPPPPPPPPYPPANHPP
71 68 A P S S- 0 0 47 65 72 PDDRRNLNNTNANTAAAPPIQSQVQQQQQQQSQQKPQQAQQQQSRQQQQQQQQQQQQQQQQQQV
72 69 A W S S- 0 0 77 65 82 WGGGGEGSESLALSASDSSDLDLVLLLLLLLDLLSSLFVLLLLLLLLLLLLLLLLLLLLLFLLN
73 70 A D S S+ 0 0 106 65 67 DDDDDGDQGEHEHNKTFSKSRSSSSSSSSSSSSSSSSSNKRSSSNSSSSSSSSSSSKSSSSRSS
74 71 A V S S+ 0 0 8 65 72 VVVLLAIIAKAKAQTLGAARAEGEGGGGGGGEGGSAGGGDAGGGDGGGGGGGGGGGDGGGGAGR
75 72 A K S S+ 0 0 168 65 83 KKKKKVKRVPKTKKQRYTLKKYSFSSSSSSSYSSTTSSKKKSSSKSSSSSSSSSSSKSSSSKSY
76 73 A R + 0 0 163 65 15 RRRRRKKRKNRNRRRRRRKRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
77 74 A N + 0 0 46 65 89 NNNNNLINLINTNYQQTKRQRTLVLLLLLLLTLLLKLLIRRLLLRLLLLLLLLLLLRLLLLRLI
78 75 A R - 0 0 125 65 23 RRRRRRRRRLRLRRQLRGRRRRRRRRRRRRRRRRRGRRRRRRRRRRRRRRRRRRRRRRRRRRRR
79 76 A P - 0 0 86 65 103 PPPTTPAQPPSPSRPANILPANFTYFFFFFFNFFKIFFAVAFYFTFYFYYYYFYFYFFFFFAFD
80 77 A P - 0 0 88 65 62 PAAPPAPGAAPAPEPPCGGPECGCGGGGGGGCGGEGGGEEEGGGEGGGGGGGGGGGEGGGGEGC
81 78 A K S S+ 0 0 199 65 64 KKKKKKKKKKKKKHQQQQQKSQNQNNNNNNNQNNHQNNNQSNNNLNNNNNNNNNNNQNNNNSNQ
82 79 A Y > + 0 0 110 65 10 YYYYYYFYFYFYYWWWWWFWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
83 80 A P H > S+ 0 0 90 65 11 PPPPPPPPPPPPPPPPPQPEPPPPPPPPPPPPPPQQPPPPPPPPPPPPPPPPPPPPPPPPPPPP
84 81 A L H > S+ 0 0 144 65 88 LIIAAVLIVVAVAEILWDLTEWDWDDDDDDDWDDEDDDEEEDDDEDDDDDDDDDDDEDEDDEDW
85 82 A L H > S+ 0 0 34 65 30 LLLLLLLLLLLLLLLLILLLLLVLVVVVVVVLVVLLVVLLLVVVLVVVVVVVVVVVLVVVVLVI
86 83 A E H X S+ 0 0 33 65 2 EEEEEEDEEEEEEEEEEEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
87 84 A A H X S+ 0 0 67 65 66 AKKSSNNNNSKSKTATTAKERTKLKKKKKKKTKKIAKKSKRKKKAKKKKKKKKKKKKKKKKRKT
88 85 A A H X S+ 0 0 22 65 73 ALLAAAAAAAAAAARPLIAEALLLLLLLLLLLLLAILLTAALLLALLLLLLLLLLLALLLLALE
89 86 A L H X S+ 0 0 1 65 29 LLLLLLVLLLLLLLLLLLLLLLVLVVVVVVVLVVLLVVLVLVVVVVVVVVVVVVVVVVVVVLVL
90 87 A F H >X S+ 0 0 75 65 65 FFFFFAYIAVYVYYTCTYIIYTLSLLLLLLLTLLLYLLSMYLLLILLLLLLLLLLLMLLLLYLT
91 88 A E H 3X S+ 0 0 96 65 88 EEEEEEEDEEDENLEEDEEKENLDILLLLLLNLLTELLEDELILDLILIIIIRIRIDRLLLELC
92 89 A W H 3X S+ 0 0 10 65 2 WWWWWWWWWWWWWWWWWWLWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
93 90 A Q H << S+ 0 0 16 65 88 QQQQQRLQRIIIIILLLHVAILHLYYYHHYYLHYRHHHIIIYYHLYFYYFYYYYYYIYYHHIHL
94 91 A V H >< S+ 0 0 79 65 56 VQQQQLQTQQQQQQKAQHYFRQQDQQRQQQQQQQQHQQRRRRQQEQQQQQQQQQQQRQQSGRQQ
95 92 A Q H 3< S+ 0 0 170 65 64 QRRQQRQRRRDRDQRSETRWREQAQQQQQQQEQQRTQQRLRQQQRQQQQQQQQQQQLQQEQRSD
96 93 A Q T 3< S+ 0 0 93 65 72 QAAEEMRLMTSTAAAQVLLHVIMVVVVMMVVIMVFLMVAAVAVMAVVVVVVVVVVVAVMMVVMI
97 94 A G X + 0 0 20 65 61 GQQEEEEQEQHQHEREEDNEQELEQQQVVQQEVQGDVVKEQQQVQQQQQQQQQQQQEQTVVQVE
98 95 A D T 3 S- 0 0 164 65 77 DSSKKKVKKKLKFTEAACSCDAARAAAAAAAAAAACAANTDAANSAAAAAAAAAAATANNADNR
99 96 A D T 3 S+ 0 0 145 65 79 DQQRRQEQQSTSTSTGQKAVQRNQSSASSASGSAEKSSQEQASQYASSSSSSTSTSETARSQSR
100 97 A A < + 0 0 53 65 61 ANNSSNGDNDRDRIGLSGGAESGSGGGGGGGSGGDGGGDAEGGGIGGGGGGGGGGGAGGGGEGS
101 98 A T + 0 0 111 65 92 TqqggvlglititKeMgahgSgrgrrrrrrrgrrTarrLPSrrrTrrrrrrrrrrrPrrrrSrv
102 99 A L - 0 0 124 60 75 LivvvviiviavaLaPvitvLvpvpppppppvppIipp.I.ppp.ppppppppppp.pppp.pv
103 100 A S S S+ 0 0 56 61 49 STTSSSSTSNTNTSTPSSNTSTSSTTTSSTTTSTTSSSAS.TTS.TTTTTTTTTTT.TTSS.ST
104 101 A G S >> S+ 0 0 9 65 70 GGGCCSGGSEGENRGANGGGQNNNNNNNNNNNNNTGNNVQLNNNINNNNNNNNNNNINNNNLNN
105 102 A E H 3> S+ 0 0 134 65 44 EEEDDEDNEEDEDEDDKEDAHKEDEEDEEEEKEEDDEESESDEESEEDEEEEEEEESEEEESEL
106 103 A T H 3> S+ 0 0 48 65 80 TAAQQSMASVIVIVATIISMLAEVEEEEEEEAEELIEDQAQEEEQEEEEEEEEEEEQEEEEQEE
107 104 A I H <>>S+ 0 0 2 65 44 IIILLIIIIILILIIIILILIILILLLLLLLILLLLLLEIHLLLELLLLLLLLLLLELLLLHLI
108 105 A K H X5S+ 0 0 65 65 90 KKKRRRKKRRKRKRCIGIIKLGAKGGGAAGGGAGRIAAIRLGGAVGGGGGGGGGGGAGAVALAA
109 106 A R H X5S+ 0 0 186 65 79 REELLHQKHKAKALNARESDQHDREEEEEEEHEEEEEDLYIEDEIEDEDDDDEDEDIEDQDIES
110 107 A A H X5S+ 0 0 15 65 62 ATTAAAASAVATAKKKKKLRKKKKKKKKKKKKKKVKKKRKLKKKRKKKKKKKKKKKRKKKKLKK
111 108 A A H X5S+ 0 0 0 65 19 AAAAAAAAAAAAAAAAAAAGAAAAAAAAAAAAAAAAAAQAQAAAEAAAAAAAAAAAYAAAAQAA
112 109 A A H <> S+ 0 0 9 65 73 NNNNNNNNNSNSNNQNTSQENHPTPPPPPPPHPPNSPPNNNPPPNPPPPPPPPPPPNPPPPNPT
132 129 A G H 3> S+ 0 0 39 65 5 GGGGGGGGGNGNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
133 130 A W H 3> S+ 0 0 22 65 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
134 131 A L H X> S+ 0 0 2 65 36 LLLLLLLLLLVLVLLLVLLQMVIVIIIIIIIVIILLIILLMIIILIIIIIIIIIIILIIVIMIV
135 132 A E H 3X S+ 0 0 60 65 77 EEEEEEDEEKEEEEYNMHSCRIHAHHHHHHHIHHDHHHEFRHHHRHHHHHHHHHHHFHHHHRHV
136 133 A G H 3X S+ 0 0 26 65 65 GNNKKKKKKRGKGGKRKRGRGKRKRRRRRRRKRRKRRRRVGRRRGRRRRRRRRRRRGRRRRGRN
137 134 A F H S+ 0 0 12 65 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
138 135 A R H <5S+ 0 0 58 65 41 RKKKKKRKKQRQRQRKRKKRQRKRKKKKKKKRKKGKKKQQQKKKQKKKKKKKKKKKQKKKKQKK
139 136 A K H <5S+ 0 0 147 65 52 KRRRRKRKKKRNRKQKRQQTSRKKKKKKKKKRKKKQKKATSKKKYKKKKKKKKKKKTKKKKSKR
140 137 A R H <5S+ 0 0 168 65 9 RRRRRRRRRHRHRRRRRRRRHRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRHRR
141 138 A H T <5S+ 0 0 151 62 62 HHHHHHHCH YYYRFYHYQ WHYHYYYYYYYHYY YYYQNWYYYKYYYYYYYYYYYNYYYYWYH
142 139 A I S