Complet list of 1msf hssp file
Complete list of 1msf.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1MSF
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-04
HEADER DNA BINDING PROTEIN/DNA 24-JAN-95 1MSF
COMPND MOL_ID: 1; MOLECULE: DNA (5'- D(*CP*CP*TP*AP*AP*CP*TP*GP*AP*CP*AP*CP*A
SOURCE MOL_ID: 1; SYNTHETIC: YES; MOL_ID: 2; SYNTHETIC: YES; MOL_ID: 3; ORGAN
AUTHOR K.OGATA,S.MORIKAWA,H.NAKAMURA,A.SEKIKAWA,T.INOUE,H.KANAI, A.SARAI,S.IS
DBREF 1MSF C 90 193 UNP P06876 MYB_MOUSE 90 193
DBREF 1MSF A 1 16 PDB 1MSF 1MSF 1 16
DBREF 1MSF B 17 32 PDB 1MSF 1MSF 17 32
SEQLENGTH 105
NCHAIN 1 chain(s) in 1MSF data set
NALIGN 113
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : A1YF09_9PRIM 0.99 0.99 2 105 43 146 104 0 0 207 A1YF09 MYB (Fragment) OS=Gorilla gorilla GN=MYB PE=4 SV=1
2 : A1YG86_PANPA 0.99 0.99 2 88 43 129 87 0 0 129 A1YG86 MYB (Fragment) OS=Pan paniscus GN=MYB PE=4 SV=1
3 : A2D4Z2_ATEGE 0.99 0.99 2 88 19 105 87 0 0 105 A2D4Z2 MYB (Fragment) OS=Ateles geoffroyi GN=MYB PE=4 SV=1
4 : A2T6Q4_MACNE 0.99 0.99 2 88 43 129 87 0 0 129 A2T6Q4 MYB (Fragment) OS=Macaca nemestrina GN=MYB PE=4 SV=1
5 : A2T757_PANTR 0.99 0.99 2 88 43 129 87 0 0 129 A2T757 MYB (Fragment) OS=Pan troglodytes GN=MYB PE=4 SV=1
6 : P97753_9MURI 0.99 0.99 2 102 91 191 101 0 0 191 P97753 Gag/env/c-myb protein (Fragment) OS=Mus sp. GN=gag/env/c-myb PE=2 SV=1
7 : G5E3X0_9PIPI 0.97 0.98 14 105 1 91 92 1 1 99 G5E3X0 Putative v-myb myeloblastosis viral oncogene (Fragment) OS=Pipa carvalhoi PE=2 SV=1
8 : Q6LC31_MOUSE 0.95 0.99 2 87 43 128 86 0 0 128 Q6LC31 A-myb protein (Fragment) OS=Mus musculus GN=A-myb PE=4 SV=1
9 : Q5ZM07_CHICK 0.93 0.98 2 105 85 188 104 0 0 215 Q5ZM07 Uncharacterized protein OS=Gallus gallus GN=RCJMB04_3o7 PE=2 SV=1
10 : F2Q6J7_SPAAU 0.89 0.99 2 93 41 132 92 0 0 132 F2Q6J7 C-Myb (Fragment) OS=Sparus aurata PE=2 SV=1
11 : R7VL53_CAPTE 0.83 0.93 2 104 2 104 103 0 0 104 R7VL53 Uncharacterized protein (Fragment) OS=Capitella teleta GN=CAPTEDRAFT_87885 PE=4 SV=1
12 : A7RPT8_NEMVE 0.82 0.98 2 105 52 155 104 0 0 155 A7RPT8 Predicted protein (Fragment) OS=Nematostella vectensis GN=v1g38726 PE=4 SV=1
13 : B6ZL86_BOMMO 0.82 0.96 2 104 53 155 103 0 0 156 B6ZL86 Myb (Fragment) OS=Bombyx mori GN=BMmyb PE=2 SV=1
14 : B3S5T2_TRIAD 0.78 0.98 2 105 50 153 104 0 0 153 B3S5T2 Putative uncharacterized protein (Fragment) OS=Trichoplax adhaerens GN=TRIADDRAFT_17350 PE=4 SV=1
15 : D2UY60_NAEGR 0.78 0.95 2 105 3 106 104 0 0 112 D2UY60 Predicted protein (Fragment) OS=Naegleria gruberi GN=NAEGRDRAFT_29336 PE=4 SV=1
16 : V4AEU1_LOTGI 0.78 0.92 2 104 44 146 103 0 0 146 V4AEU1 Uncharacterized protein (Fragment) OS=Lottia gigantea GN=LOTGIDRAFT_88057 PE=4 SV=1
17 : H3CH46_TETNG 0.76 0.88 2 105 14 117 104 0 0 117 H3CH46 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis PE=4 SV=1
18 : T1EI44_HELRO 0.76 0.90 2 104 44 146 103 0 0 150 T1EI44 Uncharacterized protein (Fragment) OS=Helobdella robusta GN=HELRODRAFT_134329 PE=4 SV=1
19 : D8QRN8_SELML 0.72 0.92 2 105 44 147 104 0 0 147 D8QRN8 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_75978 PE=4 SV=1
20 : F0Y971_AURAN 0.72 0.90 13 104 1 92 92 0 0 94 F0Y971 Putative uncharacterized protein (Fragment) OS=Aureococcus anophagefferens GN=AURANDRAFT_26333 PE=4 SV=1
21 : B8CFV0_THAPS 0.71 0.88 2 105 48 151 104 0 0 158 B8CFV0 MYB DNA binding protein/ transcription factor-like protein (Fragment) OS=Thalassiosira pseudonana GN=MYB5 PE=4 SV=1
22 : E9HSG4_DAPPU 0.71 0.93 10 99 1 90 90 0 0 90 E9HSG4 Putative uncharacterized protein (Fragment) OS=Daphnia pulex GN=DAPPUDRAFT_6813 PE=4 SV=1
23 : L1IYE0_GUITH 0.71 0.84 7 101 2 96 95 0 0 98 L1IYE0 Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_158254 PE=4 SV=1
24 : Q5QT29_ORYSA 0.71 0.90 2 104 29 131 103 0 0 135 Q5QT29 Transcription factor (Fragment) OS=Oryza sativa GN=myb15 PE=4 SV=1
25 : B7G6Q9_PHATC 0.70 0.85 2 105 14 117 104 0 0 119 B7G6Q9 Predicted protein (Fragment) OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_15016 PE=4 SV=1
26 : B8CD10_THAPS 0.70 0.86 4 105 1 103 103 1 1 105 B8CD10 Predicted protein (Fragment) OS=Thalassiosira pseudonana GN=THAPSDRAFT_18795 PE=4 SV=1
27 : C1MP23_MICPC 0.67 0.84 2 105 27 130 104 0 0 131 C1MP23 Predicted protein (Fragment) OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_15661 PE=4 SV=1
28 : Q9LE88_SECCE 0.67 0.88 2 97 3 98 96 0 0 98 Q9LE88 C-myb-like transcription factor (Fragment) OS=Secale cereale GN=MYB3R-1 PE=2 SV=1
29 : F0YEP8_AURAN 0.66 0.81 4 105 1 101 102 1 1 101 F0YEP8 Putative uncharacterized protein (Fragment) OS=Aureococcus anophagefferens GN=AURANDRAFT_7774 PE=4 SV=1
30 : L1IW43_GUITH 0.66 0.81 3 105 20 122 103 0 0 123 L1IW43 Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_75666 PE=4 SV=1
31 : F0ZGK6_DICPU 0.65 0.81 3 103 1 101 101 0 0 101 F0ZGK6 Putative uncharacterized protein (Fragment) OS=Dictyostelium purpureum GN=DICPUDRAFT_11392 PE=4 SV=1
32 : A9TTE3_PHYPA 0.64 0.84 3 97 1 95 95 0 0 95 A9TTE3 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_29378 PE=4 SV=1
33 : B7G0J2_PHATC 0.64 0.86 3 103 1 101 101 0 0 101 B7G0J2 Predicted protein (Fragment) OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_6839 PE=4 SV=1
34 : L1IJK8_GUITH 0.64 0.80 3 105 1 103 103 0 0 119 L1IJK8 Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_79200 PE=4 SV=1
35 : L1IXC9_GUITH 0.64 0.81 4 105 8 109 102 0 0 117 L1IXC9 Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_75327 PE=4 SV=1
36 : L1JLD2_GUITH 0.64 0.82 3 103 16 122 107 1 6 160 L1JLD2 Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_68537 PE=4 SV=1
37 : A9S7B8_PHYPA 0.63 0.79 3 104 16 117 102 0 0 117 A9S7B8 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_125088 PE=4 SV=1
38 : A9SF39_PHYPA 0.63 0.79 3 104 27 128 102 0 0 129 A9SF39 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_128885 PE=4 SV=1
39 : A9SQ68_PHYPA 0.63 0.84 2 104 3 105 103 0 0 105 A9SQ68 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_49319 PE=4 SV=1
40 : B8BVC5_THAPS 0.63 0.83 2 104 2 104 103 0 0 104 B8BVC5 Predicted protein (Fragment) OS=Thalassiosira pseudonana GN=THAPSDRAFT_32015 PE=4 SV=1
41 : A9T941_PHYPA 0.62 0.83 2 104 3 105 103 0 0 105 A9T941 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_49308 PE=4 SV=1
42 : L1IEK6_GUITH 0.62 0.79 2 104 2 104 103 0 0 104 L1IEK6 Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_49707 PE=4 SV=1
43 : B7XIH3_ENTBH 0.61 0.83 3 103 10 110 101 0 0 113 B7XIH3 Myb superfamily protein OS=Enterocytozoon bieneusi (strain H348) GN=EBI_27143 PE=4 SV=1
44 : A0DQG0_PARTE 0.60 0.81 4 105 10 112 103 1 1 136 A0DQG0 Chromosome undetermined scaffold_6, whole genome shotgun sequence OS=Paramecium tetraurelia GN=GSPATT00002677001 PE=4 SV=1
45 : A8J0K3_CHLRE 0.60 0.77 2 105 12 115 104 0 0 115 A8J0K3 Predicted protein (Fragment) OS=Chlamydomonas reinhardtii GN=CHLREDRAFT_103798 PE=4 SV=1
46 : C1E9V1_MICSR 0.60 0.80 1 104 29 132 104 0 0 132 C1E9V1 Predicted protein (Fragment) OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_83790 PE=4 SV=1
47 : D8RRF1_SELML 0.60 0.77 3 104 21 122 102 0 0 122 D8RRF1 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_99462 PE=4 SV=1
48 : D8RXB4_SELML 0.60 0.77 3 104 21 122 102 0 0 125 D8RXB4 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_103999 PE=4 SV=1
49 : A4S3U0_OSTLU 0.59 0.79 4 104 9 109 101 0 0 109 A4S3U0 Predicted protein (Fragment) OS=Ostreococcus lucimarinus (strain CCE9901) GN=OSTLU_8749 PE=4 SV=1
50 : E1Z785_CHLVA 0.59 0.80 10 104 1 95 95 0 0 99 E1Z785 Putative uncharacterized protein (Fragment) OS=Chlorella variabilis GN=CHLNCDRAFT_20303 PE=4 SV=1
51 : A9RHS3_PHYPA 0.58 0.77 4 104 1 101 101 0 0 102 A9RHS3 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_8347 PE=4 SV=1
52 : C1EIQ8_MICSR 0.58 0.78 3 104 1 102 102 0 0 102 C1EIQ8 Predicted protein (Fragment) OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_74639 PE=4 SV=1
53 : D8QTC6_SELML 0.58 0.78 3 104 12 113 102 0 0 113 D8QTC6 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_113522 PE=4 SV=1
54 : D8U1W9_VOLCA 0.58 0.76 2 105 13 116 104 0 0 128 D8U1W9 Transcription factor Myb13 OS=Volvox carteri GN=myb13 PE=4 SV=1
55 : F0Y4P7_AURAN 0.58 0.80 7 105 29 132 104 1 5 132 F0Y4P7 Putative uncharacterized protein (Fragment) OS=Aureococcus anophagefferens GN=AURANDRAFT_24448 PE=4 SV=1
56 : F0YL55_AURAN 0.58 0.78 7 105 9 113 105 1 6 134 F0YL55 Putative uncharacterized protein OS=Aureococcus anophagefferens GN=AURANDRAFT_39143 PE=4 SV=1
57 : G5A1P6_PHYSP 0.58 0.74 7 104 1 98 98 0 0 101 G5A1P6 Putative uncharacterized protein (Fragment) OS=Phytophthora sojae (strain P6497) GN=PHYSODRAFT_434669 PE=4 SV=1
58 : I0Z538_9CHLO 0.58 0.75 4 104 7 107 101 0 0 107 I0Z538 Uncharacterized protein (Fragment) OS=Coccomyxa subellipsoidea C-169 GN=COCSUDRAFT_12896 PE=4 SV=1
59 : L1IVB9_GUITH 0.58 0.70 2 105 1 103 110 2 13 117 L1IVB9 Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_75814 PE=4 SV=1
60 : R1DY79_EMIHU 0.58 0.74 7 103 3 108 106 1 9 108 R1DY79 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_76507 PE=4 SV=1
61 : B5YND9_THAPS 0.57 0.81 4 105 5 106 103 2 2 108 B5YND9 MYB DNA binding protein/ transcription factor-like protein (Fragment) OS=Thalassiosira pseudonana GN=MYB23 PE=4 SV=1
62 : I0Z387_9CHLO 0.57 0.85 3 104 1 102 102 0 0 102 I0Z387 Homeodomain-like protein (Fragment) OS=Coccomyxa subellipsoidea C-169 GN=COCSUDRAFT_9603 PE=4 SV=1
63 : R1BM94_EMIHU 0.57 0.73 1 103 12 117 106 2 3 117 R1BM94 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_76882 PE=4 SV=1
64 : R1F7Z2_EMIHU 0.57 0.76 2 104 3 105 103 0 0 105 R1F7Z2 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_46684 PE=4 SV=1
65 : R1F7Z3_EMIHU 0.57 0.76 4 104 1 101 101 0 0 104 R1F7Z3 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_60784 PE=4 SV=1
66 : A8JBC5_CHLRE 0.56 0.78 4 104 1 101 101 0 0 110 A8JBC5 Predicted protein (Fragment) OS=Chlamydomonas reinhardtii GN=CHLREDRAFT_106655 PE=4 SV=1
67 : C1MID0_MICPC 0.56 0.78 5 104 1 102 102 1 2 102 C1MID0 Predicted protein (Fragment) OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_8157 PE=4 SV=1
68 : R1CWU4_EMIHU 0.56 0.77 3 103 2 102 101 0 0 102 R1CWU4 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_42066 PE=4 SV=1
69 : R1E939_EMIHU 0.56 0.71 7 103 1 87 97 2 10 87 R1E939 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_46436 PE=4 SV=1
70 : A9TCW1_PHYPA 0.55 0.82 3 105 1 103 103 0 0 109 A9TCW1 Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_143649 PE=4 SV=1
71 : R1CS15_EMIHU 0.55 0.71 4 104 1 106 106 1 5 110 R1CS15 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_58747 PE=4 SV=1
72 : A8IWU4_CHLRE 0.54 0.75 3 104 1 111 111 1 9 111 A8IWU4 Predicted protein (Fragment) OS=Chlamydomonas reinhardtii GN=CHLREDRAFT_117291 PE=4 SV=1
73 : D8SJI8_SELML 0.54 0.83 3 105 8 110 103 0 0 116 D8SJI8 Putative uncharacterized protein OS=Selaginella moellendorffii GN=SELMODRAFT_118412 PE=4 SV=1
74 : D8TVF3_VOLCA 0.53 0.70 2 104 2 104 103 0 0 104 D8TVF3 Transcription factor Myb9 (Fragment) OS=Volvox carteri GN=myb9 PE=4 SV=1
75 : F0Y4D5_AURAN 0.53 0.78 2 104 18 119 103 1 1 122 F0Y4D5 Putative uncharacterized protein (Fragment) OS=Aureococcus anophagefferens GN=AURANDRAFT_23692 PE=4 SV=1
76 : G0QWF7_ICHMG 0.53 0.75 1 105 1 111 111 1 6 172 G0QWF7 Myb protein, putative (Fragment) OS=Ichthyophthirius multifiliis (strain G5) GN=IMG5_131840 PE=4 SV=1
77 : L1K518_GUITH 0.53 0.79 6 105 1 100 100 0 0 100 L1K518 Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_44603 PE=4 SV=1
78 : D8UBS2_VOLCA 0.52 0.76 3 96 1 103 103 1 9 103 D8UBS2 Transcription factor Myb7 (Fragment) OS=Volvox carteri GN=myb7 PE=4 SV=1
79 : A8ITV7_CHLRE 0.51 0.67 2 104 17 123 107 1 4 123 A8ITV7 Predicted protein (Fragment) OS=Chlamydomonas reinhardtii GN=CHLREDRAFT_116056 PE=4 SV=1
80 : B8BTP0_THAPS 0.51 0.70 4 104 3 102 101 1 1 105 B8BTP0 Predicted protein (Fragment) OS=Thalassiosira pseudonana GN=THAPSDRAFT_18820 PE=4 SV=1
81 : A8HP47_CHLRE 0.50 0.72 5 105 11 111 101 0 0 111 A8HP47 Predicted protein (Fragment) OS=Chlamydomonas reinhardtii GN=CHLREDRAFT_116461 PE=4 SV=1
82 : A9SCW7_PHYPA 0.50 0.66 2 105 4 108 105 1 1 117 A9SCW7 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_37293 PE=4 SV=1
83 : A9SG26_PHYPA 0.50 0.74 4 104 3 103 101 0 0 109 A9SG26 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_8318 PE=4 SV=1
84 : A9TBS6_PHYPA 0.50 0.73 4 104 4 104 101 0 0 114 A9TBS6 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_17294 PE=4 SV=1
85 : A9TIJ6_PHYPA 0.50 0.67 2 105 4 108 105 1 1 117 A9TIJ6 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_37294 PE=4 SV=1
86 : B7G9A5_PHATC 0.50 0.72 4 104 1 100 101 1 1 103 B7G9A5 Predicted protein (Fragment) OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_15516 PE=4 SV=1
87 : D8RS30_SELML 0.50 0.72 4 104 1 101 101 0 0 101 D8RS30 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_38644 PE=4 SV=1
88 : Q20CJ8_EUPAE 0.50 0.66 1 103 25 133 109 1 6 143 Q20CJ8 EMYB6 OS=Euplotes aediculatus GN=emyb6 PE=4 SV=1
89 : R1FIL2_EMIHU 0.50 0.67 7 104 1 101 101 1 3 107 R1FIL2 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_60792 PE=4 SV=1
90 : D8SJG6_SELML 0.49 0.73 4 105 1 102 102 0 0 102 D8SJG6 Putative uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_8484 PE=4 SV=1
91 : D8U5P1_VOLCA 0.49 0.75 5 105 1 110 110 1 9 110 D8U5P1 Transcription factor Myb2 (Fragment) OS=Volvox carteri GN=myb2 PE=4 SV=1
92 : S8CMK2_9LAMI 0.49 0.71 4 104 4 104 101 0 0 107 S8CMK2 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_08762 PE=4 SV=1
93 : S8EIZ5_9LAMI 0.46 0.70 2 104 2 105 104 1 1 108 S8EIZ5 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_02172 PE=4 SV=1
94 : A9T187_PHYPA 0.45 0.62 2 99 12 118 107 1 9 118 A9T187 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_19381 PE=4 SV=1
95 : Q8S3Z3_SORBI 0.45 0.63 2 105 2 106 105 1 1 106 Q8S3Z3 Typical A-type R2R3 Myb protein (Fragment) OS=Sorghum bicolor GN=MybY1 PE=4 SV=1
96 : B8CFL3_THAPS 0.44 0.64 4 99 2 103 102 3 6 103 B8CFL3 MYB DNA binding protein/ transcription factor-like protein (Fragment) OS=Thalassiosira pseudonana GN=MYB7 PE=4 SV=1
97 : F1LKD1_PETHY 0.44 0.64 2 104 2 105 104 1 1 105 F1LKD1 Myb-like transcription factor EOBI (Fragment) OS=Petunia hybrida PE=2 SV=1
98 : Q9FXC3_ARATH 0.44 0.62 2 99 12 118 107 1 9 118 Q9FXC3 Putative myb transcription factor-partial protein (Fragment) OS=Arabidopsis thaliana GN=F25P12.1 PE=2 SV=1
99 : A9SSU0_PHYPA 0.42 0.65 2 105 1 106 106 2 2 108 A9SSU0 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_37391 PE=4 SV=1
100 : S8CA46_9LAMI 0.42 0.66 2 105 12 117 106 1 2 123 S8CA46 Myb transcription factor (Fragment) OS=Genlisea aurea GN=M569_13569 PE=4 SV=1
101 : S8CFI4_9LAMI 0.42 0.64 2 105 12 117 106 2 2 118 S8CFI4 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_11493 PE=4 SV=1
102 : A9TP14_PHYPA 0.41 0.59 2 99 12 123 112 1 14 123 A9TP14 Predicted protein (Fragment) OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_8874 PE=4 SV=1
103 : V5K203_SALMI 0.40 0.62 2 105 12 117 106 2 2 118 V5K203 R2R3 MYB transcription factor 87 (Fragment) OS=Salvia miltiorrhiza GN=MYB87 PE=4 SV=1
104 : D5FFI5_POPCA 0.36 0.52 2 105 12 132 121 3 17 148 D5FFI5 MYB transcription factor (Fragment) OS=Populus canadensis GN=MYBA1 PE=2 SV=1
105 : S8CSC4_9LAMI 0.36 0.59 2 104 5 111 107 2 4 111 S8CSC4 Anthocyanin 2 (Fragment) OS=Genlisea aurea GN=M569_07002 PE=4 SV=1
106 : A5BR18_VITVI 0.35 0.54 2 105 19 154 136 1 32 234 A5BR18 Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_029986 PE=4 SV=1
107 : D7MJD9_ARALL 0.35 0.52 7 105 11 138 128 1 29 147 D7MJD9 Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_916274 PE=4 SV=1
108 : K3Y1M3_SETIT 0.34 0.50 2 104 12 138 127 5 24 230 K3Y1M3 Uncharacterized protein OS=Setaria italica GN=Si008089m.g PE=4 SV=1
109 : A9T1M6_PHYPA 0.33 0.54 4 105 12 134 123 1 21 153 A9T1M6 Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_138897 PE=4 SV=1
110 : M0SLA8_MUSAM 0.32 0.46 4 105 17 155 139 5 37 290 M0SLA8 Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1
111 : M8D623_AEGTA 0.31 0.47 1 105 1 142 143 4 39 284 M8D623 Transcription factor RAX3 OS=Aegilops tauschii GN=F775_07913 PE=4 SV=1
112 : C0PM92_MAIZE 0.30 0.46 4 105 14 151 138 5 36 249 C0PM92 Putative MYB DNA-binding domain superfamily protein OS=Zea mays GN=ZEAMMB73_660042 PE=2 SV=1
113 : W2QU82_PHYPN 0.30 0.50 2 100 1 131 131 1 32 189 W2QU82 Uncharacterized protein OS=Phytophthora parasitica (strain INRA-310) GN=PPTG_06996 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 89 C M 0 0 211 6 33 L V
2 90 C L + 0 0 166 57 29 LLLLLL LLLLLLLVLLLL L LL LL ILIL IK I L VV
3 91 C I - 0 0 124 76 74 IIIIII IIVVIVIVIVIV V IV VV IIIIV VIIIVIAI KIII IVK N VHA V M
4 92 C K S S+ 0 0 143 98 16 KKKKKK KKKKKKKKKKKK K KKKKKKKKKKKKKKKKKKRKKKKKKK KKRK KK KKKKKR K K
5 93 C G S S- 0 0 44 101 26 GGGGGG GGGGGGGGGGGG G GGGGGGCGGGGAGGGGGGKGKGGGGG GGGG GS KGKRRGGR G
6 94 C P S S- 0 0 118 102 63 PPPPPP PPPPPPPPPPSP P PPPPPPPPLPQPPPPQAQPPPAGPPQ PLPA AP PNPPPPAA Q
7 95 C W + 0 0 28 110 0 WWWWWW WWWWWWWWWWWW W WWWWWWWWWWWWWWWWWWWWWWWWWWW WWWWWWWWWWWWWWWWWWWW
8 96 C T + 0 0 72 109 51 TTTTTT TTTTTTTTTTTT T TTTTTSATRTTTTGSSTTSTTTTTSSS STSTSSTTTTTLTQQTTATG
9 97 C K S S+ 0 0 177 110 72 KKKKKK KKKKKKKQKAKK P KQEDKKPKEIAKPKPPPPPKKDPPPPA PPPPRQPPKPDAEPPPTEPA
10 98 C E S S+ 0 0 97 112 41 EEEEEE EEEEEEEEEAEE EQEEDREEEEDEDEEDEEEQEEEYEEEEEQEEEEKQEEEEEDEAAEQDEA
11 99 C E S > S+ 0 0 21 112 3 EEEEEE EEEEEEEEEEEE EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEE
12 100 C D H > S+ 0 0 20 112 1 DDDDDD DDDDDDDDDDDD DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
13 101 C Q H 4 S+ 0 0 103 113 69 QQQQQQ QQQDEEENDQKDQHDEDRADDADKREEEDVVRQREQSAAAAGDAAAADSAEKAQDRDDKERAR
14 102 C R H >> S+ 0 0 59 114 83 RRRRRRRRRKKKKKQKKKRKLLTQKKKVLRMHTLEVVVYMYLRQLHAARLAHNLAAVAIKKRARRQLLKV
15 103 C V H 3X S+ 0 0 0 114 25 VVVVVVVVVVVVVIVVVVIVVVVIVVIIVVLLVVVVLLLVLILVLLLLLLLLLLIIILVLILLLLLLLLL
16 104 C I H 3X S+ 0 0 49 114 77 IIIIIIIIIILVVVVVILIILVIITRIIVAIIVIRINNVIVLVMTLQQARQLQTVKLKILGKVLLTAMLR
17 105 C K H <> S+ 0 0 153 114 63 EEEEEEEEEDHEEQEEDEEERQRDEEEQEDQERKKRRREDGKKQKRKKDQQRKKRDRREREQAEEDRERK
18 106 C L H X>S+ 0 0 21 113 39 LLLLLLLLLLLLLLLLLLLLLLLLLLLMLLLLLYLLLLLLLLILLLLLLLLLLLLLFLLMLLALLLLVML
19 107 C V H <5S+ 0 0 2 114 16 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVIVVVVIVVIIVVVVSVVVVVVVVVVV
20 108 C Q H <5S+ 0 0 117 114 79 QQQQQQQQQHRNAHKVESNKQHEKQGKKEANKKSDGEEEREAQEAEDDAKEEQTMMRLWAEKQQQLKGAK
21 109 C K H <5S+ 0 0 129 114 72 KKKKKKKKKKHKKQKTKEKQKRLKKKAKRRKRTMKQKKRIMTKLTEKKKEKEKTTREMSQKTKRRSERQQ
22 110 C Y T <5S- 0 0 91 114 60 YYYYYYYYCYYYYYYYFHYHYYYYYYLYHFYYHYHYFFHHHHYYYHYYYYYHYYHYCHAYYEYFFCHLYY
23 111 C G < - 0 0 10 114 11 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGLGGGGGGGGGGG
24 112 C P S S+ 0 0 38 111 69 PPPPPPPPPPPPPPAPTPAAAPPPAAAPPVAQTTDPAAQQQTAPTTAATPATATTTTP.P.EPPSATAPS
25 113 C K S S+ 0 0 162 113 70 KKKKKKKKKKKKKKKKKKKRKKTTKKKTKKRQKKKKRRQKQRRHKKRRRKRKRKKRKK.SHGCHHQKQSR
26 114 C R >> + 0 0 103 113 69 RRRRRRRRRRRKKRKRRRKKKKRKKKQKKSERKQSRNNRKRKNKNENNRNNENNRSRN.SKQRRRRARSR
27 115 C W H 3> S+ 0 0 43 113 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW.WWWWWWWWWWW
28 116 C S H 3> S+ 0 0 78 113 56 SSSSSSSSSSTSTSSTSTSSSTASSSSSSSSTSAVSSSSSSSATSSSSSSSSSSASAS.SSSSGGSSSSS
29 117 C V H X> S+ 0 0 52 113 82 VVVVVVVLLVVLLLLLLIVQLQSVTQKTTATLLRLLLLLFLVLFVKLLHILKLIVVKL.QKAVKKTELQL
30 118 C I H >X S+ 0 0 0 113 13 IIIIIIIIIIIIIIIIIIIIIIIIIIIIILIIIIVIIIIIIIIIVIIIIIIIIVIIII.IIIIIIIIIII
31 119 C A H >< S+ 0 0 4 114 33 AAAAAAAAAASAAAASASAAAAAAAAAAAAAAAAAAAAAAASGAAASSAASASASAAAAASAAAAAAAAS
32 120 C K H << S+ 0 0 143 114 75 KKKKKKKKKKKQREQKKKQQSKSKREQQATYNRLSSRRTRTSKSARKKRNKRKAQEAGSQKRTSSERSQT
33 121 C H H << S+ 0 0 74 114 85 HHHHHHHHHHHHHKHHHYNQHHNAHEQAHYRYQVFNGGQQQQVKGLGGAGGLGGEHLGHHEHSHHSLQHF
34 122 C L S X< S- 0 0 5 114 47 LLLLLLLLLLLLLLLLLILLLLLLLLLLLMILLLLLIILLLLMLILIILIILVILILILLIFILLILMLM
35 123 C K T 3 S+ 0 0 138 114 73 KKKKKKKKKQKKKQPRHKPPPNPPPPPPPPPSNPPPPPTQPTEPKHAAEKPHPRnvPHPPPPsPPPpDPA
36 124 C G T 3 S+ 0 0 53 114 53 GGGGGGGGGGGGGGGGSGGGGGGGGGGGGGGGGGNGGGGGGGTGGMGGNGGMGGfgGGGGGGgGGGqGGN
37 125 C R S < S+ 0 0 20 114 7 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
38 126 C I > - 0 0 93 114 75 IIIIIIIIIITIIIITNTIIIINIVIIIVTTILKTTSSILISIISASSVSSDSSSTTSTVIIVIISRVVS
39 127 C G T 3> S+ 0 0 35 114 13 GGGGGGGGGGGGGGGGGGGSGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGPGGGGGGGGGGGGG
40 128 C K H 3> S+ 0 0 148 114 4 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
41 129 C Q H <> S+ 0 0 38 114 54 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSQQQQQQSQSSQSSQSSQQQSQQQQQQQSQQQQ
42 130 C C H > S+ 0 0 0 114 5 CCCCCCCCCCCCCCCCICCCCCCCCCCCCICCCCICCCCCCCCCCCCCCCCCCCCCCCICCCCCCCCCCC
43 131 C R H < S+ 0 0 65 114 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
44 132 C E H < S+ 0 0 68 114 64 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEELLEEEEEELELLELLELLTETLEETEEEELEEEE
45 133 C R H < S+ 0 0 55 114 1 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRrRRRRRRRHRRRRRRRRRRRRRRRRRR
46 134 C W S < S+ 0 0 0 114 2 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWFWwWWWWWWWWWWWWWWWWWWWWWWWWWW
47 135 C H S S+ 0 0 49 114 82 HHHHHHHHHHHHHHHHHHHHHHHHCHYHHHLHYHHHCCHCHKHHHHCCNCCHCHLHLCHHLHNHHWIFHV
48 136 C N S S- 0 0 43 114 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
49 137 C H S S+ 0 0 21 114 52 HHHHHHHHHHHHHHHHHHHHHHQHHHHHVHSHHHQQQQHHHQHHQHQQHQQHQQHHFQQHHQHQQHHHHH
50 138 C L + 0 0 28 114 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
51 139 C N > - 0 0 13 114 70 NNNNNNNNNNNNNNNNNNNNNDDNDNNNDDNRDNDDNNRSRDNNNQSSANSQSNDDDDDSDRCCCSDCSH
52 140 C P T 3 S+ 0 0 58 114 5 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
53 141 C E T 3 S+ 0 0 116 114 64 EEEEEEEEEEEHCDDDTENEGNAEGDEGENEDNDNNGGDDDSSLDNQQDEQDQDAHAGNEADNRRQTEED
54 142 C V S < S- 0 0 34 114 24 VVVVVVVVVVIIIIIIVIIIIIIIIIIIIVVIIIIIVVIIIIVIVIVVIVVIVVIIIVVVIIVVVVIVDI
55 143 C K - 0 0 177 113 44 KKKKKKKKKKKSKKNKKKKSSNKRCKKNSRKKNVMKKKKKKKDKKKEEKKQKEKKNDIRKDKKNNKKK.R
56 144 C K + 0 0 134 113 38 KKKKKKKKKKKKKKKKKKRKRKKKKKRKKKKKKKKRRRRRRKKKKRHHRKHRHKRKKKKTRRKKKKRK.K
57 145 C T - 0 0 93 113 63 TTTTTTTSSSTSTSGTSTDAGSEDESEDQETEGEGEKKDGDESSEGRRGDRGKEEDADDESESDDGGG.E
58 146 C S - 0 0 57 113 70 SSSSSSSSSSASAAPASAAAAPSAAPEAPRNGEAPEPPGEGPAPPPPPSPPPPPPAPPRNPAGEEPAE.G
59 147 C W S S+ 0 0 42 113 21 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWFFWWWWFWFWFFWFFWFFWWWFWFFWWWWFWW.W
60 148 C T S S- 0 0 108 113 47 TTTTTTTTTTTTTTTTTTTTSTTTKSSTSTTSTSTTTTNTNSTDSTTTSSTTSSSTRSTTRTLTTSTT.T
61 149 C E S > S+ 0 0 95 112 68 EEEEEEEEEQEEEQEEAEQEFEETLERQ.PSSDTEEEETTTARLEEAATQTECEEAAEPEEDEVVDKA.T
62 150 C E T 4 S+ 0 0 144 113 46 EEEEEEEEEEAEHEEQEEQEEEEEEDEEAQKEGWEEEEEEEEDDWEAAEWAEAWGDDWEEEEEEEFEE.E
63 151 C E T >> S+ 0 0 45 113 6 EEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE.E
64 152 C D H 3> S+ 0 0 56 114 27 DDDDDDDDDDDDDDDDDDDDDDDEDDDEDDDEDDDDDDEDEDREDEDDEDDEDDEETDDgDEDDDDEDRE
65 153 C R H 3X S+ 0 0 108 112 82 RRRRRRRRRRRKRQRNLQLRRRRQRRQIRAVERLMRRREDEYKWARAADAERAASRQAArRTQAAATQ.E
66 154 C I H X>>S+ 0 0 43 114 67 IIIIIIIIVILLVIILIMAKTLIAMVKREAIYIKLIAAALAALIVATTTVTAIVVVLVLLVIALLVIILL
67 155 C I I 3X>S+ 0 0 1 114 27 IIIIIIIIIIIIIIIIIILIILLLIILLIIILLLILIILILIILILIIFIILIIIMIILIILIIIILIIL
68 156 C Y I 3<5S+ 0 0 105 114 58 YYYYYYYYYYYYYYYYCYILLAMILLIIVIIVIVMIVVVAVIVFIIIIIIIIIIYALIILLILLLVIELV
69 157 C Q I XX5S+ 0 0 113 114 79 QQQQQQEEEEQEQEQHQEYSQNQNEQVQAEEFGEEDAASKSQTLLAKKRMAAELEDANEQQESHHRRQQH
70 158 C A I >X5S+ 0 0 20 114 58 AAAAAAAAAALAASALALASFAAACAAAAAAVAAAAAAAMAALYAAAAAAAVAAAAAAAAAAGLLSAGAA
71 159 C H I 3X< S+ 0 0 56 114 66 LLLLLLLLLLLMLLYFLLYLVYLYLgYYLLYLMHHaHHLLLLYIHLHHLHHLHHLLLHlHgVMMMYLIHF
75 163 C G G >< S+ 0 0 1 114 12 GGGGGGGGGGGGGGGGGGGGGGGGGgGGGDGGGGGgGGGGGGGSGGGGGGGGGGGGGGpGgGGGGGGGGG
76 164 C N G 3 S+ 0 0 141 114 26 NNNNNNNNNNNNNNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATNNTTTNNTTN
77 165 C R G X> + 0 0 135 114 38 RRRRRRRRRRRKQRKRRRKRRQAKRRRKRKKRRRKHKKRRRRKKKRKKKAKRKKKKRKVKRKRKKKRKKR
78 166 C W H <> + 0 0 38 114 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
79 167 C A H 3> S+ 0 0 29 114 43 AAAAAAAAAAAAAAAAAAAAAAVAAAAAAATAAAAVAAAAATSAAAAASAAVAAASAAAAAACSSSASAS
80 168 C E H <> S+ 0 0 72 114 77 EEEEEEEEEEEEKEQEDKEEEKEEEDEEEEKDEEEGSSDEDEYEADTTDRTDTAEEEVELEDEKKVDELA
81 169 C I H < S+ 0 0 9 114 9 IIIIIIIIIIIIIIIIIIIIIIIIIIILIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIITIIMMIIIII
82 170 C A H >< S+ 0 0 7 114 26 AAAAAAAAASAAAAAASAAASASAAAASAASAASASAAASAAAAAAAAASAAAAAAAAAVAAITTAAVVA
83 171 C K H 3< S+ 0 0 145 114 50 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRKKKKKKKARRKKRARKKKKKKKNKKTTKTKKK
84 172 C L T 3< S+ 0 0 147 114 75 LLLLLLLLLLYLLLLYLYFLLLRVRMTFLLLLRMLLMMMHMRFHLELLLLLELLLLLLLLRVsLLLWRLM
85 173 C L S < S- 0 0 13 114 15 LLLLLLLLLLLLLLLLLLLLLLLLLLFLLLLILIVLLLILILLLLILLLLLILLLLLLLILIlLLLLLIL
86 174 C P S S+ 0 0 113 114 25 PPPPPPPPPPPPPPPPPPPPPPPPPNVPPPEPPPPPPPPDPPPESPPPEVPPPCPPPPPPSTSPPPPPPP
87 175 C G S S+ 0 0 22 114 5 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
88 176 C R S S- 0 0 2 112 0 RRRRRR. RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
89 177 C T >> - 0 0 34 109 7 T TT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
90 178 C D T 34 S+ 0 0 49 109 9 D DD DDDDDDDDDDDDDDDDDDDDDDAEDDDDDDEDEDDDDEDDEDDDDDDDDDDDDEDDDDEDDD
91 179 C N T 3> S+ 0 0 35 109 8 N NN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
92 180 C A H <> S+ 0 0 4 109 48 A AA SSAAAAAAAASAAAASASAAAAAAAAAAAAAAASSASAAASAAAASAHAGAASASAAAGAAS
93 181 C I H X S+ 0 0 0 109 11 I II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVVIIVVIVIVIVIVIIVVIIIVVIII
94 182 C K H > S+ 0 0 62 108 1 K KK K KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
95 183 C N H < S+ 0 0 70 108 0 N NN N NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
96 184 C H H >X>S+ 0 0 13 108 52 H HH H HHHHHHHHHHHHRHHHHHHRHHRRRRHHHRHHYHHHHHHHHHHHHHHHHRHHRRRRHRRH
97 185 C W H 3<5S+ 0 0 8 107 1 W WW W WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWFWWWWYWW
98 186 C N T 3<5S+ 0 0 82 105 33 N NN N NNNNNNNNNNNNNNNNN NNN NNNNNNNNNNYNNNNNNNNNNNYYYNNNNNYNNNNNNH
99 187 C S T <45S+ 0 0 93 105 46 S SS S SSSSSSSSSSSSSSSSS SSS SSSSSSASASSSAASSASSASASSSSSSAASAASASSA
100 188 C T T <5S+ 0 0 96 100 66 T TT T TTTSTTTTTSS SSSST STT TTTTTTTTTTQGTTTTTTTTTTTFTTTASTLRRTTNAT
101 189 C M S > S+ 0 0 21 112 3 EEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEKEEEE
12 100 C D H > S+ 0 0 20 112 1 DDDDDDDDDDDDDDDEDDDDDDDDDDDDDNDDDDDDDDDDDDD
13 101 C Q H 4 S+ 0 0 103 113 69 ATKTKKAQATASEDSDDLADAEEEQKLEEAAEAEDRLDEQDEE
14 102 C R H >> S+ 0 0 59 114 83 ITSIIVTIIIVINKIIKAAKHKKKIRIKILKKKRLKKIKKLRL
15 103 C V H 3X S+ 0 0 0 114 25 LLLLIIILLLLLLLLILILLLLLLLLLLLLLLLLLLLLLLLLL
16 104 C I H 3X S+ 0 0 49 114 77 TKVRIKLKRVKIRRILKSIKARMVAMILVIRVKLRATARRKVR
17 105 C K H <> S+ 0 0 153 114 63 ERKQEERRQESSKESRQQAQRENRNENNENDRSREEEAEDKAT
18 106 C L H X>S+ 0 0 21 113 39 LLLLCLFLL.LYLLYELLLLLLYYYLYYYYFFYYCVLYLSHYA
19 107 C V H <5S+ 0 0 2 114 16 VVVVMVVVVKIVVVVKVVIVVVMVIVIIVIIIIIIIVVVIVII
20 108 C Q H <5S+ 0 0 117 114 79 AEKMRALDMRATSSTRAGEALELTAAATRQETEQGEAKSLERG
21 109 C K H <5S+ 0 0 129 114 72 REQSELKETGQKRQKALEALEKNKETNKQQRKNSKVTAQQKSC
22 110 C Y T <5S- 0 0 91 114 60 EFYQGYFFQARHFYHMHISHYYNHHSHHFHFHHHHHYHYHYHH
23 111 C G < - 0 0 10 114 11 GGGGGGGGGLGGGGGYGGgGGGGGGsGGGGgGgGGGGGRGgGG
24 112 C P S S+ 0 0 38 111 69 AETP.VIEPGPEPPEGPEePEPQHEtEHAEgHgYEAAEPVgEV
25 113 C K S S+ 0 0 162 113 70 HGRRLKKGRKRGQQGRQCAQGHGGGAGGRGGGGGGKRGQSRGR
26 114 C R >> + 0 0 103 113 69 KNRNRKDNNKQNNNNKNNRNNNCCSTVCDQNCNKKRRKNCSCR
27 115 C W H 3> S+ 0 0 43 113 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
28 116 C S H 3> S+ 0 0 78 113 56 ASSTMTTSTSTNNNNANGCNSNSSRGNSSRISIHHKNRSSQRD
29 117 C V H X> S+ 0 0 52 113 82 SVLSQVMVSKRSLLSKVTQVPAEVSASSSSAAADMMRELEASF
30 118 C I H >X S+ 0 0 0 113 13 IIIIIIIIIIIVIIVIIIVIIIIVLILVILLVLVVILVIVLLI
31 119 C A H >< S+ 0 0 4 114 33 AASAAAGAAAAQAAQAAAGAAAAPPAAPRPPPPPPAAPAPAPA
32 120 C K H << S+ 0 0 143 114 75 SRTAEQTRSYEKQEKAEEVERQRKKKRKTKQKPSITEQEAESA
33 121 C H H << S+ 0 0 74 114 85 AHFCRKQHCYNHKKHFNRANAKNQNESLKKKQKQRTKKKKTAI
34 122 C L S X< S- 0 0 5 114 47 LLMIIMVLILISLLSLLMMLLLSAAMAAGAAAIAAAMALALAI
35 123 C K T 3 S+ 0 0 138 114 73 snAPPePnPPPgHQgPHsEHnQgggggelageggggqghgvgp
36 124 C G T 3 S+ 0 0 53 114 53 egNGGgNgGGGfGGfGGgGGgGsqldkeplkqknnnvrlrqla
37 125 C R S < S+ 0 0 20 114 7 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRLRIRRRL
38 126 C I > - 0 0 93 114 75 AISSISTISTSCSSCQSTSSISCCCTTCTCCCCCCCECLNCCR
39 127 C G T 3> S+ 0 0 35 114 13 GGGGGGPGGGGGGGGGGGAGGGGGGGGGGGGGGRRGGGDGGGS
40 128 C K H 3> S+ 0 0 148 114 4 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKDKRKT
41 129 C Q H <> S+ 0 0 38 114 54 QQQSQQQQSQSSSSSQSQQSQSSSSQSSSSSSSSSSGSSSGSQ
42 130 C C H > S+ 0 0 0 114 5 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
43 131 C R H < S+ 0 0 65 114 0 RRRRRRRRrRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
44 132 C E H < S+ 0 0 68 114 64 EEELEEEElELLLLLELEELELLLLELLLLLLLLLLILLLALD
45 133 C R H < S+ 0 0 55 114 1 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
46 134 C W S < S+ 0 0 0 114 2 WYWWWWYYWYWWWWWFWWWWWWWWWWWWWWWWWWWWWWWWWWW
47 135 C H S S+ 0 0 49 114 82 FNVLTHKNLVLAFFAVFHLFNFIIIQLIVMLILLLMLLFILMC
48 136 C N S S- 0 0 43 114 5 NHNNNNNHNNNNNNNNNNNNHNNNNSNNNNNNNNNNNNNNNNN
49 137 C H S S+ 0 0 21 114 52 NHHQHHQHQHQHQQHHQHHQHQYYYHYYKYYYYYYYKYQYYYQ
50 138 C L + 0 0 28 114 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
51 139 C N > - 0 0 13 114 70 CRQSDDDRSDSRDDRDDASDSDRRRRRRKRRRRRRRDRDRRRD
52 140 C P T 3 S+ 0 0 58 114 5 PPPPPPPPPPPPPPPPPPPPPPPPAPPPPPPPPAPPPPLPPPP
53 141 C E T 3 S+ 0 0 116 114 64 EDDSNGFDAEANRRNERNDRGRDDDGDDDGNDNDNNRNWGGDS
54 142 C V S < S- 0 0 34 114 24 VIIVLIIIVLLLIILLIVVILILLVLVLLIILLVIIIIILLLI
55 143 C K - 0 0 177 113 44 RKRKKNNKKKKKNNKKNKSNRNKKKKRKKKKKKKKKNKNKKKN
56 144 C K + 0 0 134 113 38 KKKSKKRKSATKRRKKRKKRKRRRRKRRSRHRHRRRKRRRHRR
57 145 C T - 0 0 93 113 63 GDEGGLGDGGSGRRGGREQRGSGGGGGGgGGGGGGGTgRgGgG
58 146 C S - 0 0 57 113 70 GAGPGPPAPEPAPPAEPPKPPPPVNQNAkNEVGRTNAiPfNfA
59 147 C W S S+ 0 0 42 113 21 WWWFWWWWFWWFFFFWFLWFWFFFIWIFFIFFFFFIFSFSFTW
60 148 C T S S- 0 0 108 113 47 STTSTTTTSNETTTTTSTTSGTSSSTTSSSSSTTSSTDTPTDT
61 149 C E S > S+ 0 0 95 112 68 PEKARNDEADPPDEPDEAAEPSPAKSPSPEDAELEDDEEEQDA
62 150 C E T 4 S+ 0 0 144 113 46 EEEEEKAEEDQEDEEDEDEEEEQAEEEEEDEADSDQEEDEAET
63 151 C E T >> S+ 0 0 45 113 6 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEYEEEEGEEDE
64 152 C D H 3> S+ 0 0 56 114 27 DEEDDEDEDEDEEEEEEEDEEEEEETQQEEDEDqVEDdEeeeD
65 153 C R H 3X S+ 0 0 108 112 82 RSEAAKDAAAAREDRADDRDIEENDSLNKDRNNlDDAgEg.gA
66 154 C I H X>>S+ 0 0 43 114 67 VLLVIILLVIVTRRIIRLARLRLLIKLLLLILLVLLKdRlisT
67 155 C I I 3X>S+ 0 0 1 114 27 ILLILIILILIILLILLIILLLIIIMIIVIIIIILIIKLYITL
68 156 C Y I 3<5S+ 0 0 105 114 58 FVVLVFIVLILVLLVIMFIMALLIIKMVVLCICLLLLFLFCFA
69 157 C Q I XX5S+ 0 0 113 114 79 EAQWEEKSWAWEAAEAAKEADAHDKLEEERSDSERRSSAIEIT
70 158 C A I >X5S+ 0 0 20 114 58 SAAAAAAAAMALAALMAYAAACLLLLLLMLLLLLLLASAAMFL
71 159 C H I 3X< S+ 0 0 56 114 66 HYFYFYHFYLYLHHIHHEYHLHLLLLWLLLILILlLLvQdRlF
75 163 C G G >< S+ 0 0 1 114 12 GGGGGGGGGGRGGGGGGGGGGGGGGGGGGGGGGGlGGaGcGaG
76 164 C N G 3 S+ 0 0 141 114 26 TNNNNNNNNNNNNNNNNNTNNNNNNNNNNNSNSNVNNGNYSRN
77 165 C R G X> + 0 0 135 114 38 RRKKAKKRKKRKKKKRKRRKKRRRRKRRKRRRRRLRQRKRCRA
78 166 C W H <> + 0 0 38 114 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
79 167 C A H 3> S+ 0 0 29 114 43 ASSATATSATCAAAAAAASASASSSSSSASSSSSSSTSASSSS
80 168 C E H <> S+ 0 0 72 114 77 FDASKEADSVERMMRNLEELDIQQLQKQKLIQILLLKLMQVLR
81 169 C I H < S+ 0 0 9 114 9 IIIIIIIIIIIMIIMIIILIIIIIIIIIIIIIIKIIIIIIIII
82 170 C A H >< S+ 0 0 7 114 26 VAAAAAAAASAAAAAAASVAAAAAAAAAAAAAAAAAAAVAAAA
83 171 C K H 3< S+ 0 0 145 114 50 KKKKEKKKKKKARRAKRKKRRRATSAKASGSTASGGKARMAGA
84 172 C L T 3< S+ 0 0 147 114 75 LVMHLYLVHQHQLLQNLRALCHRQHqHRCRQQQaRRLRLQQQS
85 173 C L S < S- 0 0 13 114 15 LILLLLIILLLLFFLLFLFFIFLLFgLLLLLLLlLLLLFLLLL
86 174 C P S S+ 0 0 113 114 25 PRPPPPPKPPPPPPPPPGPPPPPPPTPPPPPPPPPPHPSPPPP
87 175 C G S S+ 0 0 22 114 5 GGGGGGGGGGGGGGGGGKGGGGGGGGGGGGGGGGDGRGGGGGG
88 176 C R S S- 0 0 2 112 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
89 177 C T >> - 0 0 34 109 7 TTTTATTTTSTTTTTSTTTTSTTTTSTTTTTTTTTTTTTTTTT
90 178 C D T 34 S+ 0 0 49 109 9 DEDDEDDEDDDDDDDDDDDDEDDDDDDDDDDDDDADDDDDDDD
91 179 C N T 3> S+ 0 0 35 109 8 NNNNSNNNNNNNNNNNNNSNNNNNNNNNNNNNNNNNINNNLNN
92 180 C A H <> S+ 0 0 4 109 48 AASHASAAHDHEAAEDAAAAAAEEEDEEDEDEDEDEAEAEAEA
93 181 C I H X S+ 0 0 0 109 11 IVIIVIIVIVIIVVIVVIIVVVIIIIIIVIIIIVVIVIVVVIV
94 182 C K H > S+ 0 0 62 108 1 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKK
95 183 C N H < S+ 0 0 70 108 0 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
96 184 C H H >X>S+ 0 0 13 108 52 RHHRRHHHRHRYHHYHHHRHHHFFYRYLFYYFYYFYQYHHYYR
97 185 C W H 3<5S+ 0 0 8 107 1 WWWWWFW WWWWWWWWWFWWWWWWWWWWWWWWWWWWWWWWWWW
98 186 C N T 3<5S+ 0 0 82 105 33 NNHNNYN NYNNHHNYHYNHNHNNNYRNSNNNNNNNRNHNNNN
99 187 C S T <45S+ 0 0 93 105 46 SAACSSS CSCTVVTSVAAVAVSSSSTSTTTSTSSSHSVSGTS
100 188 C T T <5S+ 0 0 96 100 66 ETATAKT TTTRVVRTITMITIT H R RHK RCHHLTVYTHA
101 189 C M S >