Complet list of 1mp1 hssp file
Complete list of 1mp1.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1MP1
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-04
HEADER RNA BINDING PROTEIN 11-SEP-02 1MP1
COMPND MOL_ID: 1; MOLECULE: SER/ARG-RELATED NUCLEAR MATRIX PROTEIN; CHAIN: A;
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR B.R.SZYMCZYNA,J.BOWMAN,S.MCCRACKEN,A.PINEDA-LUCENA,Y.LU, B.COX,M.LAMBE
DBREF 1MP1 A 27 134 UNP Q8IYB3 SRRM1_HUMAN 27 134
SEQLENGTH 111
NCHAIN 1 chain(s) in 1MP1 data set
NALIGN 48
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : A2A983_MOUSE 1.00 1.00 4 111 27 134 108 0 0 198 A2A983 Serine/arginine repetitive matrix protein 1 (Fragment) OS=Mus musculus GN=Srrm1 PE=2 SV=1
2 : M0QXG5_HUMAN 1.00 1.00 17 111 1 95 95 0 0 102 M0QXG5 Serine/arginine repetitive matrix protein 1 (Fragment) OS=Homo sapiens GN=SRRM1 PE=2 SV=1
3 : M3YWL3_MUSPF 1.00 1.00 4 111 27 134 108 0 0 213 M3YWL3 Uncharacterized protein OS=Mustela putorius furo PE=4 SV=1
4 : F1C6Z7_PERFV 0.93 0.98 4 101 25 122 98 0 0 122 F1C6Z7 Serine/arginine repetitive matrix protein 1 (Fragment) OS=Perca flavescens GN=Srrm1 PE=2 SV=1
5 : F8W280_DANRE 0.93 0.97 4 111 27 134 108 0 0 238 F8W280 Uncharacterized protein OS=Danio rerio GN=srrm1 PE=4 SV=1
6 : G3NWB8_GASAC 0.93 0.97 4 111 28 135 108 0 0 257 G3NWB8 Uncharacterized protein (Fragment) OS=Gasterosteus aculeatus PE=4 SV=1
7 : M3X4V1_FELCA 0.91 0.94 4 111 23 130 108 0 0 247 M3X4V1 Uncharacterized protein (Fragment) OS=Felis catus PE=4 SV=1
8 : Q8QGK3_TAKRU 0.88 0.95 4 111 27 134 108 0 0 153 Q8QGK3 SRm160/300 splicing coactivator (Fragment) OS=Takifugu rubripes GN=300 PE=4 SV=1
9 : H0W2A9_CAVPO 0.83 0.91 4 110 27 132 107 1 1 198 H0W2A9 Uncharacterized protein OS=Cavia porcellus PE=4 SV=1
10 : R7VC08_CAPTE 0.78 0.89 5 111 4 110 107 0 0 195 R7VC08 Uncharacterized protein (Fragment) OS=Capitella teleta GN=CAPTEDRAFT_163872 PE=4 SV=1
11 : A7SUQ0_NEMVE 0.72 0.91 6 111 24 130 107 1 1 139 A7SUQ0 Predicted protein (Fragment) OS=Nematostella vectensis GN=v1g132837 PE=4 SV=1
12 : C3ZLC0_BRAFL 0.72 0.90 4 111 27 134 108 0 0 150 C3ZLC0 Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_218692 PE=4 SV=1
13 : B7QAA4_IXOSC 0.71 0.92 4 111 28 136 109 1 1 138 B7QAA4 Serine/arginine regulated nuclear matrix protein, putative (Fragment) OS=Ixodes scapularis GN=IscW_ISCW013151 PE=4 SV=1
14 : H2YMJ4_CIOSA 0.70 0.86 5 111 28 134 107 0 0 205 H2YMJ4 Uncharacterized protein OS=Ciona savignyi GN=Csa.3168 PE=4 SV=1
15 : V4B6K6_LOTGI 0.70 0.90 5 111 29 135 107 0 0 138 V4B6K6 Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_137917 PE=4 SV=1
16 : M7ART0_CHEMY 0.69 0.72 11 111 1 137 137 2 36 909 M7ART0 Chloride intracellular channel protein 4 (Fragment) OS=Chelonia mydas GN=UY3_14862 PE=4 SV=1
17 : T1G6M5_HELRO 0.67 0.87 5 111 29 136 108 1 1 147 T1G6M5 Uncharacterized protein OS=Helobdella robusta GN=HELRODRAFT_87156 PE=4 SV=1
18 : H2XTZ8_CIOIN 0.65 0.83 5 111 28 134 107 0 0 159 H2XTZ8 Uncharacterized protein OS=Ciona intestinalis GN=LOC100186689 PE=4 SV=1
19 : V5GNQ3_ANOGL 0.63 0.91 17 111 1 95 95 0 0 104 V5GNQ3 Serine/arginine repetitive matrix protein 1 (Fragment) OS=Anoplophora glabripennis GN=SRRM1 PE=4 SV=1
20 : B3RXQ9_TRIAD 0.57 0.81 5 111 21 128 108 1 1 130 B3RXQ9 Putative uncharacterized protein (Fragment) OS=Trichoplax adhaerens GN=TRIADDRAFT_7832 PE=4 SV=1
21 : G2RE61_THITE 0.55 0.75 6 110 14 118 106 2 2 124 G2RE61 Putative uncharacterized protein OS=Thielavia terrestris (strain ATCC 38088 / NRRL 8126) GN=THITE_2057473 PE=4 SV=1
22 : G9N776_HYPVG 0.55 0.76 6 110 5 109 106 2 2 116 G9N776 Uncharacterized protein OS=Hypocrea virens (strain Gv29-8 / FGSC 10586) GN=TRIVIDRAFT_42986 PE=4 SV=1
23 : W4JWE8_9HOMO 0.55 0.79 6 110 26 132 108 3 4 134 W4JWE8 Uncharacterized protein (Fragment) OS=Heterobasidion irregulare TC 32-1 GN=HETIRDRAFT_243565 PE=4 SV=1
24 : B8PPI9_POSPM 0.54 0.76 16 110 48 144 98 3 4 147 B8PPI9 Predicted protein (Fragment) OS=Postia placenta (strain ATCC 44394 / Madison 698-R) GN=POSPLDRAFT_28578 PE=4 SV=1
25 : R1DLD9_EMIHU 0.53 0.73 6 102 24 119 100 3 7 152 R1DLD9 Uncharacterized protein (Fragment) OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_58282 PE=4 SV=1
26 : W2RVX8_9EURO 0.53 0.71 2 110 10 118 110 2 2 125 W2RVX8 Uncharacterized protein OS=Cyphellophora europaea CBS 101466 GN=HMPREF1541_04742 PE=4 SV=1
27 : C1GQ98_PARBA 0.52 0.73 2 110 10 118 110 2 2 226 C1GQ98 Uncharacterized protein OS=Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) GN=PAAG_00693 PE=4 SV=1
28 : U9SUQ8_RHIID 0.51 0.77 5 111 21 126 109 3 5 141 U9SUQ8 Uncharacterized protein (Fragment) OS=Rhizophagus irregularis (strain DAOM 181602 / DAOM 197198 / MUCL 43194) GN=GLOINDRAFT_70246 PE=4 SV=1
29 : C4JZU3_UNCRE 0.50 0.73 4 110 12 118 108 2 2 134 C4JZU3 Putative uncharacterized protein OS=Uncinocarpus reesii (strain UAMH 1704) GN=UREG_07694 PE=4 SV=1
30 : E3QEB7_COLGM 0.50 0.73 6 110 14 118 106 2 2 139 E3QEB7 PWI domain-containing protein OS=Colletotrichum graminicola (strain M1.001 / M2 / FGSC 10212) GN=GLRG_04367 PE=4 SV=1
31 : H1V4G7_COLHI 0.50 0.73 6 110 14 118 106 2 2 127 H1V4G7 PWI domain-containing protein (Fragment) OS=Colletotrichum higginsianum (strain IMI 349063) GN=CH063_06969 PE=4 SV=1
32 : C1MJF5_MICPC 0.48 0.76 11 111 1 102 103 3 3 102 C1MJF5 Predicted protein (Fragment) OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_9298 PE=4 SV=1
33 : E1ZH12_CHLVA 0.48 0.76 5 111 3 110 109 3 3 110 E1ZH12 Putative uncharacterized protein (Fragment) OS=Chlorella variabilis GN=CHLNCDRAFT_14158 PE=4 SV=1
34 : N1QE56_SPHMS 0.48 0.67 6 110 5 109 106 2 2 146 N1QE56 PWI domain-containing protein OS=Sphaerulina musiva (strain SO2202) GN=SEPMUDRAFT_48538 PE=4 SV=1
35 : S9WYC3_SCHCR 0.46 0.70 3 110 2 121 121 3 14 251 S9WYC3 Splicing coactivator SRRM1 OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691) GN=SPOG_03208 PE=4 SV=1
36 : F0YH42_AURAN 0.45 0.70 6 111 24 137 115 3 10 142 F0YH42 Putative uncharacterized protein (Fragment) OS=Aureococcus anophagefferens GN=AURANDRAFT_30707 PE=4 SV=1
37 : F0ZVM6_DICPU 0.45 0.69 5 102 21 117 99 3 3 132 F0ZVM6 Putative uncharacterized protein (Fragment) OS=Dictyostelium purpureum GN=DICPUDRAFT_24641 PE=4 SV=1
38 : H3G7G6_PHYRM 0.44 0.70 5 109 1 107 108 2 4 107 H3G7G6 Uncharacterized protein OS=Phytophthora ramorum GN=gwEuk.6.138.1 PE=4 SV=1
39 : K2SZH1_MACPH 0.44 0.73 6 110 8 114 108 3 4 123 K2SZH1 Splicing factor PWI OS=Macrophomina phaseolina (strain MS6) GN=MPH_00611 PE=4 SV=1
40 : T0MDH6_COLGC 0.43 0.61 6 110 14 138 126 3 22 163 T0MDH6 PWI domain-containing protein OS=Colletotrichum gloeosporioides (strain Cg-14) GN=CGLO_00662 PE=4 SV=1
41 : T1EIL6_HELRO 0.43 0.66 14 106 1 98 98 2 5 98 T1EIL6 Uncharacterized protein (Fragment) OS=Helobdella robusta GN=HELRODRAFT_137609 PE=4 SV=1
42 : E4Y8J1_OIKDI 0.42 0.65 3 111 1 117 120 3 14 204 E4Y8J1 Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_49 OS=Oikopleura dioica GN=GSOID_T00029163001 PE=4 SV=1
43 : A4S634_OSTLU 0.39 0.59 1 110 23 138 117 4 8 209 A4S634 Predicted protein OS=Ostreococcus lucimarinus (strain CCE9901) GN=OSTLU_26974 PE=4 SV=1
44 : J4C3A4_THEOR 0.39 0.62 11 109 58 178 122 3 24 219 J4C3A4 Splicing factor OS=Theileria orientalis strain Shintoku GN=TOT_020000359 PE=4 SV=1
45 : L0AVI3_BABEQ 0.39 0.61 7 110 32 157 127 3 24 332 L0AVI3 Uncharacterized protein OS=Babesia equi GN=BEWA_021050 PE=4 SV=1
46 : A0C508_PARTE 0.35 0.66 3 111 26 138 117 3 12 171 A0C508 Chromosome undetermined scaffold_15, whole genome shotgun sequence OS=Paramecium tetraurelia GN=GSPATT00006374001 PE=4 SV=1
47 : A0CRF5_PARTE 0.35 0.66 3 111 26 138 117 3 12 171 A0CRF5 Chromosome undetermined scaffold_25, whole genome shotgun sequence OS=Paramecium tetraurelia GN=GSPATT00009687001 PE=4 SV=1
48 : Q4Y9C1_PLABA 0.32 0.62 5 108 29 142 114 2 10 152 Q4Y9C1 Putative uncharacterized protein (Fragment) OS=Plasmodium berghei (strain Anka) GN=PB001198.00.0 PE=4 SV=1
## ALIGNMENTS 1 - 48
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 24 A S 0 0 134 2 0 S
2 25 A H + 0 0 143 4 82 RR S
3 26 A M - 0 0 53 8 71 AR R MA MM
4 27 A Q + 0 0 156 19 49 Q QQQQQQQ QQ QQ Q D VN NN
5 28 A L - 0 0 37 30 53 L LLLLLLLM MLMM MM M TTMT L T MM LA KKL
6 29 A K - 0 0 182 41 28 K KKKKKKKKRKKKK KK KKKK KKKNKKK KKKKKSKK QK QQS
7 30 A F - 0 0 63 42 10 F FFFFFFCFFFFFF FF FFFF FFFFFFF FWFFFFFF YL WWWW
8 31 A A - 0 0 10 42 45 A AAAAAAAAEAHAG AA APPP PPPPPPP PPPDPPPP AA PPPP
9 32 A E S > S+ 0 0 157 42 64 E EEEEAEEEDDDED DD NPTP APPSPPP PEPASKPP EE SEED
10 33 A C G > S+ 0 0 52 42 85 C CCCCCCCIVGVGS NG NEEE HEEEEEE EESSICEE GA SVVI
11 34 A L G > S+ 0 0 2 45 15 L LLLLLLLLLLLLLLLL LFFF FFFFFFFLFFFLYFFF LLFFFFY
12 35 A E G < S+ 0 0 107 45 57 E EDEDEDEEDDNDESDG SNND ENANNNNNSSEDKAEN EKSSNNN
13 36 A K G < S- 0 0 140 45 73 K KKKKKKKKKKKVNTKV VQQK TQEQTQQRTKAQNQEQ QVKKRRY
14 37 A K < - 0 0 143 46 20 K KKKKKKKKKKRKKQKK KKKK KKKKKKKRKKKKKRKKQKKPPKKK
15 38 A V - 0 0 9 46 11 V VVVVVVVVVVVVVVVV IVVV VVVVVVVVVVVLVVVVIVIVVLLI
16 39 A D >> - 0 0 41 47 16 D DDDDDDDDDDDDDDDD DDDDDDDDDDDDNDNDNDDDDDDDNDNNN
17 40 A M T 34 S+ 0 0 20 49 21 MMMMMMMMMMMMMMMMMMMMMMMMMMMLMMMMLIMILMTMLLIIIMMI
18 41 A S T 34 S+ 0 0 87 49 75 SSSTTTSTSTETNKTSTRSEQQRRKQTKNQQSTSKESATQKRDTTKKN
19 42 A K T <4 S+ 0 0 118 49 11 KKKKKKKKKKKKKKKKKKKRKKKKKKKKKKKKKKKKKKKKKRKKKNNK
20 43 A V S < S- 0 0 0 49 7 VVVVVVVVVVIVVVVVVIIVVVVVVVVVVVVVVVVVVVVVVVVVIIII
21 44 A N - 0 0 74 49 34 NNNNNNNNNNNNNRNNNQKNNNNNNNNNNNNKNNNNHQQNNNDRRDDN
22 45 A L S >> S+ 0 0 24 49 29 LLLLLLLLLLLILLVLLLLLVLILLVILILLLWLIVLRMLILLIILLF
23 46 A E G >4 S+ 0 0 132 49 42 EEEEEEEEDDDEDDDEDDDDQQETDEETEQQDDPEEPEDQEDEDDSSD
24 47 A V G 34 S+ 0 0 35 49 46 VVVVVVVVGTVTTTTVTTVTVVVVVVVVVVVVTVVVSVIVAAVAAVVI
25 48 A I G <> S+ 0 0 0 49 35 IIIIIIIIIMLIIMLILIIVMIIIMMMIMMMVLILMLVMMFLIFFIIV
26 49 A K H S+ 0 0 43 49 44 PPPPPPPPPPPPPPPPPPPPKKPPPKKPKKKPPKPRKQKrPPPPPPPK
28 51 A W H > S+ 0 0 41 49 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
29 52 A I H X S+ 0 0 0 49 13 IIIIIIVIRIIIIIIIIIIIIIVVIIIVIIIVIVIVIVIIIIIIIIIV
30 53 A T H X S+ 0 0 16 49 63 TTTTTTTTTTTKTSATTSQSAAAVTAAAAAAAASANTAAANASTTEEQ
31 54 A K H X S+ 0 0 114 49 62 KKKQQQKKKKKQRKQKNKDNNNKKEGGQGSSRKDNEKETSEDRKKKKK
32 55 A R H X S+ 0 0 53 49 37 RRRRRRRRQRRKKRRRKRKKRKKKQKKKKKKRRERRERKKKKERRKKR
33 56 A V H X S+ 0 0 0 49 28 VVVVVVVVVVIIIIIVIILVVIVIVIIIIIIVIILLVLIIIVIVTMML
34 57 A T H X S+ 0 0 28 49 66 TTTTTTINTTTNTTTTTTTLISVISTSVSSSTTTITETQSVSTSAIII
35 58 A E H < S+ 0 0 132 49 32 EEEEEEEDEEDDDEEEQEEEEEEEEEGEDDDEEREEEEQDERAEEQQD
36 59 A I H < S+ 0 0 24 49 43 IIIIIIIIILMMFILILMMIIILLYIILIIILLILLNLIIYLLLLYYL
37 60 A L H < S- 0 0 48 49 21 LLLLLLRLLLLLLLLLLLLLLLVVLLLLLLLLLLILLLLLSLLMMIIL
38 61 A G S < S+ 0 0 57 49 19 GGGGGGGGGKGGGGGGTGKGGGGGGGGGGGGGgNGGGGEGGnGGGGGG
39 62 A F S S- 0 0 117 48 74 FFFFFFFFFFFFMFIFIFIINSFFFNNGNNSIlSFF.FNSTvVVVIIF
40 63 A E - 0 0 71 49 16 EEEEEEEEEEEEEEEEEEDEEEEEEEEEEEEEEDEEYEDEEPEEDEEE
41 64 A D - 0 0 7 49 4 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDPDDDDDD
42 65 A D S > S+ 0 0 116 49 13 DDDDDDDDDDDDEDDDDDDDDDEEEDDEDDDDEDEDEDDDDDEDEEED
43 66 A V H > S+ 0 0 62 49 12 VVVVVVVVAVVVVVVVVVVVVVVVVIVVVVVVVVVVIIMVVVVIIVVI
44 67 A V H > S+ 0 0 0 49 21 VVVVVVVVVVVVVVVVVVVFVVVVVVVVVVVLLVVVTVVVLALVVVVL
45 68 A I H > S+ 0 0 14 49 38 IIIIIIIIIILIVIVITIVIIIVVIIIVIIIVITIVSVVIVIIVVQQC
46 69 A E H X S+ 0 0 114 49 42 EEEEEEEEEEEDEEEEEADDEEEEGEENEEESAEDNDSDEEDGDDRRD
47 70 A F H X S+ 0 0 34 49 39 FFFFFFFFFFFYFFFFFFFYLLYYYLLYLLLMYMFLYMFLFYMYYQQY
48 71 A I H X S+ 0 0 0 49 51 IIIIVIIIIIVVVVVIVVVVICAAVCCVCCCIVMVVIAVCVCICCIIC
49 72 A F H < S+ 0 0 72 49 47 FFFFFFFFVFFFFFYFFYYIFFMMEFFFFFFFYFYEYIYFCYEIIIII
50 73 A N H < S+ 0 0 72 49 43 NNNNNNNNKNNNNNNNNNNNNNGGSNNGNNNNETSNNNNNTKVGSNNS
51 74 A Q H >< S+ 0 0 41 49 72 QQQQQQDQQQMLQSQQQSQMLLLLQLLLLLLVQFMMLLLLQQLQQYYQ
52 75 A L T 3< S+ 0 0 3 49 9 LLLLLLLLLLLLLLLLLLLLIILLLLLLLIIMLLLLLLLILLLLLLLL
53 76 A E T 3 S+ 0 0 134 49 29 EEEEEEEEEEEEEEEEEEEEEDEEREEEEEEEDEETAEDEESEKKEEK
54 77 A V S < S- 0 0 56 49 70 VVVEEEVEVEsEaEEveEEsggddeggEgggmgseqePggisedvqqd
55 78 A K S S+ 0 0 136 43 64 KKKKKKKKKKkEkQRqkQKrrrss.rr.rrrqkkvqn.rrl.kgg..q
56 79 A N S S- 0 0 110 44 88 NNNHNHNHNHHKHHFFQFYHYHQQ.YF.FYYITFEDTKYFI.IEE..N
57 80 A P - 0 0 12 45 44 PPPPPPPPPPPPPPPsPPPPPPpp.PPpPPPhVPaaPVPPf.hst..g
58 81 A D >> - 0 0 79 49 16 DDDDDDVDDDDDDDDdNDDDDDdddDDdNDDdDNddNDDDdnnddddd
59 82 A S H 3> S+ 0 0 5 49 70 SSSSGSSASPPPGPPSGPPPIIPPPIIPIIIGPIPPPEPIQGGPPPPS
60 83 A K H 3> S+ 0 0 46 49 18 KKKKKKKKKKKKRKKKKKKKKKKRKKKRKKKSRKKKKRKKKKAKKKKK
61 84 A M H <> S+ 0 0 51 49 86 MMMMMMIMMEVCMVDMDVKRSSKKKSLMHSSVQQKQELKSQQHRRVVK
62 85 A M H X S+ 0 0 1 49 21 MMMMMMMMMMMMMLMMMLIMLLMMMLLMLLLMLIVLIDLLMMMLLLLL
63 86 A Q H X S+ 0 0 2 49 31 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQYQQQQQPQQQQYQQSSK
64 87 A I H < S+ 0 0 22 49 28 IIIIIIIIIIIIIIIIIIIIIIIILIIIIIIGITLIIRIIIDAIIIII
65 88 A N H >< S+ 0 0 2 49 52 NNNNNNNNNNNNNNNNNNNNQQNNSSQNQQQQSDNQSQAQNSQSNQQN
66 89 A L H >X S+ 0 0 0 49 16 LLLLLLLLLLILLLLLLLLVLLLLLLLLLLLLLILLILLLMILLLIII
67 90 A T T 3< S+ 0 0 32 49 38 TTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTLTSTTTQGTGSETTMMT
68 91 A G T <4 S+ 0 0 35 49 18 GGGGGGGGGGGGGGGGGGGPGGGGGGGGGGGTGGGGGMGGGTSGGGGG
69 92 A F T <4 S+ 0 0 99 49 9 FFFFFFLFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFAFFFLFFFYYF
70 93 A L S < S- 0 0 14 49 8 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLILMMFFI
71 94 A N - 0 0 138 47 57 NNNNNNNNNNQNNNNNNNNQ.DTTEDDEDDDE.DEDETHDSQEAAEEG
72 95 A G S > S+ 0 0 20 48 73 GGGGGGGGGLAGGGPGAGGAD.AKAKKKKKKREKSRQgEKTGKKKKKN
73 96 A K H > S+ 0 0 174 28 14 KKKKKKKK.RKKKKKKKKKKKK...D......K....k..RK.....K
74 97 A N H > S+ 0 0 52 49 51 NNNNNNNNNNNNNNNNNNNPDDSHNADNDDDNNDKQEQDDNPQKKNNK
75 98 A A H > S+ 0 0 0 49 42 AAAAAAAAATAAASAAASAATTTTTATATTTTTAAATAATAATAATTS
76 99 A R H X S+ 0 0 144 49 79 RRRRRRRRTRRRRRRRRRRTPGPPPKPPPAAASASAPGPARREGHLLD
77 100 A E H < S+ 0 0 119 49 85 EEEEEEEEETIEDLIELIEKTPAEPFKIKPPTLPEPPARPMLTVVPPI
78 101 A F H >X S+ 0 0 0 49 4 FFFFFFFFFFFFFFFFFFFFFFFFFCFFFFFFFFFFFFFFFYFFFFFF
79 102 A M H 3X S+ 0 0 0 49 70 MMMMMMMLMMMMMLLMMLMMCCMMMKCVCCCMCCTVMTVCTMCVVMMI
80 103 A G H 3< S+ 0 0 28 48 76 GGGRKRGKGEGGGGQGEGAQRKSSR.KTKKKTKREAEEKKEAVRKTTR
81 104 A E H <4 S+ 0 0 79 49 24 EEEDDDEDEEEEDEEEEEEEDEAADEEEDDDEEQEEKEEEDEQEEEEE
82 105 A L H X S+ 0 0 1 49 1 LLLLLLLLFLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFLLLLLLLLL
83 106 A W H X S+ 0 0 3 49 3 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWL
84 107 A P H > S+ 0 0 45 49 77 PPPPPPPPSVEKVEEPQEANRKAKANNKSKKDQKMKKQSKSPEDDNNE
85 108 A L H > S+ 0 0 17 49 14 LLLLLLQLLLLLLLLLLHLMLLLLLLLLLLLLLLLLLLLLELLLLLLL
86 109 A L H X S+ 0 0 0 49 42 LLLLLLLLLLLLLLLLLLLLLLLLLCCLCCCLLLILLLCCLLLLLLLL
87 110 A L H < S+ 0 0 31 49 38 LLLLLLLLLLVVLAVLLAIALLLLVLLLILLVILLLLLLLQVALLVVI
88 111 A S H >< S+ 0 0 41 49 30 SSSSSSSSCSSSSSSSSSSSSSEESSSSSSSSSSSDSSSSMSSSSDDE
89 112 A A H >< S+ 0 0 0 49 4 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAN
90 113 A Q T 3< S+ 0 0 72 49 27 QQQQQQQQQMQQQQQQMQQQQQQQQQQQQQQQNQSQQQQQVQQQQEEE
91 114 A E T < S+ 0 0 161 49 63 EEEEEEEEKEEDEAEEEDEEASEEESTESTTEQSQDNSSTLSADEGGK
92 115 A N S < S- 0 0 62 49 52 NNNNNNNNNNNNSNNNTNNNSNSSNNSGNSSNTDNAAHnSSSnSNQQN
93 116 A I S S+ 0 0 157 49 80 IIIIIIIIIVIIVGIISDPVPPPPEPPEPPPAGPPPIPtPDPeEEDDE
94 117 A A S S- 0 0 55 49 84 AAAAAAAATGGAGAAAGMTSQQAAHQLNQQQHTKYHGTLQNMKHYSSE
95 118 A G S S+ 0 0 0 49 11 GGGGGGGGRGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGR
96 119 A I - 0 0 23 49 13 IIIIIIIIIIIIIIIIIIIIVVIVIVVIVVVVIIIIIIVVMIVIIIII
97 120 A P >> - 0 0 2 49 10 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPL
98 121 A S H 3>>S+ 0 0 65 49 76 SSSSSSSSSATAAQSSAQDSKKKRTKKAKKKQQQERESRKPMSQQNNK
99 122 A A H 345S+ 0 0 32 49 74 AAAAAAAVAREEEVKAPVSTEETTKEEIEEEAREKAEAEEAAKLAQQS
100 123 A F H <45S+ 0 0 48 49 31 FFFFFFSFFFFFFFFFFFLMLLFFVLLFLLLFFLFFLILLVMFFFLLQ
101 124 A L H <5S+ 0 0 85 49 21 LLLLLLLLLLLLLLLLLLILLLVVKLLLLLLILLIVLLLLLIMIILLL
102 125 A E T <5S+ 0 0 165 48 24 EEE EEEEEEEDEDEEEEDEEEEEEEEEEEEDDEQEKDEENEKDDDDE
103 126 A L S