Complet list of 1x6f hssp file
Complete list of 1x6f.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1X6F
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-12
HEADER DNA BINDING PROTEIN 17-MAY-05 1X6F
COMPND MOL_ID: 1; MOLECULE: ZINC FINGER PROTEIN 462; CHAIN: A; FRAGMENT: C2H2
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR M.SATO,K.SAITO,S.KOSHIBA,M.INOUE,T.KIGAWA,S.YOKOYAMA,RIKEN STRUCTURAL
DBREF 1X6F A 8 82 UNP Q96JM2 ZN462_HUMAN 719 793
SEQLENGTH 88
NCHAIN 1 chain(s) in 1X6F data set
NALIGN 65
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : F7IRY7_CALJA 0.97 0.99 5 82 1872 1949 78 0 0 2143 F7IRY7 Uncharacterized protein OS=Callithrix jacchus GN=ZNF462 PE=4 SV=1
2 : H3BLX4_HUMAN 0.97 0.99 5 82 754 831 78 0 0 1449 H3BLX4 Zinc finger protein 462 (Fragment) OS=Homo sapiens GN=ZNF462 PE=2 SV=1
3 : H9F978_MACMU 0.97 0.99 5 82 771 848 78 0 0 1406 H9F978 Zinc finger protein 462 (Fragment) OS=Macaca mulatta GN=ZNF462 PE=2 SV=1
4 : F7H0U2_MACMU 0.95 0.96 5 84 1873 1952 80 0 0 2508 F7H0U2 Uncharacterized protein OS=Macaca mulatta GN=ZNF462 PE=4 SV=1
5 : G1S688_NOMLE 0.95 0.96 5 84 1873 1952 80 0 0 2508 G1S688 Uncharacterized protein OS=Nomascus leucogenys GN=ZNF462 PE=4 SV=2
6 : G3R0K4_GORGO 0.95 0.96 5 84 1856 1935 80 0 0 2493 G3R0K4 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101126923 PE=4 SV=1
7 : G3S4M6_GORGO 0.95 0.96 5 84 1871 1950 80 0 0 2506 G3S4M6 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101126923 PE=4 SV=1
8 : G7NEK9_MACMU 0.95 0.96 5 84 1904 1983 80 0 0 2600 G7NEK9 Zinc finger protein 462 OS=Macaca mulatta GN=EGK_07458 PE=4 SV=1
9 : G7PRR3_MACFA 0.95 0.96 5 84 1823 1902 80 0 0 2519 G7PRR3 Zinc finger protein 462 OS=Macaca fascicularis GN=EGM_06761 PE=4 SV=1
10 : H2PSZ0_PONAB 0.95 0.96 5 84 1827 1906 80 0 0 2462 H2PSZ0 Uncharacterized protein OS=Pongo abelii GN=ZNF462 PE=4 SV=1
11 : H2QXN1_PANTR 0.95 0.96 5 84 1871 1950 80 0 0 2506 H2QXN1 Uncharacterized protein OS=Pan troglodytes GN=ZNF462 PE=4 SV=1
12 : K7EVS1_PONAB 0.95 0.96 5 84 1672 1751 80 0 0 2307 K7EVS1 Uncharacterized protein OS=Pongo abelii GN=ZNF462 PE=4 SV=1
13 : Q63HJ5_HUMAN 0.95 0.96 5 84 1871 1950 80 0 0 2506 Q63HJ5 Putative uncharacterized protein DKFZp686L2367 OS=Homo sapiens GN=DKFZp686L2367 PE=2 SV=1
14 : Q68CP0_HUMAN 0.95 0.96 5 84 1768 1847 80 0 0 2403 Q68CP0 Putative uncharacterized protein DKFZp686B2325 (Fragment) OS=Homo sapiens GN=DKFZp686B2325 PE=2 SV=1
15 : U3B8G6_CALJA 0.95 0.96 5 84 1872 1951 80 0 0 2507 U3B8G6 Zinc finger protein 462 OS=Callithrix jacchus GN=ZNF462 PE=2 SV=1
16 : ZN462_HUMAN 1X6F 0.95 0.96 5 84 1871 1950 80 0 0 2506 Q96JM2 Zinc finger protein 462 OS=Homo sapiens GN=ZNF462 PE=1 SV=3
17 : F6UL46_HORSE 0.93 0.96 5 84 1866 1945 80 0 0 2501 F6UL46 Uncharacterized protein OS=Equus caballus GN=ZNF462 PE=4 SV=1
18 : H0V5P7_CAVPO 0.93 0.95 5 84 1820 1899 80 0 0 2455 H0V5P7 Uncharacterized protein OS=Cavia porcellus GN=ZNF462 PE=4 SV=1
19 : I3N957_SPETR 0.93 0.96 5 84 1865 1944 80 0 0 2501 I3N957 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=ZNF462 PE=4 SV=1
20 : L9KNT4_TUPCH 0.93 0.96 5 84 1820 1899 80 0 0 2515 L9KNT4 Zinc finger protein 462 OS=Tupaia chinensis GN=TREES_T100001550 PE=4 SV=1
21 : G1MF44_AILME 0.92 0.99 5 82 1331 1408 78 0 0 1966 G1MF44 Uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=ZNF462 PE=4 SV=1
22 : G1MF50_AILME 0.92 0.99 5 82 1839 1916 78 0 0 2110 G1MF50 Uncharacterized protein OS=Ailuropoda melanoleuca GN=ZNF462 PE=4 SV=1
23 : B1AWL4_MOUSE 0.91 0.97 5 82 712 789 78 0 0 1407 B1AWL4 Protein Zfp462 OS=Mus musculus GN=Zfp462 PE=2 SV=1
24 : B1AWL5_MOUSE 0.91 0.97 5 82 712 789 78 0 0 1408 B1AWL5 Protein Zfp462 OS=Mus musculus GN=Zfp462 PE=2 SV=1
25 : G1T3L7_RABIT 0.91 0.96 5 84 1786 1865 80 0 0 2426 G1T3L7 Uncharacterized protein OS=Oryctolagus cuniculus GN=ZNF462 PE=4 SV=1
26 : H0WJ07_OTOGA 0.91 0.96 5 84 1865 1944 80 0 0 2500 H0WJ07 Uncharacterized protein OS=Otolemur garnettii GN=ZNF462 PE=4 SV=1
27 : L5L305_PTEAL 0.91 0.96 5 84 1864 1943 80 0 0 2560 L5L305 Zinc finger protein 462 OS=Pteropus alecto GN=PAL_GLEAN10009395 PE=4 SV=1
28 : Q69ZA0_MOUSE 0.91 0.97 5 82 70 147 78 0 0 705 Q69ZA0 MKIAA1803 protein (Fragment) OS=Mus musculus GN=Zfp462 PE=2 SV=1
29 : D3ZFG7_RAT 0.90 0.96 5 84 1869 1948 80 0 0 2504 D3ZFG7 Protein Zfp462 OS=Rattus norvegicus GN=Zfp462 PE=4 SV=2
30 : D4A169_RAT 0.90 0.96 5 84 1863 1942 80 0 0 2498 D4A169 Protein Zfp462 OS=Rattus norvegicus GN=Zfp462 PE=4 SV=2
31 : G3GU04_CRIGR 0.90 0.96 5 84 1853 1932 80 0 0 2487 G3GU04 Zinc finger protein 462 OS=Cricetulus griseus GN=I79_001149 PE=4 SV=1
32 : G3T966_LOXAF 0.90 0.96 5 84 1863 1942 80 0 0 2498 G3T966 Uncharacterized protein OS=Loxodonta africana GN=ZNF462 PE=4 SV=1
33 : G3ULI6_LOXAF 0.90 0.96 5 84 1826 1905 80 0 0 2461 G3ULI6 Uncharacterized protein OS=Loxodonta africana GN=ZNF462 PE=4 SV=1
34 : A2SW42_MOUSE 0.89 0.95 5 84 1865 1944 80 0 0 2500 A2SW42 Zinc finger protein 462 OS=Mus musculus GN=Zfp462 PE=2 SV=1
35 : B1AWL2_MOUSE 0.89 0.95 5 84 1860 1939 80 0 0 2495 B1AWL2 Protein Zfp462 OS=Mus musculus GN=Zfp462 PE=2 SV=1
36 : E2R531_CANFA 0.89 0.96 5 84 1867 1946 80 0 0 2502 E2R531 Uncharacterized protein OS=Canis familiaris GN=ZNF462 PE=4 SV=2
37 : M3W1T5_FELCA 0.89 0.96 5 84 1867 1946 80 0 0 2502 M3W1T5 Uncharacterized protein OS=Felis catus GN=ZNF462 PE=4 SV=1
38 : F1MIB3_BOVIN 0.88 0.94 5 84 1865 1944 80 0 0 2502 F1MIB3 Uncharacterized protein OS=Bos taurus GN=ZNF462 PE=4 SV=2
39 : L5LEE0_MYODS 0.88 0.95 5 82 700 777 78 0 0 1331 L5LEE0 Zinc finger protein 462 OS=Myotis davidii GN=MDA_GLEAN10018133 PE=4 SV=1
40 : M3Y749_MUSPF 0.88 0.95 5 84 1868 1947 80 0 0 2503 M3Y749 Uncharacterized protein OS=Mustela putorius furo GN=ZNF462 PE=4 SV=1
41 : W5PBC2_SHEEP 0.88 0.94 5 84 1815 1894 80 0 0 2452 W5PBC2 Uncharacterized protein OS=Ovis aries GN=ZNF462 PE=4 SV=1
42 : G1PI35_MYOLU 0.86 0.94 5 84 1861 1940 80 0 0 2492 G1PI35 Uncharacterized protein OS=Myotis lucifugus GN=ZNF462 PE=4 SV=1
43 : S7P150_MYOBR 0.86 0.94 5 84 1812 1891 80 0 0 2443 S7P150 Zinc finger protein 462 OS=Myotis brandtii GN=D623_10031590 PE=4 SV=1
44 : F6TW63_ORNAN 0.84 0.90 5 84 1867 1946 80 0 0 2560 F6TW63 Uncharacterized protein OS=Ornithorhynchus anatinus GN=ZNF462 PE=4 SV=2
45 : M7BP99_CHEMY 0.84 0.89 5 84 1821 1900 80 0 0 2387 M7BP99 Uncharacterized protein OS=Chelonia mydas GN=UY3_05254 PE=4 SV=1
46 : E1C5J4_CHICK 0.82 0.90 5 84 1824 1903 80 0 0 2452 E1C5J4 Uncharacterized protein OS=Gallus gallus GN=ZNF462 PE=4 SV=2
47 : G1N713_MELGA 0.82 0.90 5 84 1336 1415 80 0 0 1964 G1N713 Uncharacterized protein OS=Meleagris gallopavo GN=ZNF462 PE=4 SV=2
48 : G3W263_SARHA 0.82 0.89 5 84 1846 1925 80 0 0 2483 G3W263 Uncharacterized protein OS=Sarcophilus harrisii GN=ZNF462 PE=4 SV=1
49 : U3IZR7_ANAPL 0.82 0.90 5 84 1791 1870 80 0 0 2422 U3IZR7 Uncharacterized protein OS=Anas platyrhynchos GN=ZNF462 PE=4 SV=1
50 : G5AKH7_HETGA 0.81 0.92 7 82 501 577 77 1 1 609 G5AKH7 Tudor domain-containing protein 3 OS=Heterocephalus glaber GN=GW7_21815 PE=4 SV=1
51 : H0Z7I5_TAEGU 0.81 0.91 5 84 1834 1913 80 0 0 2463 H0Z7I5 Uncharacterized protein OS=Taeniopygia guttata GN=ZNF462 PE=4 SV=1
52 : R7VU07_COLLI 0.81 0.90 5 84 1824 1903 80 0 0 2456 R7VU07 Zinc finger protein 462 OS=Columba livia GN=A306_03420 PE=4 SV=1
53 : U3JJP7_FICAL 0.81 0.91 5 84 1818 1897 80 0 0 2447 U3JJP7 Uncharacterized protein OS=Ficedula albicollis GN=ZNF462 PE=4 SV=1
54 : F7A3X9_MONDO 0.80 0.89 5 84 1877 1956 80 0 0 2504 F7A3X9 Uncharacterized protein OS=Monodelphis domestica GN=ZNF462 PE=4 SV=2
55 : K7GFS6_PELSI 0.80 0.88 5 84 1820 1899 80 0 0 2440 K7GFS6 Uncharacterized protein OS=Pelodiscus sinensis GN=ZNF462 PE=4 SV=1
56 : H9G7R9_ANOCA 0.74 0.85 5 84 1811 1890 80 0 0 2082 H9G7R9 Uncharacterized protein OS=Anolis carolinensis GN=znf462 PE=4 SV=1
57 : F7A278_XENTR 0.67 0.84 5 73 1088 1157 70 1 1 1714 F7A278 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=znf462 PE=4 SV=1
58 : H3A1H2_LATCH 0.60 0.79 5 77 1803 1875 73 0 0 2440 H3A1H2 Uncharacterized protein OS=Latimeria chalumnae PE=4 SV=2
59 : H2TDW5_TAKRU 0.54 0.76 7 73 68 134 67 0 0 626 H2TDW5 Uncharacterized protein (Fragment) OS=Takifugu rubripes GN=LOC101072841 PE=4 SV=1
60 : H2TDW4_TAKRU 0.52 0.75 5 73 1702 1770 69 0 0 2270 H2TDW4 Uncharacterized protein OS=Takifugu rubripes GN=LOC101072841 PE=4 SV=1
61 : G3PW60_GASAC 0.51 0.72 5 73 1749 1817 69 0 0 2305 G3PW60 Uncharacterized protein OS=Gasterosteus aculeatus GN=ZNF462 PE=4 SV=1
62 : I3KCJ1_ORENI 0.47 0.68 6 73 1737 1804 68 0 0 2307 I3KCJ1 Uncharacterized protein OS=Oreochromis niloticus GN=ZNF462 PE=4 SV=1
63 : E7FG04_DANRE 0.46 0.63 5 75 1785 1855 71 0 0 2502 E7FG04 Uncharacterized protein OS=Danio rerio PE=4 SV=1
64 : W5LG04_ASTMX 0.45 0.63 5 75 1786 1856 71 0 0 2592 W5LG04 Uncharacterized protein OS=Astyanax mexicanus GN=ZNF462 PE=4 SV=1
65 : M4AI41_XIPMA 0.42 0.71 7 75 1729 1797 69 0 0 2284 M4AI41 Uncharacterized protein OS=Xiphophorus maculatus GN=ZNF462 PE=4 SV=1
## ALIGNMENTS 1 - 65
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 107 1 0
2 2 A S + 0 0 139 1 0
3 3 A S - 0 0 112 1 0
4 4 A G + 0 0 73 1 0
5 5 A S - 0 0 128 62 31 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS SSSSSSSS DG DD
6 6 A S + 0 0 129 63 45 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRKKKKKK KKKKRKKK NSNKR
7 7 A G + 0 0 81 66 6 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEED
8 8 A L + 0 0 134 66 2 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLILLLLLLLL
9 9 A K + 0 0 194 66 0 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
10 10 A R - 0 0 212 66 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
11 11 A D - 0 0 144 66 2 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDDDDDDDDDDDDDD
12 12 A F - 0 0 168 66 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
13 13 A I + 0 0 141 66 46 IIIIIIIIIIIIIIIIIVVVVVVVVVVVVVVVVVVVVVVVVVVTTTTVTVSTSVTTSEVVVVVVL
14 14 A I + 0 0 141 66 26 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVVVVVVTTV
15 15 A L + 0 0 166 66 37 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLSSVIS
16 16 A G + 0 0 76 66 47 GGGGGGGGGGGGGGGGGGGGGGGGSGGGGGGSSGGAGGGGAGGGGGGGGGGGGGGEgAPPPPPPS
17 17 A N + 0 0 140 66 56 NNNNNNNNNNNNNNNNSNNNSSSSNNNSSSSSSSSSSGGGGSSSCSSNSSSSSNCSsSDDGGSST
18 18 A G - 0 0 35 66 56 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGRGGGGGGGSPPPLPLP
19 19 A P S S+ 0 0 113 66 76 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPTTTTTTPTTTTTITTGGEWVVE
20 20 A R - 0 0 189 66 57 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRCRRRRRHRRDDDDEED
21 21 A L - 0 0 124 66 99 LLLLLLLLLLLLLLLLLLLLLLFFLLLFFFFLLFFLLLPLLPPSSSSSSLSSSSSFSSEEEEDDE
22 22 A Q - 0 0 131 66 51 QQQQQQQQQQQQQQQQQLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLQQQQQLQPLHDDEDGGE
23 23 A N - 0 0 124 66 63 NNNNNNNNNNNNNNNNNNSNNNNNNNNNNNNSSNNNSGGSGGGSSNNGNNSNSGSDTGSSSPTTS
24 24 A S S S+ 0 0 115 66 72 SSSSSSSSSSSSSSSSSSSSPPSSSSSSSSSSSSSPPSPPSPPSSAASASPPPTSPTYEEEDEEE
25 25 A T - 0 0 59 66 87 TTTTTTTTTTTTTTTTATTAAATTAAATTTTTTTTAAAAAAAAAAVVAVTVVVAGLTNWWLLLLL
26 26 A Y E -A 35 0A 85 66 1 YYYYYYYYYYYYYYYYYYYYYYFFYYYFFFFYYFFYYYYYYYYYYYYFYYYYYFYYYYYYYYYYY
27 27 A Q E -A 34 0A 108 66 18 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQKLQQE
28 28 A C - 0 0 4 66 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
29 29 A K S S+ 0 0 158 66 50 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKKKKKKKRKKSSTSTVS
30 30 A H S S+ 0 0 114 66 0 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
31 31 A C S S- 0 0 65 66 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
32 32 A D + 0 0 118 66 21 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDEDDDDDDDDDDDDDDDNEEEEEEEE
33 33 A S - 0 0 46 66 81 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSMMSMSTMTNSCSFLLLLIIL
34 34 A K E +A 27 0A 117 66 4 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKKKKKRKKKKKRKKKKKKKKKKK
35 35 A L E -A 26 0A 15 66 10 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFFFFF
36 36 A Q S S- 0 0 153 66 68 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHHHQHQHHHQQHNQLLLLLLT
37 37 A S S >> S- 0 0 57 66 43 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSsSSSSSSSTAAADTTG
38 38 A T H 3> S+ 0 0 72 66 71 TTTTTTTTTTTTTTTTTTTTTTIITTTITTTTTIITTTTTTTTVMTTMTaTTTMLIVLLLLLLLL
39 39 A A H 34 S+ 0 0 84 66 41 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEAAAAAAAEPPPPPPP
40 40 A E H <> S+ 0 0 101 66 32 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEHEEEEEEDEAAVAEED
41 41 A L H X S+ 0 0 33 66 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
42 42 A T H X S+ 0 0 81 66 46 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTPTTTTATTGAAAASSA
43 43 A S H > S+ 0 0 73 66 65 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSAASASAAASTSIGKKKKITD
44 44 A H H >X S+ 0 0 34 66 0 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
45 45 A L H >X S+ 0 0 49 66 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
46 46 A N H 3X S+ 0 0 87 66 68 NNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNILLLLTTL
47 47 A I H