Complet list of 1ug2 hssp file
Complete list of 1ug2.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1UG2
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-10
HEADER STRUCTURAL GENOMICS, UNKNOWN FUNCTION 11-JUN-03 1UG2
COMPND MOL_ID: 1; MOLECULE: 2610100B20RIK GENE PRODUCT; CHAIN: A; FRAGMENT: M
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: MUS MUSCULUS; ORGANISM_COMMON: HOUSE M
AUTHOR C.ZHAO,T.KIGAWA,N.TOCHIO,S.KOSHIBA,M.INOUE,M.SHIROUZU, T.TERADA,T.YABU
DBREF 1UG2 A 9 89 UNP Q9DB00 GON4_MOUSE 2148 2228
SEQLENGTH 95
NCHAIN 1 chain(s) in 1UG2 data set
NALIGN 83
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : G3WEB8_SARHA 0.88 0.94 13 95 1238 1320 83 0 0 1346 G3WEB8 Uncharacterized protein (Fragment) OS=Sarcophilus harrisii GN=GON4L PE=4 SV=1
2 : J9P8X5_CANFA 0.88 0.94 16 95 578 657 80 0 0 674 J9P8X5 Uncharacterized protein (Fragment) OS=Canis familiaris GN=GON4L PE=4 SV=1
3 : A4PB69_RAT 0.87 0.92 4 95 2139 2230 92 0 0 2256 A4PB69 YY1AP-related protein1 OS=Rattus norvegicus PE=2 SV=1
4 : D3ZMG1_RAT 0.87 0.92 4 95 2139 2230 92 0 0 2256 D3ZMG1 GON-4-like protein OS=Rattus norvegicus GN=Gon4l PE=4 SV=2
5 : E9Q507_MOUSE 0.87 0.92 1 95 2122 2216 95 0 0 2242 E9Q507 GON-4-like protein OS=Mus musculus GN=Gon4l PE=4 SV=1
6 : G5ARB2_HETGA 0.87 0.94 14 95 2027 2108 82 0 0 2134 G5ARB2 GON-4-like protein OS=Heterocephalus glaber GN=GW7_14301 PE=4 SV=1
7 : GON4L_MOUSE 1UG2 0.87 0.92 1 95 2140 2234 95 0 0 2260 Q9DB00 GON-4-like protein OS=Mus musculus GN=Gon4l PE=1 SV=3
8 : GON4L_RAT 0.87 0.92 4 95 2139 2230 92 0 0 2256 Q535K8 GON-4-like protein OS=Rattus norvegicus GN=Gon4l PE=2 SV=1
9 : I3M4Q8_SPETR 0.87 0.95 12 95 1348 1431 84 0 0 1457 I3M4Q8 Uncharacterized protein (Fragment) OS=Spermophilus tridecemlineatus PE=4 SV=1
10 : K4DI71_MOUSE 0.87 0.92 1 95 2123 2217 95 0 0 2243 K4DI71 GON-4-like protein OS=Mus musculus GN=Gon4l PE=4 SV=1
11 : Q32NZ8_MOUSE 0.87 0.92 1 95 1916 2010 95 0 0 2036 Q32NZ8 5830417I10Rik protein (Fragment) OS=Mus musculus GN=Gon4l PE=2 SV=1
12 : W5P3U2_SHEEP 0.85 0.93 9 95 1923 2009 87 0 0 2035 W5P3U2 Uncharacterized protein OS=Ovis aries PE=4 SV=1
13 : G3HC32_CRIGR 0.84 0.91 1 95 625 719 95 0 0 745 G3HC32 GON-4-like protein OS=Cricetulus griseus GN=I79_007981 PE=4 SV=1
14 : M1ESJ0_MUSPF 0.83 0.94 13 95 183 265 83 0 0 291 M1ESJ0 Gon-4-like protein (Fragment) OS=Mustela putorius furo PE=2 SV=1
15 : F1RLM0_PIG 0.82 0.93 6 95 1244 1333 90 0 0 1359 F1RLM0 Uncharacterized protein (Fragment) OS=Sus scrofa GN=GON4L PE=4 SV=2
16 : G1SYT8_RABIT 0.82 0.91 6 95 1874 1963 90 0 0 1989 G1SYT8 Uncharacterized protein (Fragment) OS=Oryctolagus cuniculus PE=4 SV=1
17 : H2N5G1_PONAB 0.82 0.90 6 95 2127 2216 90 0 0 2242 H2N5G1 Uncharacterized protein OS=Pongo abelii GN=GON4L PE=4 SV=1
18 : I3LQ61_PIG 0.82 0.93 6 95 1219 1308 90 0 0 1334 I3LQ61 Uncharacterized protein (Fragment) OS=Sus scrofa GN=LOC100625588 PE=4 SV=1
19 : F6QF38_MONDO 0.81 0.90 6 95 2084 2173 90 0 0 2199 F6QF38 Uncharacterized protein OS=Monodelphis domestica PE=4 SV=2
20 : F6R230_MACMU 0.81 0.91 5 95 1604 1694 91 0 0 1720 F6R230 Uncharacterized protein OS=Macaca mulatta GN=GON4L PE=4 SV=1
21 : F7DHH2_HORSE 0.81 0.91 5 95 2116 2206 91 0 0 2232 F7DHH2 Uncharacterized protein OS=Equus caballus GN=GON4L PE=4 SV=1
22 : G3RVP1_GORGO 0.81 0.91 5 95 1497 1587 91 0 0 1613 G3RVP1 Uncharacterized protein (Fragment) OS=Gorilla gorilla gorilla GN=101135693 PE=4 SV=1
23 : G3SVS1_LOXAF 0.81 0.91 5 95 2104 2194 91 0 0 2220 G3SVS1 Uncharacterized protein OS=Loxodonta africana GN=LOC100669025 PE=4 SV=1
24 : G3U1B9_LOXAF 0.81 0.91 5 95 2098 2188 91 0 0 2214 G3U1B9 Uncharacterized protein OS=Loxodonta africana GN=LOC100669025 PE=4 SV=1
25 : H0WHB8_OTOGA 0.81 0.92 8 95 2131 2218 88 0 0 2243 H0WHB8 Uncharacterized protein OS=Otolemur garnettii GN=GON4L PE=4 SV=1
26 : H9FW76_MACMU 0.81 0.91 5 95 2124 2214 91 0 0 2240 H9FW76 GON-4-like protein isoform a OS=Macaca mulatta GN=GON4L PE=2 SV=1
27 : I2CX94_MACMU 0.81 0.91 5 95 2124 2214 91 0 0 2240 I2CX94 GON-4-like protein isoform a OS=Macaca mulatta GN=GON4L PE=2 SV=1
28 : K7C6X1_PANTR 0.81 0.91 5 95 2124 2214 91 0 0 2240 K7C6X1 Gon-4-like OS=Pan troglodytes GN=GON4L PE=2 SV=1
29 : K7CP37_PANTR 0.81 0.91 5 95 2124 2214 91 0 0 2240 K7CP37 Gon-4-like OS=Pan troglodytes GN=GON4L PE=2 SV=1
30 : L9KHM1_TUPCH 0.81 0.90 5 95 1246 1336 91 0 0 1362 L9KHM1 GON-4-like protein OS=Tupaia chinensis GN=TREES_T100014146 PE=4 SV=1
31 : Q9NXJ9_HUMAN 0.81 0.91 6 95 582 671 90 0 0 697 Q9NXJ9 cDNA FLJ20203 fis, clone COLF1334 OS=Homo sapiens PE=2 SV=1
32 : S9XE06_9CETA 0.81 0.91 5 95 1811 1901 91 0 0 1927 S9XE06 GON-4-like protein OS=Camelus ferus GN=CB1_000161035 PE=4 SV=1
33 : W5P3U6_SHEEP 0.81 0.92 5 95 2124 2214 91 0 0 2240 W5P3U6 Uncharacterized protein OS=Ovis aries PE=4 SV=1
34 : A4PB67_HUMAN 0.80 0.91 5 95 2124 2214 91 0 0 2240 A4PB67 YY1AP-related protein1 OS=Homo sapiens PE=2 SV=1
35 : A6QNS1_BOVIN 0.80 0.92 5 95 1591 1681 91 0 0 1707 A6QNS1 GON4L protein (Fragment) OS=Bos taurus GN=GON4L PE=2 SV=1
36 : F1MP31_BOVIN 0.80 0.92 5 95 2123 2213 91 0 0 2239 F1MP31 Uncharacterized protein OS=Bos taurus GN=GON4L PE=4 SV=1
37 : F7GXE8_CALJA 0.80 0.91 5 95 2124 2214 91 0 0 2240 F7GXE8 Uncharacterized protein OS=Callithrix jacchus GN=GON4L PE=4 SV=1
38 : F7HP47_CALJA 0.80 0.91 5 95 2118 2208 91 0 0 2234 F7HP47 Uncharacterized protein OS=Callithrix jacchus GN=GON4L PE=4 SV=1
39 : F7HUX6_CALJA 0.80 0.91 5 95 2123 2213 91 0 0 2239 F7HUX6 Uncharacterized protein OS=Callithrix jacchus GN=GON4L PE=4 SV=1
40 : GON4L_HUMAN 0.80 0.91 5 95 2125 2215 91 0 0 2241 Q3T8J9 GON-4-like protein OS=Homo sapiens GN=GON4L PE=1 SV=1
41 : H0V441_CAVPO 0.80 0.90 5 95 2065 2155 91 0 0 2181 H0V441 Uncharacterized protein OS=Cavia porcellus PE=4 SV=1
42 : D2HZK3_AILME 0.79 0.91 5 95 2120 2210 91 0 0 2235 D2HZK3 Putative uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_018252 PE=4 SV=1
43 : E2RT89_CANFA 0.79 0.89 5 95 2110 2200 91 0 0 2226 E2RT89 Uncharacterized protein OS=Canis familiaris GN=GON4L PE=4 SV=2
44 : G1KZJ4_AILME 0.79 0.91 5 95 2114 2204 91 0 0 2230 G1KZJ4 Uncharacterized protein OS=Ailuropoda melanoleuca GN=GON4L PE=4 SV=1
45 : G1PES6_MYOLU 0.79 0.89 5 95 2120 2210 91 0 0 2236 G1PES6 Uncharacterized protein OS=Myotis lucifugus PE=4 SV=1
46 : M3WGD9_FELCA 0.79 0.91 5 95 2101 2191 91 0 0 2217 M3WGD9 Uncharacterized protein OS=Felis catus PE=4 SV=1
47 : S7MVN7_MYOBR 0.79 0.89 5 95 2066 2156 91 0 0 2182 S7MVN7 GON-4-like protein OS=Myotis brandtii GN=D623_10031967 PE=4 SV=1
48 : K9IQL4_DESRO 0.77 0.87 5 95 2125 2215 91 0 0 2241 K9IQL4 Putative gon-4-like protein OS=Desmodus rotundus PE=2 SV=1
49 : G1MQA2_MELGA 0.74 0.90 15 95 1916 1996 81 0 0 2022 G1MQA2 Uncharacterized protein (Fragment) OS=Meleagris gallopavo PE=4 SV=2
50 : M3Y1N5_MUSPF 0.74 0.88 2 95 2112 2205 94 0 0 2231 M3Y1N5 Uncharacterized protein OS=Mustela putorius furo GN=Gon4l PE=4 SV=1
51 : R0L006_ANAPL 0.74 0.90 15 95 1973 2053 81 0 0 2079 R0L006 GON-4-like protein (Fragment) OS=Anas platyrhynchos GN=Anapl_06571 PE=4 SV=1
52 : U3J3E3_ANAPL 0.74 0.90 15 95 1873 1953 81 0 0 1979 U3J3E3 Uncharacterized protein (Fragment) OS=Anas platyrhynchos PE=4 SV=1
53 : H9L0N1_CHICK 0.71 0.87 11 95 2132 2216 85 0 0 2242 H9L0N1 Uncharacterized protein OS=Gallus gallus GN=GON4L PE=4 SV=2
54 : H1A477_TAEGU 0.68 0.88 5 95 1788 1878 91 0 0 1904 H1A477 Uncharacterized protein (Fragment) OS=Taeniopygia guttata GN=GON4L PE=4 SV=1
55 : K7FJ22_PELSI 0.68 0.80 2 95 1942 2035 94 0 0 2061 K7FJ22 Uncharacterized protein (Fragment) OS=Pelodiscus sinensis PE=4 SV=1
56 : M7B746_CHEMY 0.67 0.80 4 95 2197 2288 92 0 0 2314 M7B746 GON-4-like protein OS=Chelonia mydas GN=UY3_14910 PE=4 SV=1
57 : H9G779_ANOCA 0.66 0.83 2 95 1494 1587 94 0 0 1613 H9G779 Uncharacterized protein (Fragment) OS=Anolis carolinensis PE=4 SV=1
58 : A9UMQ1_XENLA 0.65 0.86 19 95 631 707 77 0 0 726 A9UMQ1 LOC100137703 protein (Fragment) OS=Xenopus laevis GN=LOC100137703 PE=2 SV=1
59 : U3JJH8_FICAL 0.64 0.83 2 95 1948 2041 94 0 0 2067 U3JJH8 Uncharacterized protein (Fragment) OS=Ficedula albicollis PE=4 SV=1
60 : H2ZYA2_LATCH 0.62 0.85 5 95 1497 1587 91 0 0 1612 H2ZYA2 Uncharacterized protein (Fragment) OS=Latimeria chalumnae PE=4 SV=1
61 : W5MK96_LEPOC 0.60 0.79 12 95 1873 1956 84 0 0 1966 W5MK96 Uncharacterized protein (Fragment) OS=Lepisosteus oculatus PE=4 SV=1
62 : G3NFE6_GASAC 0.59 0.79 20 95 538 613 76 0 0 628 G3NFE6 Uncharacterized protein (Fragment) OS=Gasterosteus aculeatus PE=4 SV=1
63 : Q0IHZ8_XENTR 0.59 0.81 5 95 644 734 91 0 0 751 Q0IHZ8 Gon-4-like OS=Xenopus tropicalis GN=gon4l PE=2 SV=1
64 : Q4VA68_XENTR 0.59 0.81 5 95 586 676 91 0 0 693 Q4VA68 Gon4l protein (Fragment) OS=Xenopus tropicalis GN=gon4l PE=2 SV=1
65 : W5KW33_ASTMX 0.59 0.79 15 87 1707 1779 73 0 0 1802 W5KW33 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
66 : B2GU93_XENTR 0.58 0.80 3 95 1282 1374 93 0 0 1391 B2GU93 Gon4l protein OS=Xenopus tropicalis GN=gon4l PE=2 SV=1
67 : F6SYN8_XENTR 0.58 0.80 3 95 1513 1605 93 0 0 1622 F6SYN8 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=gon4l PE=4 SV=1
68 : F6SYP6_XENTR 0.58 0.80 3 95 1173 1265 93 0 0 1282 F6SYP6 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=gon4l PE=4 SV=1
69 : H2S3H8_TAKRU 0.58 0.75 19 95 643 719 77 0 0 734 H2S3H8 Uncharacterized protein (Fragment) OS=Takifugu rubripes PE=4 SV=1
70 : S4RNV3_PETMA 0.57 0.77 3 93 274 364 91 0 0 388 S4RNV3 Uncharacterized protein (Fragment) OS=Petromyzon marinus PE=4 SV=1
71 : W5LGH8_ASTMX 0.56 0.80 12 95 2077 2160 84 0 0 2174 W5LGH8 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
72 : W5LGI2_ASTMX 0.56 0.80 12 95 1965 2048 84 0 0 2062 W5LGI2 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
73 : H3CKQ6_TETNG 0.55 0.77 12 95 571 654 84 0 0 668 H3CKQ6 Uncharacterized protein (Fragment) OS=Tetraodon nigroviridis PE=4 SV=1
74 : A4ZXU1_DANRE 0.54 0.79 1 95 1947 2041 95 0 0 2055 A4ZXU1 Ugly duckling OS=Danio rerio GN=gon4l PE=2 SV=1
75 : F1QI87_DANRE 0.54 0.79 1 95 1947 2041 95 0 0 2055 F1QI87 Uncharacterized protein OS=Danio rerio GN=gon4l PE=4 SV=1
76 : I3IYP6_ORENI 0.52 0.77 6 95 1608 1697 90 0 0 1712 I3IYP6 Uncharacterized protein OS=Oreochromis niloticus PE=4 SV=1
77 : M3ZRX1_XIPMA 0.51 0.79 4 95 1356 1447 92 0 0 1461 M3ZRX1 Uncharacterized protein (Fragment) OS=Xiphophorus maculatus PE=4 SV=1
78 : A7SZC4_NEMVE 0.43 0.68 19 92 125 198 74 0 0 223 A7SZC4 Predicted protein OS=Nematostella vectensis GN=v1g248199 PE=4 SV=1
79 : V4B9Z7_LOTGI 0.41 0.67 23 95 551 622 73 1 1 626 V4B9Z7 Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_237967 PE=4 SV=1
80 : T1J6B0_STRMM 0.37 0.63 1 95 1204 1298 95 0 0 1306 T1J6B0 Uncharacterized protein OS=Strigamia maritima PE=4 SV=1
81 : C3Y1N7_BRAFL 0.36 0.62 4 95 1918 2009 92 0 0 2035 C3Y1N7 Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_78599 PE=4 SV=1
82 : H0V2E9_CAVPO 0.32 0.62 15 87 1859 1931 73 0 0 1934 H0V2E9 Uncharacterized protein OS=Cavia porcellus GN=Casp8ap2 PE=4 SV=1
83 : B1AX75_MOUSE 0.31 0.56 11 87 111 187 78 2 2 190 B1AX75 CASP8-associated protein 2 OS=Mus musculus GN=Casp8ap2 PE=2 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 122 9 23 G G GG G
2 2 A P + 0 0 136 13 78 T T TT T P G S A
3 3 A S - 0 0 117 17 74 Q Q QQ Q G R K G DDD S
4 4 A G - 0 0 69 23 61 GGG GG GG G K GGD P PPP G
5 5 A S + 0 0 120 54 73 KKM MK MM K GGGGG GGGGG GGGGGGGGGGGGGGAGG G GQQT GP NN NNN N
6 6 A S + 0 0 128 61 67 GGG GG GG G PPPPPPPPPP PPPPPPPPPPPPPPPPPPPPPPP P ARRN PH SS SSS S
7 7 A G + 0 0 74 61 80 PPP PP PP P AEEADEVEEE EEEEEEVAEAAEEEEDVVVEVEE M AEEG RK KK KKK K
8 8 A A + 0 0 112 62 58 EEE EE EE E GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG G ADDG EG EE EEE H
9 9 A G - 0 0 77 63 67 AAG GA GGSG EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE E EEEL QS SS SSS Q
10 10 A A + 0 0 91 63 77 VVA AV AANA PHQPQQQQQQQQQQQQQQLQLLQQQQQPQPQQQQ L GQQQ QV PP PPP A
11 11 A L + 0 0 144 65 75 LLL LL LLQL QPQQQQQQQQQQQQQQQQQQQQQQQQQHPHQQQQ H QRQQA QE DD DDD E
12 12 A P - 0 0 124 70 57 PPP PPPPPAP PPPPLPPPPPPPPPPPPPAPAAPPPPPPLPPPPP L QPQQQ QQS PP PPP P
13 13 A K - 0 0 171 72 57 K KKK KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK K RRKKR RKS DD DDD L
14 14 A A - 0 0 101 73 64 A AAAAAAAAAAVAAAAAAAAAAATAAAAAAAAAAAAAAAAAAAATAA A VVIIA VGG EE EEE E
15 15 A S - 0 0 118 78 76 S SSSTSSTSSTSTTATTSTTATTTTTTTTATTATTTTTATTTTTTTTTTTTTTMMA ASS QQPQQQ R
16 16 A E - 0 0 188 79 51 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EED TTKTTT L
17 17 A A - 0 0 91 79 53 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAVAVVAAAAAAAAA AMS SSSSSS H
18 18 A T - 0 0 141 79 41 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TIT TTPTTT V
19 19 A V - 0 0 133 82 16 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVMVVV VVVVVVVV
20 20 A C - 0 0 122 83 29 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
21 21 A A - 0 0 102 83 15 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
22 22 A N - 0 0 148 83 58 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKNKKKKKKKKKKKKKKKKKKKK
23 23 A N - 0 0 137 84 16 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
24 24 A S + 0 0 111 84 73 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSIISIIIIISIIIIS
25 25 A K + 0 0 159 84 62 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKSSTTSTTTST
26 26 A V + 0 0 128 84 33 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVLLVVRVVVLV
27 27 A S - 0 0 73 84 47 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSSSMSSSTS
28 28 A S + 0 0 117 84 43 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPASSPSSSAS
29 29 A T + 0 0 64 84 56 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTSSSSSSSSNT
30 30 A G + 0 0 46 84 5 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
31 31 A E + 0 0 163 84 7 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
32 32 A K - 0 0 171 84 39 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKL
33 33 A V + 0 0 141 84 11 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
34 34 A V S S- 0 0 108 84 22 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIIIIVIIIIV
35 35 A L S S+ 0 0 141 83 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
36 36 A W S S- 0 0 31 84 6 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
37 37 A T > - 0 0 78 84 12 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTTTTTTTTT
38 38 A R H >> S+ 0 0 63 84 2 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
39 39 A E H 3> S+ 0 0 146 84 12 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
40 40 A A H 3> S+ 0 0 18 84 17 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
41 41 A D H S+ 0 0 28 84 68 MMMMMMMMMMMMMMMVMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMTTMTTTTTTTMSATTMMTMMMRT
48 48 A C I <>S+ 0 0 10 84 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
49 49 A Q I <5S+ 0 0 154 84 3 QQQQQQQQQQQQQQQQQQQQQQQQKQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQKQQQQQQQQQQQQ
50 50 A E I <5S+ 0 0 171 84 37 EEEEEEEEEEEEEQEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEQEEEEEEEEEEQREEQEEEQE
51 51 A Q I <5S- 0 0 105 84 56 RKQQQQQQQQQQQQQQQQRQQQQQQQQQQQQQQQQQQQQQQRKRQRQKKQKKKKRRRRKRQERRRRRRDK
52 52 A G I < - 0 0 14 84 13 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
53 53 A A S - 0 0 106 84 68 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHQHHHHHHTWHNSNQQSQQQNN
55 55 A P T 4 S+ 0 0 98 84 67 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPLPLLLLLLPPLQQQPPQPPPQG
56 56 A H T >4 S+ 0 0 162 84 67 QQHHHQHHQHHQHQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQEQEEEDDEDEDEAEDDSDDDNA
57 57 A T T >> S+ 0 0 18 84 6 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
58 58 A F H 3X S+ 0 0 16 84 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
59 59 A S H <4 S+ 0 0 55 84 78 SSSSSSSSSSSSSRSSSSSSSSSSSSSSSSNSSNSSGGGNSSSSRSRRQRHHQHSSGGHTQQAASAAAQT
60 60 A V H X> S+ 0 0 52 84 82 TSVVVSVVIVVVIGIVSITIIIIIIIIIISIIVIVVIIIISSSSIVINAGAAAAAAAAATASDDFDDDAE
61 61 A I H 3X S+ 0 0 0 84 14 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIVIVIIIIIIIV
62 62 A S H 3X S+ 0 0 10 84 20 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSASSSSASSSSA
63 63 A Q H <4 S+ 0 0 150 84 54 QQQQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQRQEATQQAQQQTK
64 64 A Q H < S+ 0 0 125 84 64 QQQQQQQQQQQQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQKRKKKKQERLKEQLSSQSSSLL
65 65 A L H < S- 0 0 43 84 2 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
66 66 A G S < S- 0 0 47 84 18 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGCGGGGSGGGGGGGG
67 67 A N S S+ 0 0 160 83 14 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
68 68 A K - 0 0 68 84 6 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK
69 69 A T > - 0 0 53 84 29 TTTTTTTTTTTTTTTTTTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTTTSSTSSSTS
70 70 A P H > S+ 0 0 55 84 43 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPAPAAAAAAAAAATPAAAAAAPS
71 71 A V H > S+ 0 0 97 84 76 GTVVVAVVAVVTLTSAASGATAAAAAAAAAAATATTTTTAAATATSTTATSSASSSDESVSSDDTDDDNQ
72 72 A E H > S+ 0 0 86 84 10 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
73 73 A V H X S+ 0 0 0 84 1 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
74 74 A S H X S+ 0 0 33 84 26 SSSSSSSSASSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSGSSSFS
75 75 A H H X S+ 0 0 95 84 58 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHWHRRRRRARRRCR
76 76 A R H X S+ 0 0 75 84 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
77 77 A F H X S+ 0 0 47 84 0 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFYFFFFF
78 78 A R H X S+ 0 0 126 84 42 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRGRRRRRRRRRRRRRRRRQRRRRQQLQQQQR
79 79 A E H X S+ 0 0 66 84 26 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDDDDDDDDDE
80 80 A L H >X S+ 0 0 6 84 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
81 81 A M H 3X S+ 0 0 75 84 31 MMMMMMMMMMMMMVMMMMMMMMMMMMMMMMMMMMVVMMMMMVMVMMMMMVMMMMMMMVMVMMVVIVVVMV
82 82 A Q H 3X S+ 0 0 110 84 69 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQRQQKQKKKRRRALRQRRSSNSSSGR
83 83 A L H - 0 0 78 84 12 TTTTTTTTSTTTQ
38 38 A R H >> S+ 0 0 63 84 2 RRRRRRRRRKRRR
39 39 A E H 3> S+ 0 0 146 84 12 EEEEEEEEEADNN
40 40 A A H 3> S+ 0 0 18 84 17 AAAAAAAAMAQDD
41 41 A D H S+ 0 0 28 84 68 AARTTTMTMTAEE
48 48 A C I <>S+ 0 0 10 84 0 CCCCCCCCCCCCC
49 49 A Q I <5S+ 0 0 154 84 3 QQQQQQQQQQQQQ
50 50 A E I <5S+ 0 0 171 84 37 QQQQQQQEIQQKK
51 51 A Q I <5S- 0 0 105 84 56 QQDQQEELDLQRR
52 52 A G I < - 0 0 14 84 13 GGGGGGGGGGGRM
53 53 A A S - 0 0 106 84 68 NNNSSNNTKTTSS
55 55 A P T 4 S+ 0 0 98 84 67 QQQLQQQQPDALL
56 56 A H T >4 S+ 0 0 162 84 67 SSSSSNNDEVEKK
57 57 A T T >> S+ 0 0 18 84 6 TTTTTTATTATTT
58 58 A F H 3X S+ 0 0 16 84 0 FFFFFFFFYFFFF
59 59 A S H <4 S+ 0 0 55 84 78 QQQQQQQIQQATT
60 60 A V H X> S+ 0 0 52 84 82 AAAAAAAQKIVQY
61 61 A I H 3X S+ 0 0 0 84 14 VVIVVIILIVVLL
62 62 A S H 3X S+ 0 0 10 84 20 SSSSSSSASSAAA
63 63 A Q H <4 S+ 0 0 150 84 54 AASEESNKQHEVV
64 64 A Q H < S+ 0 0 125 84 64 EELQQLLRKQQKK
65 65 A L H < S- 0 0 43 84 2 LLLLLLLILLLLL
66 66 A G S < S- 0 0 47 84 18 GGGGGGGAVGGNN
67 67 A N S S+ 0 0 160 83 14 NNNNNNNGTNDK.
68 68 A K - 0 0 68 84 6 KKKKKKKKKKRNK
69 69 A T > - 0 0 53 84 29 TTTTTTTTSTTPN
70 70 A P H > S+ 0 0 55 84 43 AAPAAPPPPVPNP
71 71 A V H > S+ 0 0 97 84 76 SSTSSSSEADQQN
72 72 A E H > S+ 0 0 86 84 10 EEEEEEEQEEQQQ
73 73 A V H X S+ 0 0 0 84 1 VVVVVVVVIVVVV
74 74 A S H X S+ 0 0 33 84 26 SSFSSSSETRTSS
75 75 A H H X S+ 0 0 95 84 58 KKRRRRQNEEDEE
76 76 A R H X S+ 0 0 75 84 0 RRRRRRRRRRRRR
77 77 A F H X S+ 0 0 47 84 0 FFFFFFFFFFFFF
78 78 A R H X S+ 0 0 126 84 42 RRQRRRREKSMQQ
79 79 A E H X S+ 0 0 66 84 26 DDDDDDDDVEEQQ
80 80 A L H >X S+ 0 0 6 84 0 LLLLLLLLLLLLL
81 81 A M H 3X S+ 0 0 75 84 31 MMMMMMMMMIMKK
82 82 A Q H 3X S+ 0 0 110 84 69 RRGRRHRSERAKK
83 83 A L H