Complet list of 1xu6 hssp file
Complete list of 1xu6.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1XU6
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-13
HEADER IMMUNE SYSTEM, MEMBRANE PROTEIN 25-OCT-04 1XU6
COMPND MOL_ID: 1; MOLECULE: VARIANT SURFACE GLYCOPROTEIN MITAT 1.2; CHAIN: A;
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: TRYPANOSOMA; ORGANISM_TAXID: 5702; STR
AUTHOR A.CHATTOPADHYAY,N.G.JONES,D.NIETLISPACH,P.R.NIELSEN,H.P.VOORHEIS, H.R.
DBREF 1XU6 A 359 433 UNP P26332 VSM2_TRYBB 385 459
SEQLENGTH 80
NCHAIN 1 chain(s) in 1XU6 data set
NALIGN 53
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : B3GVD9_TRYBB 0.99 1.00 5 80 384 459 76 0 0 476 B3GVD9 Variant surface glycoprotein OS=Trypanosoma brucei brucei GN=Tb427.BES40.22 PE=4 SV=1
2 : VSM2_TRYBB 1XU6 0.99 1.00 5 80 384 459 76 0 0 476 P26332 Variant surface glycoprotein MITAT 1.2 OS=Trypanosoma brucei brucei PE=1 SV=1
3 : M4T049_9TRYP 0.49 0.72 10 80 413 483 71 0 0 503 M4T049 Variant surface glycoprotein 471 OS=Trypanosoma brucei PE=4 SV=1
4 : M4T1U0_9TRYP 0.49 0.70 1 80 406 484 81 2 3 500 M4T1U0 Variant surface glycoprotein 484 OS=Trypanosoma brucei PE=4 SV=1
5 : M4SS59_9TRYP 0.43 0.68 5 79 351 423 75 1 2 441 M4SS59 Variant surface glycoprotein 2077 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
6 : M4SXQ4_9TRYP 0.42 0.56 1 80 403 483 81 1 1 495 M4SXQ4 Variant surface glycoprotein 1957 OS=Trypanosoma brucei PE=4 SV=1
7 : Q571W6_TRYBB 0.40 0.68 10 80 372 444 73 2 2 460 Q571W6 Variant surface glycoprotein Mul 3 OS=Trypanosoma brucei brucei GN=VSG Mul 3 PE=2 SV=1
8 : M4SSM1_9TRYP 0.38 0.61 11 79 192 260 69 0 0 278 M4SSM1 Variant surface glycoprotein 2294 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
9 : S5G4X3_9TRYP 0.38 0.59 10 80 407 476 74 3 7 493 S5G4X3 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
10 : B3GVR4_TRYBB 0.37 0.59 11 80 406 472 71 2 5 489 B3GVR4 Variant surface glycoprotein OS=Trypanosoma brucei brucei GN=VSG 427-19=VSG MITat 1.19 PE=4 SV=1
11 : M4SYC7_9TRYP 0.37 0.67 4 80 395 473 79 2 2 489 M4SYC7 Variant surface glycoprotein 551 OS=Trypanosoma brucei PE=4 SV=1
12 : L7NU91_TRYEV 0.36 0.60 5 76 403 471 72 1 3 486 L7NU91 Variant surface glycoprotein OS=Trypanosoma evansi GN=VSG PE=2 SV=1
13 : M4SU11_9TRYP 0.36 0.64 13 78 212 278 67 1 1 307 M4SU11 Variant surface glycoprotein 3471 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
14 : S5FWR5_9TRYP 0.36 0.53 1 80 397 475 81 2 3 493 S5FWR5 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
15 : A4UUN1_TRYEV 0.35 0.61 5 76 396 465 72 1 2 477 A4UUN1 Variable surface glycoprotein (Fragment) OS=Trypanosoma evansi GN=VSG PE=2 SV=1
16 : M4SYF1_9TRYP 0.35 0.55 1 80 396 470 80 1 5 486 M4SYF1 Variant surface glycoprotein 576 OS=Trypanosoma brucei PE=4 SV=1
17 : Q22KT8_TRYB2 0.35 0.60 1 80 405 484 82 3 4 501 Q22KT8 Variant surface glycoprotein (VSG), putative OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=Tb11.57.0084 PE=4 SV=1
18 : Q380W1_TRYB2 0.35 0.56 1 80 400 477 80 1 2 497 Q380W1 Variant surface glycoprotein (VSG, atypical), putative OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=Tb11.14.0007 PE=4 SV=1
19 : S5G4A1_9TRYP 0.35 0.55 10 80 407 476 74 3 7 491 S5G4A1 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
20 : C9ZTZ3_TRYB9 0.34 0.63 5 74 398 467 71 2 2 547 C9ZTZ3 Variant surface glycoprotein (VSG), putative OS=Trypanosoma brucei gambiense (strain MHOM/CI/86/DAL972) GN=TbgDal_VII7550 PE=4 SV=1
21 : M4SRH1_9TRYP 0.34 0.60 16 79 126 192 67 2 3 257 M4SRH1 Variant surface glycoprotein 2200 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
22 : M4T1H2_9TRYP 0.34 0.65 5 74 398 467 71 2 2 547 M4T1H2 Variant surface glycoprotein 312 OS=Trypanosoma brucei PE=4 SV=1
23 : Q1EQ83_TRYEV 0.34 0.60 10 76 170 236 67 0 0 252 Q1EQ83 Variable surface glycoprotein (Fragment) OS=Trypanosoma evansi GN=RoTat 1.2 PE=4 SV=1
24 : Q26981_TRYEQ 0.34 0.55 9 80 378 445 73 2 6 462 Q26981 Variant surface glycoprotein (Fragment) OS=Trypanosoma equiperdum GN=VSG-78-20 PE=2 SV=1
25 : Q26984_TRYEQ 0.34 0.55 9 80 378 445 73 2 6 462 Q26984 Variant surface glycoprotein OS=Trypanosoma equiperdum GN=VSG-78 PE=2 SV=1
26 : Q57Y75_TRYB2 0.34 0.63 5 74 398 467 71 2 2 547 Q57Y75 Variant surface glycoprotein (VSG, atypical), putative OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=Tb927.7.6540 PE=4 SV=1
27 : M4SVB9_9TRYP 0.33 0.58 18 80 301 366 66 2 3 424 M4SVB9 Variant surface glycoprotein 3172 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
28 : M4SWY2_9TRYP 0.33 0.57 9 74 390 457 69 3 4 519 M4SWY2 Variant surface glycoprotein 382 OS=Trypanosoma brucei PE=4 SV=1
29 : M4SYJ1_9TRYP 0.33 0.57 11 78 389 456 69 2 2 471 M4SYJ1 Variant surface glycoprotein 622 OS=Trypanosoma brucei PE=4 SV=1
30 : M4SYY3_9TRYP 0.33 0.54 3 72 376 446 72 3 3 510 M4SYY3 Variant surface glycoprotein 708 OS=Trypanosoma brucei PE=4 SV=1
31 : M4SZE2_9TRYP 0.33 0.46 8 80 383 456 76 3 5 472 M4SZE2 Variant surface glycoprotein 1138 OS=Trypanosoma brucei PE=4 SV=1
32 : M4T7Q0_9TRYP 0.33 0.52 11 79 192 260 69 0 0 276 M4T7Q0 Variant surface glycoprotein 2185 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
33 : Q57X38_TRYB2 0.33 0.52 2 80 384 465 82 2 3 481 Q57X38 Variant surface glycoprotein (VSG), putative OS=Trypanosoma brucei brucei (strain 927/4 GUTat10.1) GN=Tb927.5.4770 PE=4 SV=1
34 : M4SU78_9TRYP 0.32 0.53 3 72 375 446 72 2 2 510 M4SU78 Variant surface glycoprotein 709 OS=Trypanosoma brucei PE=4 SV=1
35 : M4SUK8_9TRYP 0.32 0.51 11 79 108 176 69 0 0 192 M4SUK8 Variant surface glycoprotein 2296 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
36 : M4SVN1_9TRYP 0.32 0.55 1 68 393 458 69 2 4 469 M4SVN1 Variant surface glycoprotein 1155 OS=Trypanosoma brucei PE=4 SV=1
37 : M4SX75_9TRYP 0.32 0.52 11 74 398 466 69 3 5 518 M4SX75 Variant surface glycoprotein 693 OS=Trypanosoma brucei PE=4 SV=1
38 : M4SZ30_9TRYP 0.32 0.61 1 78 369 448 80 2 2 448 M4SZ30 Variant surface glycoprotein 776 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
39 : M4T8E4_9TRYP 0.32 0.54 1 79 302 377 81 3 7 410 M4T8E4 Variant surface glycoprotein 3116 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
40 : M4TDB0_9TRYP 0.32 0.51 9 80 379 453 75 2 3 504 M4TDB0 Variant surface glycoprotein 464 OS=Trypanosoma brucei PE=4 SV=1
41 : M4TDN6_9TRYP 0.32 0.57 3 78 354 432 79 2 3 478 M4TDN6 Variant surface glycoprotein 623 OS=Trypanosoma brucei PE=4 SV=1
42 : S5FV30_9TRYP 0.32 0.50 1 79 376 456 82 3 4 520 S5FV30 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
43 : S5FV34_9TRYP 0.32 0.50 1 79 376 456 82 3 4 520 S5FV34 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
44 : M4SVJ7_9TRYP 0.31 0.46 10 80 346 416 74 3 6 445 M4SVJ7 Variant surface glycoprotein 2022 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
45 : S5G525_9TRYP 0.31 0.57 1 80 401 479 81 2 3 495 S5G525 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
46 : S5G7J9_9TRYP 0.31 0.57 1 79 401 478 80 2 3 493 S5G7J9 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
47 : S5G7Q2_9TRYP 0.31 0.57 1 80 401 479 81 2 3 495 S5G7Q2 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
48 : S5G7W9_9TRYP 0.31 0.57 1 79 401 478 80 2 3 493 S5G7W9 Variant surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
49 : B3GVM3_TRYBB 0.30 0.51 1 80 390 469 83 3 6 488 B3GVM3 Variant surface glycoprotein OS=Trypanosoma brucei brucei GN=VSG 427-14 PE=4 SV=1
50 : M4T841_9TRYP 0.30 0.50 1 75 440 519 80 3 5 532 M4T841 Variant surface glycoprotein 3016 (Fragment) OS=Trypanosoma brucei PE=4 SV=1
51 : Q968K7_9TRYP 0.30 0.62 1 80 369 450 82 2 2 471 Q968K7 Variable surface glycoprotein OS=Trypanosoma brucei GN=VSG PE=2 SV=1
52 : VSIB_TRYBB 0.30 0.62 1 80 369 450 82 2 2 471 P07208 Variant surface glycoprotein ILTAT 1.1BC OS=Trypanosoma brucei brucei PE=3 SV=1
53 : VSWA_TRYBR 0.30 0.59 2 80 374 456 83 3 4 471 P20946 Variant surface glycoprotein WRATAT A OS=Trypanosoma brucei rhodesiense PE=2 SV=1
## ALIGNMENTS 1 - 53
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 354 A G 0 0 120 20 56 G G S EGA G GN EE TTTTGAGG
2 355 A S + 0 0 112 22 80 Q S Q AQD A Q VS TT TTTTQSVVE
3 356 A H - 0 0 117 25 77 S E H KSR K EQ N AR HSS KKKKNNAAQ
4 357 A M S S+ 0 0 159 26 50 M L L V LMT L LK M LL TLL LLLLKQLLT
5 358 A L - 0 0 102 34 62 FF LLL II LLALV L L L K LL A AM AAA IIIIAVAAL
6 359 A E + 0 0 159 34 71 EE AEA RD SQAAA N N N Q EK A KK DTT EEEEDLKKA
7 360 A V - 0 0 102 34 78 VV VVY AK TAAIS E E E A AQ E AT ITA EEEENTAAT
8 361 A L S S- 0 0 92 35 84 LL AEL NL EEKAE E E E TL KA L IE VNN AAAAKLIIT
9 362 A T + 0 0 16 39 70 TT DKT NK TSTDP K N TTK T TT RT K NPTAPP KKKKGQNNG
10 363 A Q + 0 0 135 45 79 QQEANPQ E EQ ADAAAEQ QENNQ G IT QA S VTNASSEIIIIPQVVT
11 364 A K S S- 0 0 112 51 70 KKPAKERKSKAS MAAAKSK KSKKK KKVAKLIKQKKVKVTTKRRRRETKKK
12 365 A H S S+ 0 0 200 51 78 HHAKRPHNATKN KKETDAK KDQQK PKKVIQVNQHEKPPSSTKKKKTPEES
13 366 A K - 0 0 71 52 74 KKMKNENKKQKVKQNKKCKE GAGGG SDQKKAKKEIAAPQQQKQQQQATAAS
14 367 A P S S+ 0 0 135 52 80 PPKAAIARKSEASEKVTSKS AKQQG PTQPRQQRKPSKTAKKPMMMMAGSSS
15 368 A A - 0 0 61 52 78 AAQCAKENTTVAPCAPCTTD DNEED HNAENNQQKTKKGTTTQPPPPDTKKP
16 369 A E + 0 0 129 53 74 EENKTQNALKSKTKSTKNLRQDKKKD TSQPAKAATEKETTDDIKKKKKQKKQ
17 370 A S S S+ 0 0 93 53 73 SSCPSnnAEIsKGPTAPGETSTAKKT KASSASTAQTeTGASSVEEEENAees
18 371 A Q S S- 0 0 130 25 76 QQK..kpT..e.S.......K.S...E.E..TEST.Sn.AG........Pnnq
19 372 A Q - 0 0 75 32 83 QQP..PTS..A.K......AATT..TAEEI.STIS.NL.PQQQ......ALLQ
20 373 A Q + 0 0 39 43 54 QQQQKQKK..N.KQK.QG.KEKK..KKQKKAKAKK.KH.QQKK.KKKK.AHHK
21 374 A A - 0 0 31 47 79 AALVALQA..IPIVV.LI.THTV..TQEDEQAKEAVQG.EQVVQQQQQQAGGA
22 375 A A - 0 0 58 50 74 AASTKSTKDTTQTTT.TQDSAST..SEEKSTKESKAVH.KQTTQTTTTQEHHS
23 376 A E - 0 0 49 54 39 EEEELEELEEEEEEEPEEEEEEEEEEAEEETLAELEQEEEMEETEEEEEKEEE
24 377 A T >> - 0 0 86 54 59 TTTTSTTSATTTTTTKTEATATTTTTAACTKSATSTTTTAPAAKRRRRSTTTA
25 378 A E H 3> S+ 0 0 119 54 29 EEEEDEDDEDDNDEDPENEDEDNDDDEENDEDEDDDEDDKEEEEDDDDEEDDE
26 379 A G H 3> S+ 0 0 48 54 54 GGEDAEEDETAEEDEDAKEEGEEAAEKATEEDAEDEKAEESDDEEEEEKKAAA
27 380 A S H X> S+ 0 0 27 54 63 SSSSDSTDNTTTTSTETENTETTTTTEKATDDKTDTETTSDSSDTTTTKDTTN
28 381 A a H 3< S+ 0 0 0 54 7 CCCCCCCFCCCCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCCCCCCCC
29 382 A N H 3< S+ 0 0 81 54 53 NNNSQNEQNEEEKNEKASNQNQEEEQNNDENKNEKQNKKNNNNNEEEENNEEN
30 383 A K H << S+ 0 0 153 54 56 KKTKDKKDATKKTKKAAKAAKAKAAAKKNKKAKKAKKATKKKKKKKKKATAAD
31 384 A K S < S- 0 0 50 54 41 KKKKKKKKKKRKNKKKRKKKKKKKKKKIKKDKIGKRIKKLQHHHKKKKKKKKK
32 385 A D > - 0 0 100 54 55 DDGGKNGKDGGEGGEKGTDGEGEGGGEDAGQTDGTGDGGQQKKEGGGGEDGGA
33 386 A Q T >4 S+ 0 0 120 54 78 QQKQGQTGKTEVTQVGAGKTKTVIITTKETSRKKGKKVTSNDDTTKTKKKVVK
34 387 A N T 34 S+ 0 0 154 54 61 NNNTDNGDEGEDGTDDGDEGEGDGGGDQCGKGDEGKDGGKNRRPGEGEQEGGE
35 388 A E T 34 S+ 0 0 69 54 59 EEEDDEdSsddKdEKTdDsdsdKaadttEdeAtdAdtdndsttdddddtpddt
36 389 A b << - 0 0 25 54 5 CCCCCCcCcccCcCCCcCccccCcccccKccCccCcccccccccccccccccc
37 390 A K > - 0 0 35 54 48 KKKTKSKKKTKEKKEKKKKTTTEKKTQQKKTKKKKKAKKTKQQAKKKKKKKKN
38 391 A S T 3 S+ 0 0 67 54 71 SSPSDSDNKSEKISKDSDKSKSKPPSKNTDDDTDDDAPADSPPEDEEEPAPPS
39 392 A P T 3 S+ 0 0 1 54 41 PPPPGPGGPPGPPPPGPGPPPPPPPPPPGGPGPGGGtPPPPPPPGGGGPnPPP
40 393 A a S < S- 0 0 2 54 4 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcCRCCCCCCCCCCcCCC
41 394 A K - 0 0 63 54 6 KKKKKKKKKKKKKKKKKKKKKKKKKKKKTKKKKKEKKKKKKKKKKKKKKKKKK
42 395 A W E -A 51 0A 84 54 87 WWWLWWWLWEWVVWVWLLWEWEVVVEWWYEWEWEEWWVVWWWWWLWVWWWEEW
43 396 A H E > -A 50 0A 73 54 76 HHNVESDKDVDVVNVDVTDVNVVEEVNDDININIIEDVVENNNEEDEDNNVVD
44 397 A N T 3 S+ 0 0 107 54 64 NNAeGPGGgESEEpESeGgEnEEGGEaketeSatSGtGdedaaaGSGSptEEk
45 398 A D T 3 S+ 0 0 132 54 63 DDEeTKETkEDAEaADeEkEeEATTEkkkntEqnETkAettttaEDEDkkEEk
46 399 A A S < S- 0 0 19 54 44 AAAGGAGGENGNNGNGGGEGDGNGGGDDDDDNDDNGDGDDDDDEGGGGDDGGD
47 400 A E S S+ 0 0 178 54 59 EEEGEEDEPGGGGGGEGEPGPGGKKGEEKQQGEQGEEDGTSAAKEGEGAPGGE
48 401 A N S S- 0 0 147 54 61 NNGKNGNNNTNAATANKNNKKKADDKKKGNAQANQNEQTNNNNNKNKNKKKKK
49 402 A K - 0 0 88 54 13 KKKKKKKKKKKKKKKKKKKKKKKAAKKKKKKKKKKKKKKKKKKKAKAKKKKKK
50 403 A K E -A 43 0A 119 54 7 KKKKKKKKKKKKKKKKKKKKKKKKKKKRRKKKKKKKRKKKKKKKKKKKKKKKR
51 404 A b E +A 42 0A 5 54 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
52 405 A T - 0 0 36 53 67 TTTKLKVVKKVTTKTVTVKTTTTTTTTT.VSVTVVVTTQSSTTSVVVVTTKKK
53 406 A L - 0 0 33 54 35 LLLLVLKVLLVLLLLVLKLLLLLLLLLLKVLVLVVVLLLLLLLLKVKVLLLLL
54 407 A D > - 0 0 31 54 29 DDDDDDDDDDDDDDDDDDDDSDDDDDSSLDDDSDDDSDDDDDDDDDDDSSDDS
55 408 A K H > S+ 0 0 161 54 65 KKKKPKLPKKQKKSKPATKKEKKEEKEKSKQKEKKKEKKSTPPHPPPPEEKKE
56 409 A E H > S+ 0 0 148 54 58 EEEENEDNDENDEKDHKYDEGEDEEEEEEAAKKAKTEEDVAKKKDTDTEEEEE
57 410 A E H > S+ 0 0 68 54 85 EEEEYAYYAEYEEAEFAFAASAEEEAGAEYTEGYEYCAEKKKKKYSYSGGEEG
58 411 A A H X S+ 0 0 22 54 61 AAAATKKTAAKAAKAVAVAKKKAAAKKKAKQAQKATKAAAAAAGKKKKKKAAK
59 412 A K H X S+ 0 0 151 54 66 KKKKREPRKKPKKEKKKKKQKQKKKQKQKPATKLTAKTTAAAARPPPPRQKKQ
60 413 A K H < S+ 0 0 144 54 56 KKKKKAKKEKPKKAKKKKETSTKKKTAAKKGKTKKKEKKEEEEKKKKKQERRA
61 414 A V H >< S+ 0 0 7 54 75 VVVVQVQQALQLVVLEQEAATALLLAVAVQTVAQVQVVNQQQQQQQQQVIVVE
62 415 A A H 3< S+ 0 0 41 54 65 AAAAAEAAAEAEAEEVAVAEEEEEEEEQAVGEEVEVQATQAQQAVEVEERAAK
63 416 A D T 3< + 0 0 119 54 50 DDEEEKEEEEEEEKEEQEEGAGEEEGNADEDGKEGEQEQTNAATEEEEKKEEE
64 417 A E S < S- 0 0 67 54 70 EEKKGAGGKKGKNAKGNGKAAAKEEAQGGRGEAREGAQKSQTTQGGGGTEQQN
65 418 A T S S+ 0 0 84 54 65 TTTAGNGGATGTQATGQGAAAATKKATTTTTNNTNGATTQAQQTGPGPEAAAQ
66 419 A A S S+ 0 0 61 54 67 AATNEQEEKEEDAADEEEKKTKDTTKDGANAKQNKEETAAAAAAEKEKQVAAE
67 420 A K S S+ 0 0 88 54 70 KKAQKEKKQQKGGIGKTKQQGQGKKQQAKDGEEDEKRKQGGAAGKEKEKNTTG
68 421 A D S S- 0 0 133 54 59 DDNEDTEDEKDKGEKDEDAEDEKKKEEGDDAKGDKKAENGTGGTDDDDDANNK
69 422 A G - 0 0 34 53 60 GGQGSDSSGGSTNETSDSGGGGTEEGTAAGQTKGT EDGGGTTESGSGGGQQD
70 423 A K + 0 0 194 53 67 KKEKKGKKKSKNDGNKKKTKAKNDDKGEKKTVGKV EGKTEGGDKKKKGAEEG
71 424 A T + 0 0 73 53 65 TTGDTKTTDKTTGKTTDTDDADTGGDVATTETATT PKFEGDDKTTTTDETTK
72 425 A G + 0 0 49 53 68 GGKEGYGGGTGTKDTGDGGGKGTKKGKTNNKTKNT QTTTAGGKGTGTSNEEA
73 426 A N S S+ 0 0 151 51 69 NNDKTETTKVTGDGGTKTKKEKGAAKNTT TNT N TTNTAAAETNTNKQGGN
74 427 A T + 0 0 91 51 55 TTGTTKTTSTTSDKSQTTNTGTSNNTDGT NTT T GNTTDTTETTTTTTKKT
75 428 A N S S+ 0 0 108 46 62 NNKNNTNNTSNNKTNNTNT A NTT K G TTN T TTDKAAKNTNTGGDDT
76 429 A T S S- 0 0 105 45 58 TTTTTITTNNTSKTSTSTA S SNN T S TGT G TGKKTTCTTTTT GGG
77 430 A T S S+ 0 0 127 42 56 TTTTTTTTTTT NN TTTG A TT A N GST S GSCETTKTSTST KKS
78 431 A G S S+ 0 0 50 42 61 GGNGENGEATG ET GTGS T TT N S SNG N SNGETTGGNGNN TTS
79 432 A S 0 0 91 38 58 SSTSSNSSGGS T SGSN T GG T DSN S SA GGKSSSST TTN
80 433 A S 0 0 167 28 59 SSTN TN SSN A NSNS SS T S N A GN N T NNS
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 354 A 0 0 0 0 0 0 0 45 10 0 5 20 0 0 0 0 0 15 5 0 20 0 0 1.496 49 0.43
2 355 A 14 0 0 0 0 0 0 0 9 0 18 27 0 0 0 0 23 5 0 5 22 0 0 1.772 59 0.20
3 356 A 0 0 0 0 0 0 0 0 12 0 16 0 0 12 8 24 8 8 12 0 25 0 0 2.005 66 0.23
4 357 A 4 58 0 15 0 0 0 0 0 0 0 12 0 0 0 8 4 0 0 0 26 0 0 1.302 43 0.49
5 358 A 6 38 18 3 6 0 0 0 26 0 0 0 0 0 0 3 0 0 0 0 34 0 0 1.566 52 0.38
6 359 A 0 3 0 0 0 0 0 0 21 0 3 6 0 0 3 15 6 26 9 9 34 0 0 2.032 67 0.28
7 360 A 15 0 6 0 0 0 3 0 26 0 3 15 0 0 0 3 3 24 3 0 34 0 0 1.941 64 0.22
8 361 A 3 23 9 0 0 0 0 0 20 0 0 6 0 0 0 9 0 23 9 0 35 0 0 1.893 63 0.16
9 362 A 0 0 0 0 0 0 0 5 3 10 3 33 0 0 3 23 3 0 13 5 39 0 0 1.882 62 0.29
10 363 A 7 0 11 0 0 0 0 2 16 4 7 7 0 0 0 0 22 13 9 2 45 0 0 2.201 73 0.20
11 364 A 6 2 2 2 0 0 0 0 12 2 8 6 0 0 10 45 2 4 0 0 51 0 0 1.884 62 0.30
12 365 A 4 0 2 0 0 0 0 0 6 10 6 8 0 10 2 27 8 8 6 4 51 0 0 2.317 77 0.21
13 366 A 2 0 2 2 0 0 0 8 13 2 4 2 2 0 0 31 19 6 6 2 52 0 0 2.133 71 0.25
14 367 A 2 0 2 8 0 0 0 4 13 13 15 6 0 0 6 17 10 4 0 0 52 0 0 2.286 76 0.19
15 368 A 2 0 0 0 0 0 0 2 13 13 0 17 6 2 0 12 8 8 10 8 52 0 0 2.302 76 0.21
16 369 A 0 4 2 0 0 0 0 0 8 2 6 13 0 0 2 32 9 9 6 8 53 0 0 2.141 71 0.25
17 370 A 2 0 2 0 0 0 0 6 15 6 23 13 2 0 0 8 2 17 6 0 53 29 7 2.172 72 0.27
18 371 A 0 0 0 0 0 0 0 4 4 8 16 12 0 0 0 12 16 16 12 0 25 0 0 2.103 70 0.24
19 372 A 0 9 6 0 0 0 0 0 16 9 9 16 0 0 0 3 22 6 3 0 32 0 0 2.141 71 0.16
20 373 A 0 0 0 0 0 0 0 2 7 0 0 0 0 7 0 53 26 2 2 0 43 0 0 1.317 43 0.46
21 374 A 15 6 6 0 0 0 0 6 19 2 0 6 0 2 0 2 23 9 0 2 47 0 0 2.180 72 0.20
22 375 A 2 0 0 0 0 0 0 0 10 0 16 32 0 6 0 12 10 8 0 4 50 0 0 1.951 65 0.25
23 376 A 0 7 0 2 0 0 0 0 4 2 0 4 0 0 0 2 2 78 0 0 54 0 0 0.928 30 0.60
24 377 A 0 0 0 0 0 0 0 0 19 2 9 54 2 0 7 6 0 2 0 0 54 0 0 1.441 48 0.41
25 378 A 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 2 0 43 7 46 54 0 0 1.061 35 0.71
26 379 A 0 0 0 0 0 0 0 7 20 0 2 4 0 0 0 9 0 43 0 15 54 0 0 1.580 52 0.45
27 380 A 0 0 0 0 0 0 0 0 2 0 19 44 0 0 0 6 0 9 6 15 54 0 0 1.571 52 0.37
28 381 A 0 0 0 0 2 0 0 2 0 0 0 0 96 0 0 0 0 0 0 0 54 0 0 0.184 6 0.92
29 382 A 0 0 0 0 0 0 0 0 2 0 4 0 0 0 0 11 11 30 41 2 54 0 0 1.484 49 0.47
30 383 A 0 0 0 0 0 0 0 0 28 0 0 9 0 0 0 56 0 0 2 6 54 0 0 1.137 37 0.43
31 384 A 0 2 6 0 0 0 0 2 0 0 0 0 0 6 6 74 2 0 2 2 54 0 0 1.073 35 0.58
32 385 A 0 0 0 0 0 0 0 44 4 0 0 6 0 0 0 9 6 13 2 17 54 0 0 1.661 55 0.44
33 386 A 11 0 4 0 0 0 0 9 2 0 4 22 0 0 2 26 11 4 2 4 54 0 0 2.103 70 0.21
34 387 A 0 0 0 0 0 0 0 33 0 2 0 4 2 0 4 6 4 17 11 19 54 0 0 1.896 63 0.38
35 388 A 0 0 0 0 0 0 0 0 7 2 9 17 0 0 0 6 0 15 2 43 54 0 37 1.666 55 0.41
36 389 A 0 0 0 0 0 0 0 0 0 0 0 0 98 0 0 2 0 0 0 0 54 0 0 0.092 3 0.94
37 390 A 0 0 0 0 0 0 0 0 4 0 2 15 0 0 0 65 7 6 2 0 54 0 0 1.187 39 0.52
38 391 A 0 0 2 0 0 0 0 0 6 17 24 4 0 0 0 13 0 9 4 22 54 0 0 1.939 64 0.28
39 392 A 0 0 0 0 0 0 0 30 0 67 0 2 0 0 0 0 0 0 2 0 54 0 2 0.778 25 0.59
40 393 A 0 0 0 0 0 0 0 0 0 0 0 0 98 0 2 0 0 0 0 0 54 0 0 0.092 3 0.96
41 394 A 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 96 0 2 0 0 54 0 0 0.184 6 0.93
42 395 A 17 9 0 0 0 54 2 0 0 0 0 0 0 0 0 0 0 19 0 0 54 0 0 1.239 41 0.13
43 396 A 26 0 7 0 0 0 0 0 0 0 2 2 0 6 0 2 0 15 20 20 54 0 0 1.856 61 0.23
44 397 A 0 0 0 0 0 0 0 22 11 6 11 7 0 0 0 4 0 28 7 4 54 0 23 1.969 65 0.35
45 398 A 0 0 0 0 0 0 0 0 11 0 0 19 0 0 0 19 2 33 4 13 54 0 0 1.696 56 0.36
46 399 A 0 0 0 0 0 0 0 41 9 0 0 0 0 0 0 0 0 6 13 31 54 0 0 1.375 45 0.55
47 400 A 0 0 0 0 0 0 0 35 6 7 2 2 0 0 0 7 6 31 0 4 54 0 0 1.708 57 0.41
48 401 A 0 0 0 0 0 0 0 6 11 0 0 6 0 0 0 28 6 2 39 4 54 0 0 1.645 54 0.38
49 402 A 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 93 0 0 0 0 54 0 0 0.264 8 0.86
50 403 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 93 0 0 0 0 54 0 0 0.264 8 0.93
51 404 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 54 1 0 0.000 0 1.00
52 405 A 26 2 0 0 0 0 0 0 0 0 8 45 0 0 0 17 2 0 0 0 53 0 0 1.356 45 0.33
53 406 A 20 70 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 54 0 0 0.792 26 0.65
54 407 A 0 2 0 0 0 0 0 0 0 0 15 0 0 0 0 0 0 0 0 83 54 0 0 0.509 16 0.71
55 408 A 0 2 0 0 0 0 0 0 2 17 6 4 0 2 0 48 4 17 0 0 54 0 0 1.575 52 0.35
56 409 A 2 0 0 0 0 0 2 2 7 0 0 6 0 2 0 15 0 43 6 17 54 0 0 1.754 58 0.41
57 410 A 0 0 0 0 4 0 17 9 19 0 6 2 2 0 0 9 0 33 0 0 54 0 0 1.848 61 0.15
58 411 A 4 0 0 0 0 0 0 2 48 0 0 6 0 0 0 37 4 0 0 0 54 0 0 1.198 40 0.39
59 412 A 0 2 0 0 0 0 0 0 11 13 0 7 0 0 7 44 11 4 0 0 54 0 0 1.695 56 0.34
60 413 A 0 0 0 0 0 0 0 2 9 2 2 7 0 0 4 57 2 15 0 0 54 0 0 1.432 47 0.43
61 414 A 31 11 2 0 0 0 0 0 13 0 0 4 0 0 0 0 31 6 2 0 54 0 0 1.667 55 0.25
62 415 A 13 0 0 0 0 0 0 2 35 0 0 2 0 0 2 2 9 35 0 0 54 0 0 1.516 50 0.35
63 416 A 0 0 0 0 0 0 0 9 7 0 0 4 0 0 0 9 6 52 4 9 54 0 0 1.599 53 0.49
64 417 A 0 0 0 0 0 0 0 26 15 0 2 6 0 0 4 17 11 15 6 0 54 0 0 1.976 65 0.29
65 418 A 0 0 0 0 0 0 0 17 24 4 0 31 0 0 0 4 11 2 7 0 54 0 0 1.760 58 0.34
66 419 A 2 0 0 0 0 0 0 2 28 0 0 9 0 0 0 17 6 24 6 7 54 0 0 1.879 62 0.32
67 420 A 0 0 2 0 0 0 0 19 7 0 0 6 0 0 2 31 17 11 2 4 54 0 0 1.916 63 0.29
68 421 A 0 0 0 0 0 0 0 11 7 0 0 6 0 0 0 19 0 19 7 31 54 0 0 1.779 59 0.41
69 422 A 0 0 0 0 0 0 0 38 4 0 15 15 0 0 0 2 8 9 2 8 53 0 0 1.825 60 0.39
70 423 A 4 0 0 0 0 0 0 17 4 0 2 6 0 0 0 43 0 11 6 8 53 0 0 1.752 58 0.32
71 424 A 2 0 0 0 2 0 0 9 6 2 0 43 0 0 0 11 0 6 0 19 53 0 0 1.696 56 0.34
72 425 A 0 0 0 0 0 0 2 34 4 0 2 23 0 0 0 17 2 6 8 4 53 0 0 1.834 61 0.31
73 426 A 2 0 0 0 0 0 0 12 10 0 0 29 0 0 0 16 2 6 20 4 51 0 0 1.897 63 0.31
74 427 A 0 0 0 0 0 0 0 8 0 0 8 57 0 0 0 8 2 2 10 6 51 0 0 1.468 49 0.45
75 428 A 0 0 0 0 0 0 0 7 7 0 2 33 0 0 0 11 0 0 35 7 46 0 0 1.591 53 0.37
76 429 A 0 0 2 0 0 0 0 13 2 0 13 51 2 0 0 7 0 0 9 0 45 0 0 1.530 51 0.41
77 430 A 0 0 0 0 0 0 0 7 5 0 14 55 2 0 0 7 0 2 7 0 42 0 0 1.496 49 0.44
78 431 A 0 0 0 0 0 0 0 31 2 0 12 24 0 0 0 0 0 10 21 0 42 0 0 1.601 53 0.39
79 432 A 0 0 0 0 0 0 0 18 3 0 45 18 0 0 0 3 0 0 11 3 38 0 0 1.507 50 0.42
80 433 A 0 0 0 0 0 0 0 4 7 0 39 14 0 0 0 0 0 0 36 0 28 0 0 1.320 44 0.41
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
4 43 448 1 eEe
6 18 420 1 nCk
7 9 380 1 nAp
7 27 399 1 dGc
9 23 429 1 sDc
9 32 439 2 gEAk
10 22 427 1 dKc
11 15 409 1 sKe
11 33 428 1 dNc
13 24 235 1 dNc
14 43 439 1 pEa
17 34 438 1 dDc
17 43 448 1 eEe
19 23 429 1 sDc
19 32 439 2 gEAk
20 31 428 1 dKc
21 21 146 1 sEc
21 30 156 2 nKAe
22 31 428 1 dKc
24 23 400 1 aEc
25 23 400 1 aEc
26 31 428 1 dKc
27 19 319 1 tEc
27 28 329 2 aEAk
28 27 416 1 tDc
28 36 426 2 kEEk
29 35 423 1 eAk
30 33 408 1 dNc
30 42 418 1 tEn
31 27 409 1 eKc
31 36 419 2 eNSt
33 35 418 1 tEc
33 44 428 2 aEAq
34 34 408 1 dNc
34 43 418 1 tEn
36 33 425 1 dNc
37 26 423 1 tDc
37 30 428 2 tPKc
37 35 435 2 tEEk
38 18 386 1 eCn
38 36 405 1 dNc
39 31 332 1 nGc
39 40 342 1 dEe
40 28 406 1 dKc
40 37 416 2 eKAt
41 34 387 1 sSc
41 43 397 2 dKAt
42 35 410 1 tTc
42 44 420 2 aNAt
43 35 410 1 tTc
43 44 420 2 aNAt
44 24 369 1 dKc
44 33 379 2 aNAa
45 34 434 1 dEc
46 34 434 1 dNc
47 34 434 1 dNc
48 34 434 1 dNc
49 33 422 1 tDc
49 42 432 2 pEEk
50 36 475 1 pEc
50 40 480 2 nPKc
50 45 487 2 tEAk
51 18 386 1 eCn
51 36 405 1 dNc
52 18 386 1 eCn
52 36 405 1 dNc
53 17 390 1 sTq
53 35 409 1 tEc
53 44 419 2 kEEk
//