Complet list of 1wf9 hssp file
Complete list of 1wf9.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1WF9
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-11
HEADER STRUCTURAL GENOMICS, UNKNOWN FUNCTION 26-MAY-04 1WF9
COMPND MOL_ID: 1; MOLECULE: NPL4 FAMILY PROTEIN; CHAIN: A; FRAGMENT: HYPOTHET
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: ARABIDOPSIS THALIANA; ORGANISM_COMMON:
AUTHOR X.QIN,F.HAYASHI,S.YOKOYAMA,RIKEN STRUCTURAL GENOMICS/PROTEOMICS INITIA
DBREF 1WF9 A 8 101 UNP Q9LYC2 Q9LYC2_ARATH 2 95
SEQLENGTH 107
NCHAIN 1 chain(s) in 1WF9 data set
NALIGN 51
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : NPL41_ARATH 1WF9 0.97 0.99 8 104 2 98 97 0 0 413 Q9LYC2 NPL4-like protein 1 OS=Arabidopsis thaliana GN=At3g63000 PE=1 SV=1
2 : D7LTC0_ARALL 0.91 0.99 8 104 2 98 97 0 0 413 D7LTC0 NPL4 family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_486804 PE=4 SV=1
3 : R0H4B1_9BRAS 0.90 0.98 8 104 2 98 97 0 0 413 R0H4B1 Uncharacterized protein OS=Capsella rubella GN=CARUB_v10017347mg PE=4 SV=1
4 : M4FH41_BRARP 0.82 0.95 9 104 3 98 96 0 0 413 M4FH41 Uncharacterized protein OS=Brassica rapa subsp. pekinensis GN=BRA040419 PE=4 SV=1
5 : V4L6Y2_THESL 0.81 0.98 9 104 3 98 96 0 0 413 V4L6Y2 Uncharacterized protein OS=Thellungiella salsuginea GN=EUTSA_v10001466mg PE=4 SV=1
6 : NPL42_ARATH 0.76 0.95 9 104 3 98 96 0 0 413 O82264 NPL4-like protein 2 OS=Arabidopsis thaliana GN=At2g47970 PE=1 SV=1
7 : D7LHE3_ARALL 0.74 0.96 9 104 2 97 96 0 0 412 D7LHE3 NPL4 family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_483989 PE=4 SV=1
8 : B9S3Y6_RICCO 0.70 0.93 9 107 2 100 99 0 0 390 B9S3Y6 Nuclear protein localization, putative OS=Ricinus communis GN=RCOM_0557110 PE=4 SV=1
9 : B9I9Y7_POPTR 0.69 0.93 9 107 2 100 99 0 0 411 B9I9Y7 Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s13120g PE=4 SV=2
10 : A9PCA9_POPTR 0.67 0.92 9 107 3 101 99 0 0 411 A9PCA9 NPL4 family protein OS=Populus trichocarpa GN=POPTR_0002s22770g PE=2 SV=1
11 : M5VVX8_PRUPE 0.67 0.86 9 107 2 101 100 1 1 412 M5VVX8 Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006427mg PE=4 SV=1
12 : G8A181_MEDTR 0.65 0.90 9 104 2 98 97 1 1 414 G8A181 NPL4-like protein OS=Medicago truncatula GN=MTR_118s0004 PE=4 SV=1
13 : V4S3Y2_9ROSI 0.65 0.94 9 107 2 100 99 0 0 411 V4S3Y2 Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011862mg PE=4 SV=1
14 : I1JQF4_SOYBN 0.64 0.83 9 107 2 101 100 1 1 413 I1JQF4 Uncharacterized protein OS=Glycine max PE=4 SV=1
15 : I1NB01_SOYBN 0.64 0.84 9 107 2 101 100 1 1 413 I1NB01 Uncharacterized protein OS=Glycine max PE=4 SV=1
16 : M1C800_SOLTU 0.64 0.87 9 107 2 100 99 0 0 411 M1C800 Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400024040 PE=4 SV=1
17 : U5D8X0_AMBTC 0.63 0.86 9 107 2 100 99 0 0 412 U5D8X0 Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s00057p00124520 PE=4 SV=1
18 : V7D1G5_PHAVU 0.62 0.83 9 107 2 101 100 1 1 412 V7D1G5 Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G201100g PE=4 SV=1
19 : A5BEN5_VITVI 0.60 0.87 9 107 2 100 99 0 0 1616 A5BEN5 Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016664 PE=4 SV=1
20 : F6HQ30_VITVI 0.60 0.87 9 107 2 100 99 0 0 411 F6HQ30 Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g01080 PE=4 SV=1
21 : S8E763_9LAMI 0.60 0.83 9 103 2 96 95 0 0 337 S8E763 Uncharacterized protein (Fragment) OS=Genlisea aurea GN=M569_06509 PE=4 SV=1
22 : H9MAJ2_PINRA 0.59 0.81 9 93 2 85 85 1 1 85 H9MAJ2 Uncharacterized protein (Fragment) OS=Pinus radiata GN=2_792_01 PE=4 SV=1
23 : H9WY59_PINTA 0.59 0.81 9 93 2 85 85 1 1 85 H9WY59 Uncharacterized protein (Fragment) OS=Pinus taeda GN=2_792_01 PE=4 SV=1
24 : K4CAU9_SOLLC 0.59 0.85 9 107 2 100 99 0 0 411 K4CAU9 Uncharacterized protein OS=Solanum lycopersicum GN=Solyc06g084440.2 PE=4 SV=1
25 : D5A9C1_PICSI 0.56 0.77 9 105 2 97 97 1 1 413 D5A9C1 Putative uncharacterized protein OS=Picea sitchensis PE=2 SV=1
26 : D8T0J4_SELML 0.44 0.73 6 107 2 103 103 2 2 411 D8T0J4 Putative uncharacterized protein OS=Selaginella moellendorffii GN=SELMODRAFT_129073 PE=4 SV=1
27 : D8TEQ7_SELML 0.44 0.73 6 107 2 103 103 2 2 411 D8TEQ7 Putative uncharacterized protein OS=Selaginella moellendorffii GN=SELMODRAFT_187743 PE=4 SV=1
28 : A9SAL9_PHYPA 0.43 0.73 9 106 2 98 98 1 1 409 A9SAL9 Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_163834 PE=4 SV=1
29 : M0S4I6_MUSAM 0.42 0.65 9 95 2 84 88 2 6 395 M0S4I6 Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1
30 : M0RS05_MUSAM 0.40 0.66 9 107 2 96 100 2 6 395 M0RS05 Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1
31 : D8UI34_VOLCA 0.39 0.62 9 107 2 96 99 2 4 421 D8UI34 Putative uncharacterized protein OS=Volvox carteri GN=VOLCADRAFT_69322 PE=4 SV=1
32 : A2WQ74_ORYSI 0.38 0.64 9 107 2 97 100 2 5 415 A2WQ74 Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_02003 PE=4 SV=1
33 : A4RW87_OSTLU 0.38 0.73 9 98 2 89 91 2 4 416 A4RW87 Predicted protein OS=Ostreococcus lucimarinus (strain CCE9901) GN=OSTLU_31250 PE=4 SV=1
34 : B9EWV4_ORYSJ 0.38 0.64 9 107 2 97 100 2 5 415 B9EWV4 Uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_01845 PE=4 SV=1
35 : I1NVV7_ORYGL 0.38 0.64 9 107 2 97 100 2 5 415 I1NVV7 Uncharacterized protein OS=Oryza glaberrima PE=4 SV=1
36 : NPL4_ORYSJ 0.38 0.64 9 107 2 97 100 2 5 415 Q9AS33 NPL4-like protein OS=Oryza sativa subsp. japonica GN=Os01g0377700 PE=2 SV=1
37 : Q0JMN0_ORYSJ 0.38 0.64 9 107 2 97 100 2 5 415 Q0JMN0 Os01g0377700 protein OS=Oryza sativa subsp. japonica GN=Os01g0377700 PE=4 SV=1
38 : K3XI98_SETIT 0.37 0.65 9 107 2 97 100 2 5 416 K3XI98 Uncharacterized protein OS=Setaria italica GN=Si001620m.g PE=4 SV=1
39 : Q01AR0_OSTTA 0.37 0.63 9 94 2 85 87 2 4 412 Q01AR0 Nuclear pore complex, rNpl4 component (Sc Npl4) (ISS) OS=Ostreococcus tauri GN=Ot04g03400 PE=4 SV=1
40 : C1MXP3_MICPC 0.36 0.66 9 106 4 102 100 3 3 432 C1MXP3 Predicted protein OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_69159 PE=4 SV=1
41 : C5XR81_SORBI 0.36 0.65 9 107 2 98 100 2 4 417 C5XR81 Putative uncharacterized protein Sb03g041070 OS=Sorghum bicolor GN=Sb03g041070 PE=4 SV=1
42 : I0YKB3_9CHLO 0.36 0.64 9 107 2 96 99 2 4 412 I0YKB3 NPL4-domain-containing protein OS=Coccomyxa subellipsoidea C-169 GN=COCSUDRAFT_49073 PE=4 SV=1
43 : B4FSQ0_MAIZE 0.35 0.65 9 106 2 97 99 2 4 417 B4FSQ0 Uncharacterized protein OS=Zea mays PE=2 SV=1
44 : B6TLC9_MAIZE 0.35 0.65 9 106 2 97 99 2 4 417 B6TLC9 NPL4 family protein OS=Zea mays PE=2 SV=1
45 : C1EDX8_MICSR 0.35 0.66 9 100 2 93 93 2 2 395 C1EDX8 Predicted protein OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_62427 PE=4 SV=1
46 : B4FHK5_MAIZE 0.34 0.66 9 107 2 98 100 2 4 417 B4FHK5 NPL4 family OS=Zea mays GN=ZEAMMB73_599617 PE=2 SV=1
47 : F2DQH9_HORVD 0.34 0.61 9 107 2 96 100 2 6 415 F2DQH9 Predicted protein OS=Hordeum vulgare var. distichum PE=2 SV=1
48 : I1HFE5_BRADI 0.34 0.62 9 107 2 96 100 2 6 415 I1HFE5 Uncharacterized protein OS=Brachypodium distachyon GN=BRADI2G13180 PE=4 SV=1
49 : W5GGV4_WHEAT 0.33 0.61 9 107 2 96 100 2 6 415 W5GGV4 Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
50 : K8EHQ2_9CHLO 0.30 0.53 9 95 3 105 103 3 16 499 K8EHQ2 Uncharacterized protein OS=Bathycoccus prasinos GN=Bathy08g02590 PE=4 SV=1
51 : R1FV81_EMIHU 0.30 0.61 9 97 2 86 89 2 4 293 R1FV81 Uncharacterized protein OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_454904 PE=4 SV=1
## ALIGNMENTS 1 - 51
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 136 1 0
2 2 A S - 0 0 109 1 0
3 3 A S S S+ 0 0 129 1 0
4 4 A G S S+ 0 0 76 1 0
5 5 A S - 0 0 103 1 0
6 6 A S - 0 0 108 3 73 AA
7 7 A G - 0 0 49 3 53 AA
8 8 A T E -A 23 0A 27 6 66 TTT AA
9 9 A M E -A 22 0A 37 52 34 MMMMMMMMMMMMLMMMMMLLMLLMLVVLIILIIIIIIIILILIILIIIILI
10 10 A L E -A 21 0A 7 52 25 LLLLLLLIIVLLLLLIILIIIIIIIVVILLLLVLLLLLVLLILLILLLLLL
11 11 A R E -Ab 20 88A 132 52 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
12 12 A V E -Ab 19 89A 5 52 16 VIIVIIIIIIVIIIIIVLIIIVVIVIILIILIVIIIIIVIILIILIIIIII
13 13 A R E +Ab 18 90A 109 52 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
14 14 A S - 0 0 6 52 20 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSSSSSTTSSSSTSSSSTT
15 15 A R S S+ 0 0 206 52 2 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRK
16 16 A D S S- 0 0 109 52 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
17 17 A G - 0 0 30 52 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
18 18 A L E -A 13 0A 76 52 60 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLMLTTTTTTTTTTLTTTTTTTTT
19 19 A E E -A 12 0A 61 52 15 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEDDDDDEEDEDDEDDDDEE
20 20 A R E -A 11 0A 167 52 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
21 21 A V E -A 10 0A 2 52 20 VVVVVVVVVVVVVVVVVVLLVVVVVVVVVVIIIIIIIIVLIIIIVIIIILI
22 22 A S E +A 9 0A 62 52 60 SSSSTTTSSSTTTSSTSMTTLKKTKSSTTAETTTTTTTETTQSSETTTTET
23 23 A V E -A 8 0A 6 52 29 VVVVAAAVIIVLVVVIVVLLIVVIVVVLVVVVVVVVVVIVVLVVVVVVVCA
24 24 A D - 0 0 85 52 61 DDDDDEEEDDDDDEEDDEDDEDDDDHHAPPDPPPPPPPPDPPPPDPPPPAD
25 25 A G > - 0 0 6 52 57 GGGGGGGNNNNNGNNNNNNNNKKNRPPNDDDDSDDDDDSADEDDDDDDDDA
26 26 A P T 3 S+ 0 0 66 52 61 PPPSAAAPPTppAppPLpPPTPPPPKKNppGpSpppppANsHssKsagagS
27 27 A H T 3 S+ 0 0 129 44 78 HHHDHHNNNNhnHsaHNaHHQNNHS...at.aDaaaasA.s.ss.sasav.
28 28 A I S < S- 0 0 2 52 65 IIITIIIIIITKVTTAPTAAAPPAAAAAAAAAIAAAAASCAAAACVAAAKT
29 29 A T B >> -D 77 0B 17 52 4 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTTTTTTTTTTTTTTTTTTTT
30 30 A V H 3> S+ 0 0 0 52 14 VVVVVVVVIVVVVVVIIVIIVVVIVVVVVVLVVVVVVVVLVVVVLVVVVIL
31 31 A S H 3> S+ 0 0 43 52 56 SSSSSSSSSSSSASSSSLSSGGGAGLLGAASGAGGGGAGRAAAASAGGGSS
32 32 A Q H <> S+ 0 0 98 52 52 QQQQQQQQQQQDEYDQSGQQEDDQEEEDTTADKDDDDDEQDEDDQDDEDDR
33 33 A L H >X S+ 0 0 10 52 5 LLLLLLLLFLLLLLLLLLLLLLLLFLLLILLLLLLLLLLLLLLLLLLLLVL
34 34 A K H 3X S+ 0 0 18 52 48 KKKKKKKKKKKKKKKKKKKKKKKKKQQQRCKQKQQQQQKRQKQQRQQQQQR
35 35 A T H 3X S+ 0 0 57 52 82 TTTTTTTTTTATTRRSTRTTLSSSSMMASSARERRRRRDERKRRTRRRRDE
36 36 A L H > - 0 0 11 52 0 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
45 45 A I T 34 S+ 0 0 55 52 44 IIIILLLIIIFTVVVVIVIIVIIVIVVVAALVKVVVVVRQVVVVTVVVVVV
46 46 A H T 34 S+ 0 0 158 52 77 HHHQHHHQHRQHEHHQAQSSHSSQPSSSATDPDPPPPTDEPEPPSPATAEG
47 47 A N T <4 S+ 0 0 19 52 87 NNNNSKKNNSNNSNNSQNSSASSSSAASAVDLELLLLLDDLDLLDLALANE
48 48 A Q < - 0 0 6 52 10 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQMQQQQQQQQQQIQQQQQQQQQ
49 49 A T - 0 0 24 52 80 TTTTTTTTTTTTTTTTTTTTVIITIYYLTTLRIRRRRRITRTRRRRRRRSS
50 50 A L + 0 0 5 52 8 LLLLLLLLLLILLLLLLLLLLLLLLVVLLLLLLLLLLLLLLLIILLLLLLL
51 51 A S B -C 90 0A 2 51 0 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS.
52 52 A T S S+ 0 0 58 51 76 TTTTTTTTTTTTTTTATTTTTTTSTRRRLLKLTLLLLLTLLRLLLLLLLL.
53 53 A N S > S- 0 0 71 51 42 NNNDNNNNNNNNNNNNNNNNNSSNNNNNDDQDSDDDDESNDNDDDDDDDD.
54 54 A R G > S+ 0 0 163 52 63 RRRRRRRQQQQQQQQQQQQQQQQQQQQQPHPPLPPPPPPQPKPPQPPPPPC
55 55 A N G >> S+ 0 0 72 52 61 NNNNDDDNNSNNNNNNNNNNNNNNNSSANNGADAAAAADGADAAGAAAASR
56 56 A L G <4 S+ 0 0 1 52 10 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVLLLLLVLLLLLLLLLLLS
57 57 A L G <4 S+ 0 0 54 52 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLE
58 58 A L T <4 S+ 0 0 110 52 24 LLLLLLLLLLLLLLLLLLLLLLLLLMMLLLTLTLLLLLTLLTLLMLLLLLQ
59 59 A A < + 0 0 7 52 51 AAAAAAAAAAAAAAAAAAAAAAAAAAASPPSPSPPPPPSAPAPPSPPPPRR
60 60 A K + 0 0 186 52 56 KKKKKKKKKKKKKKKKKKKKKKKKKKKKSSKSKSSSSSKNNLNNKNAAATG
61 61 A S S > S- 0 0 73 51 67 STTSTTTSSSTSSSSTTSTTS.GTGDDDKKESTSSSSPDAPDPPDPPAPSP
62 62 A P H > S+ 0 0 81 51 65 PPPPPPPPPPHLPLLPPRPPPG.PNPPPAAPAPAAAASPRSKSSPSQQQSV
63 63 A S H 4 S+ 0 0 90 52 69 SASESAASPSDQSEEDEESSSGGDNSSVQQYSESSSSASAASAAHAPPPEP
64 64 A D H >4 S+ 0 0 74 52 65 DDDEDDDDDDDDDDDDQDDDENNDTggNTSSAEAAAAVAeAGAAaVPQPnT
65 65 A F H >< S+ 0 0 35 34 83 FFFSLLLVLLILLLLWALIIIIVWQaaGP..........f....d....tK
66 66 A L T 3< S+ 0 0 121 34 86 LLLLLLLLLLSLLHLSLHAASQQSQLLKA..........D....D....KG
67 67 A A T < S+ 0 0 72 34 86 AGAAAAAKQKRLQRRRQRRRRQQRFMMEL..........D....L....KA
68 68 A F < + 0 0 34 42 63 FFFFFFFFFFFFFFFFFFFFFFFFKAAFD.F.F.....FLVFVVKV...EA
69 69 A T + 0 0 105 48 71 TTTTTTTTTTTTTTTTSTTTTKKTDAAAP.RAKAAAAPLAPEPPPP...VF
70 70 A D S S+ 0 0 42 51 75 DDDDDDDDDDDDDDDDDDDDDDDD.DDDSPDLDLLLLLDPLDLLGLLLLVG
71 71 A M S S+ 0 0 3 52 27 MMMMLLLMMMMMMMMMMMMMMMMMMMMMVALLMLLLLLMALLLLRLLLLVA
72 72 A A + 0 0 55 52 68 AATTSTTSSSASASSAAASSAEEDERRQRLAAAAAAAASRASAAKTSSSLS
73 73 A D > - 0 0 84 52 40 DDDDDDDNNNNNNNNNDNDDDDDNDDDDLDIDNDDDDDDADRDDNDDDDSD
74 74 A P T 3 S+ 0 0 65 52 34 PPPPPPPPPPPPPPLPPPPPPPPPPPPSSPNPGPPPPPDKPPPPDSPPPPD
75 75 A N T 3 S+ 0 0 119 52 70 NNNNNNNNSDNNDDDHQDQQSNNHNSSGSADARAAAAAGDAGAAGAAAARA
76 76 A L < - 0 0 34 52 72 LLLLLLLTTTTTRAATTVTTAMMTIAAALTAAAAAAAADATTAAKAATAKQ
77 77 A R B > -D 29 0B 138 51 57 RRPPRPPPLPPPPPSPLTPPGLLPPAAT.PSPKPPPPQAKPPPPPPPPPTP
78 78 A I G >> S+ 0 0 2 51 16 IIIILLLLLLLLLLLIILIILLLILLLL.LLLILLLLLLLLLLLLLLLLLL
79 79 A S G 34 S+ 0 0 57 51 50 SSSSSSSSSSSTSSSSSSAASSSSSSSS.SRSASSSSAEVAVAAKAAAAKS
80 80 A S G <4 S+ 0 0 80 51 25 SSSSSSSSSSAASSSSSSAAFSSSSSSS.SSSsSSSSSsrSKSSSSSSSSA
81 81 A L T <4 S- 0 0 47 51 18 LLLLLLLLLLLLLLLLLLLLLLLFLLLL.LQLgLLLLLglLLLLLLLLLLL
82 82 A N < - 0 0 122 52 78 NNNSNNNNNSNNNNNNGNGGGGGNGGGGPPGRIRRRRRVGRGRRKRPPPQG
83 83 A L - 0 0 24 52 34 LLLLLLLIIILLILLLILVVIIILIIIILIVLDLLLLLKILILLILLLLLL
84 84 A A > - 0 0 60 52 60 AASSGGGTSSSSSGGTGATTSDDTGGGAAGSSTSSSSAQDAQAAEAAAAKA
85 85 A H T 3 S+ 0 0 116 52 37 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHNNNNNNNNHNHNNHNNNNHS
86 86 A G T 3 S+ 0 0 47 52 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
87 87 A S < - 0 0 35 52 45 SSSSSSSSSSSSSSSSSSSSSHHSHSSSSSDSASSSSAASADAATASSSDE
88 88 A M E +b 11 0A 71 52 42 MMMIMMMIILIIIMIMMIIIIIIIIVVLIIMFVFFFFFIIFVFFIFFFFII
89 89 A V E -b 12 0A 5 52 21 VLLLLLLLIIVIVVVLIVVVVVVLVVVLLVVVVVVVVVVVVVIIVVVVVVL
90 90 A Y E -bC 13 51A 31 52 11 YFYYYYFFFFYFFFFYFFYYFFFYFFFYYYYYHYYYYYHYYFYYYYYYYYF
91 91 A L - 0 0 2 52 6 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLALLMLLMLLLLML
92 92 A A + 0 0 8 52 67 AAAAAAATAAAASAAASTAAASSASFFFSSLSMSSSSARSAHAATASSSTE
93 93 A Y - 0 0 31 52 6 YYYYYYYYYYYYYYYYYYHHYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
94 94 A E S S- 0 0 112 50 69 EEEEEDDDEDDEDEEEEEDDE QESSSPPSPGPPPPPTSPPPPSPPPPPQ
95 95 A G S S- 0 0 77 49 69 GGGGGGGGGGGGGGGGGGTTG GGGGGAAFPCPPPPP VPFPPVPPPPDR
96 96 A E - 0 0 137 47 34 EEEEEEEEHEEEEEEEEEQQE EEKKN DEDEDDDDD EDEDDEDGDG D
97 97 A R - 0 0 109 47 64 RRRRRRRRRRRRRRRRRRRRR RRRRR LRARAAAAA RARAARAAAA R
98 98 A T - 0 0 69 46 79 TTTTTSSTTTTTNRRTTHTTT TSEEV RQRERRRRR ERERRKRRRR
99 99 A I - 0 0 150 45 69 IIIIIIIIIIVVVVVIVVVVV IIVVV RVS SSSSS PSVSSVSSSS
100 100 A R + 0 0 191 45 65 RRRRRPPAAAAQRAAAAASSS PAAAK AES SSSSA TAAAATAASA
101 101 A G + 0 0 69 44 64 GGGGGGGGGGGGGGGGGGGGG GGGGG APQ QQQQR PRSRR RQQQ
102 102 A S - 0 0 122 44 35 GGGGGAAPPPPPPPPPPPPPP PPPPP AAP PPPPP TPTPP PPPP
103 103 A G - 0 0 68 44 63 PPPPPPPAAATASAAASATTA TQKKQ AVP PPPPP APVTT PPPP
104 104 A P + 0 0 127 43 81 AAAAAPPVVVFAFFFVVFFF VIVVV PRP PPPPP APPPP PPPA
105 105 A S - 0 0 118 35 77 NHRH NNNQKNSS QNTTT TPP PPPPP APRPP PPPP
106 106 A S 0 0 131 34 67 PPPP PPPPPPPP P PPP PAK KKKKK SKTKK KRKR
107 107 A G 0 0 130 30 22 AAAA AAAAAAAA A SS PGA AAAAA AA AAAA
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
2 2 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
3 3 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
4 4 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
5 5 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
6 6 A 0 0 0 0 0 0 0 0 67 0 33 0 0 0 0 0 0 0 0 0 3 0 0 0.637 21 0.27
7 7 A 0 0 0 0 0 0 0 33 67 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0.637 21 0.47
8 8 A 0 0 0 0 0 0 0 0 33 0 0 67 0 0 0 0 0 0 0 0 6 0 0 0.637 21 0.33
9 9 A 4 23 35 38 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 1.198 40 0.65
10 10 A 10 63 27 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.867 28 0.75
11 11 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 52 0 0 0.000 0 1.00
12 12 A 19 10 71 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.784 26 0.83
13 13 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 52 0 0 0.000 0 1.00
14 14 A 0 0 0 0 0 0 0 0 0 0 88 12 0 0 0 0 0 0 0 0 52 0 0 0.358 11 0.80
15 15 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 98 2 0 0 0 0 52 0 0 0.095 3 0.98
16 16 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 52 0 0 0.000 0 1.00
17 17 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.000 0 1.00
18 18 A 0 62 0 2 0 0 0 0 0 0 0 37 0 0 0 0 0 0 0 0 52 0 0 0.743 24 0.39
19 19 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 75 0 25 52 0 0 0.562 18 0.85
20 20 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 52 0 0 0.000 0 1.00
21 21 A 60 8 33 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.871 29 0.80
22 22 A 0 2 0 2 0 0 0 0 2 0 29 50 0 0 0 6 2 8 0 0 52 0 0 1.371 45 0.40
23 23 A 69 10 12 0 0 0 0 0 8 0 0 0 2 0 0 0 0 0 0 0 52 0 0 1.002 33 0.70
24 24 A 0 0 0 0 0 0 0 0 4 35 0 0 0 4 0 0 0 13 0 44 52 0 0 1.249 41 0.39
25 25 A 0 0 0 0 0 0 0 17 4 4 4 0 0 0 2 4 0 2 29 35 52 0 0 1.683 56 0.42
26 26 A 0 2 0 0 0 0 0 6 13 50 13 4 0 2 0 6 0 0 4 0 52 8 21 1.618 54 0.39
27 27 A 2 0 0 0 0 0 0 0 25 0 18 2 0 27 0 0 2 0 18 5 44 0 0 1.719 57 0.21
28 28 A 4 0 21 0 0 0 0 0 48 6 2 12 4 0 0 4 0 0 0 0 52 0 0 1.546 51 0.34
29 29 A 0 0 0 0 0 0 0 0 0 0 0 98 0 0 0 0 0 0 2 0 52 0 0 0.095 3 0.96
30 30 A 79 8 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.655 21 0.86
31 31 A 0 6 0 0 0 0 0 27 21 0 44 0 0 0 2 0 0 0 0 0 52 0 0 1.283 42 0.44
32 32 A 0 0 0 0 0 0 2 2 2 0 2 4 0 0 2 2 35 15 0 35 52 0 0 1.604 53 0.48
33 33 A 2 92 2 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.351 11 0.94
34 34 A 0 0 0 0 0 0 0 0 0 0 0 0 2 0 8 58 33 0 0 0 52 0 0 0.956 31 0.52
35 35 A 0 2 0 4 0 0 0 0 6 0 13 33 0 0 31 2 0 6 0 4 52 0 0 1.730 57 0.18
36 36 A 2 67 10 0 0 0 0 0 10 0 2 0 0 0 0 2 8 0 0 0 52 0 0 1.142 38 0.48
37 37 A 0 2 98 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.095 3 0.98
38 38 A 0 0 0 0 0 0 0 2 15 0 2 0 0 0 0 0 31 46 4 0 52 0 0 1.285 42 0.51
39 39 A 0 0 0 0 0 0 0 0 21 0 27 10 0 0 6 4 4 4 8 17 52 0 0 1.948 65 0.27
40 40 A 0 0 0 0 0 0 0 0 0 0 0 0 0 13 17 4 50 12 4 0 52 0 0 1.420 47 0.48
41 41 A 12 79 0 0 2 0 2 0 4 2 0 0 0 0 0 0 0 0 0 0 52 0 1 0.790 26 0.72
42 42 A 0 0 0 0 0 0 0 15 2 0 2 25 0 4 15 4 29 0 4 0 52 0 0 1.809 60 0.19
43 43 A 60 0 40 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.675 22 0.86
44 44 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 52 0 0 0.000 0 1.00
45 45 A 50 8 27 0 2 0 0 0 4 0 0 4 0 0 2 2 2 0 0 0 52 0 0 1.452 48 0.55
46 46 A 0 0 0 0 0 0 0 2 8 19 15 6 0 23 2 0 12 8 0 6 52 0 0 2.068 69 0.22
47 47 A 2 21 0 0 0 0 0 0 12 0 21 0 0 0 0 4 2 4 25 10 52 0 0 1.881 62 0.12
48 48 A 0 0 2 2 0 0 0 0 0 0 0 0 0 0 0 0 96 0 0 0 52 0 0 0.190 6 0.90
49 49 A 2 4 10 0 0 0 4 0 0 0 4 50 0 0 27 0 0 0 0 0 52 0 0 1.377 45 0.20
50 50 A 4 90 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 1 0 0.381 12 0.91
51 51 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
52 52 A 0 35 0 0 0 0 0 0 2 0 2 51 0 0 8 2 0 0 0 0 51 0 0 1.142 38 0.23
53 53 A 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 2 2 55 33 51 0 0 1.049 35 0.58
54 54 A 0 2 0 0 0 0 0 0 0 33 0 0 2 2 15 2 44 0 0 0 52 0 0 1.318 44 0.36
55 55 A 0 0 0 0 0 0 0 6 27 0 8 0 0 0 2 0 0 0 46 12 52 0 0 1.397 46 0.38
56 56 A 4 94 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 52 0 0 0.257 8 0.90
57 57 A 0 98 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 52 0 0 0.095 3 0.94
58 58 A 0 85 0 6 0 0 0 0 0 0 0 8 0 0 0 0 2 0 0 0 52 0 0 0.579 19 0.75
59 59 A 0 0 0 0 0 0 0 0 58 29 10 0 0 0 4 0 0 0 0 0 52 0 0 1.026 34 0.48
60 60 A 0 2 0 0 0 0 0 2 6 0 15 2 0 0 0 63 0 0 10 0 52 1 0 1.194 39 0.43
61 61 A 0 0 0 0 0 0 0 4 4 16 35 24 0 0 0 4 0 2 0 12 51 1 0 1.708 57 0.33
62 62 A 2 6 0 0 0 0 0 2 14 49 12 0 0 2 4 2 6 0 2 0 51 0 0 1.720 57 0.34
63 63 A 2 0 0 0 0 0 2 4 17 10 37 0 0 2 0 0 6 13 2 6 52 0 0 1.925 64 0.31
64 64 A 4 0 0 0 0 0 0 6 19 4 4 6 0 0 0 0 4 8 8 38 52 18 5 1.910 63 0.34
65 65 A 6 29 15 0 15 6 0 3 9 3 3 3 0 0 0 3 3 0 0 3 34 0 0 2.197 73 0.17
66 66 A 0 50 0 0 0 0 0 3 9 0 12 0 0 6 0 6 9 0 0 6 34 0 0 1.630 54 0.14
67 67 A 0 9 0 6 3 0 0 3 24 0 0 0 0 0 26 9 15 3 0 3 34 0 0 1.984 66 0.14
68 68 A 10 2 0 0 71 0 0 0 7 0 0 0 0 0 0 5 0 2 0 2 42 0 0 1.065 35 0.37
69 69 A 2 2 0 0 2 0 0 0 19 15 2 46 0 0 2 6 0 2 0 2 48 1 0 1.690 56 0.28
70 70 A 2 25 0 0 0 0 0 4 0 4 2 0 0 0 0 0 0 0 0 63 51 0 0 1.049 35 0.24
71 71 A 4 35 0 54 0 0 0 0 6 0 0 0 0 0 2 0 0 0 0 0 52 0 0 1.066 35 0.72
72 72 A 0 4 0 0 0 0 0 0 38 0 29 10 0 0 8 2 2 6 0 2 52 0 0 1.666 55 0.31
73 73 A 0 2 2 0 0 0 0 0 2 0 2 0 0 0 2 0 0 0 25 65 52 0 0 1.004 33 0.60
74 74 A 0 2 0 0 0 0 0 2 0 81 6 0 0 0 0 2 0 0 2 6 52 0 0 0.806 26 0.65
75 75 A 0 0 0 0 0 0 0 8 29 0 10 0 0 4 4 0 6 0 27 13 52 0 0 1.820 60 0.29
76 76 A 2 17 2 4 0 0 0 0 38 0 0 27 0 0 2 4 2 0 0 2 52 1 0 1.655 55 0.27
77 77 A 0 8 0 0 0 0 0 2 6 61 4 6 0 0 8 4 2 0 0 0 51 0 0 1.443 48 0.43
78 78 A 0 78 22 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.521 17 0.84
79 79 A 4 0 0 0 0 0 0 0 22 0 65 2 0 0 2 4 0 2 0 0 51 0 0 1.098 36 0.50
80 80 A 0 0 0 0 2 0 0 0 10 0 84 0 0 0 2 2 0 0 0 0 51 0 3 0.603 20 0.74
81 81 A 0 92 0 0 2 0 0 4 0 0 0 0 0 0 0 0 2 0 0 0 51 0 0 0.356 11 0.82
82 82 A 2 0 2 0 0 0 0 27 0 10 4 0 0 0 19 2 2 0 33 0 52 0 0 1.690 56 0.21
83 83 A 6 60 31 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 2 52 0 0 0.988 32 0.66
84 84 A 0 0 0 0 0 0 0 19 29 0 27 12 0 0 0 2 4 2 0 6 52 0 0 1.720 57 0.39
85 85 A 0 0 0 0 0 0 0 0 0 0 2 0 0 69 0 0 0 0 29 0 52 0 0 0.689 23 0.63
86 86 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.000 0 1.00
87 87 A 0 0 0 0 0 0 0 0 13 0 71 2 0 6 0 0 0 2 0 6 52 0 0 0.993 33 0.55
88 88 A 8 4 42 21 25 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 1.362 45 0.57
89 89 A 65 23 12 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.865 28 0.79
90 90 A 0 0 0 0 37 0 60 0 0 0 0 0 0 4 0 0 0 0 0 0 52 0 0 0.802 26 0.89
91 91 A 0 92 0 6 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 52 0 0 0.314 10 0.93
92 92 A 0 2 0 2 6 0 0 0 46 0 31 8 0 2 2 0 0 2 0 0 52 0 0 1.461 48 0.32
93 93 A 0 0 0 0 0 0 96 0 0 0 0 0 0 4 0 0 0 0 0 0 52 0 0 0.163 5 0.94
94 94 A 0 0 0 0 0 0 0 2 0 34 12 2 0 0 0 0 4 30 0 16 50 0 0 1.561 52 0.31
95 95 A 4 0 0 0 4 0 0 51 4 27 0 4 2 0 2 0 0 0 0 2 49 0 0 1.456 48 0.30
96 96 A 0 0 0 0 0 0 0 4 0 0 0 0 0 2 0 4 4 55 2 28 47 0 0 1.250 41 0.65
97 97 A 0 2 0 0 0 0 0 0 28 0 0 0 0 0 70 0 0 0 0 0 47 0 0 0.686 22 0.36
98 98 A 2 0 0 0 0 0 0 0 0 0 7 37 0 2 35 2 2 11 2 0 46 0 0 1.571 52 0.20
99 99 A 36 0 31 0 0 0 0 0 0 2 29 0 0 0 2 0 0 0 0 0 45 0 0 1.259 42 0.31
100 100 A 0 0 0 0 0 0 0 0 47 7 20 4 0 0 16 2 2 2 0 0 45 0 0 1.540 51 0.34
101 101 A 0 0 0 0 0 0 0 61 2 5 2 0 0 0 11 0 18 0 0 0 44 0 0 1.169 39 0.35
102 102 A 0 0 0 0 0 0 0 11 9 73 2 5 0 0 0 0 0 0 0 0 44 0 0 0.923 30 0.64
103 103 A 5 0 0 0 0 0 0 2 25 41 5 14 0 0 0 5 5 0 0 0 44 0 0 1.632 54 0.36
104 104 A 21 0 2 0 16 0 0 0 19 40 0 0 0 0 2 0 0 0 0 0 43 0 0 1.478 49 0.19
105 105 A 0 0 0 0 0 0 0 0 3 40 9 11 0 6 6 3 6 0 17 0 35 0 0 1.821 60 0.23
106 106 A 0 0 0 0 0 0 0 0 3 50 6 3 0 0 6 32 0 0 0 0 34 0 0 1.252 41 0.33
107 107 A 0 0 0 0 0 0 0 7 83 3 7 0 0 0 0 0 0 0 0 0 30 0 0 0.626 20 0.78
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
11 19 20 1 pQh
12 19 20 1 pNn
14 19 20 1 pHs
15 19 20 1 pLa
18 19 20 1 pHa
26 59 60 1 gAa
27 59 60 1 gAa
29 19 20 1 pSa
30 19 20 1 pSt
32 19 20 1 pAa
33 70 71 1 sCg
34 19 20 1 pAa
35 19 20 1 pAa
36 19 20 1 pAa
37 19 20 1 pAa
38 19 20 1 pAs
39 70 71 1 sVg
40 56 59 1 eDf
40 72 76 1 rQl
41 19 20 1 sAs
43 19 20 1 sAs
44 19 20 1 sAs
45 56 57 1 aYd
46 19 20 1 sAs
47 19 20 1 aAa
48 19 20 1 gAs
49 19 20 1 aAa
50 19 21 8 gAGGGENDVv
50 34 44 1 pWg
50 57 68 7 nADETKAKt
//