Complet list of 1cou hssp file
Complete list of 1cou.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1COU
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-09
HEADER BLOOD CLOTTING 28-MAY-99 1COU
COMPND MOL_ID: 1; MOLECULE: PROTEIN (NEMATODE ANTICOAGULANT PROTEIN C2); CHAI
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: ANCYLOSTOMA CANINUM; ORGANISM_COMMON:
AUTHOR B.M.DUGGAN,H.J.DYSON,P.E.WRIGHT
DBREF 1COU A 1 84 UNP Q16938 Q16938_ANCCA 8 91
SEQLENGTH 85
NCHAIN 1 chain(s) in 1COU data set
NALIGN 48
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : Q16938_ANCCA2H9E 1.00 1.00 1 84 8 91 84 0 0 91 Q16938 Anti-coagulant protein C2 (Precursor) OS=Ancylostoma caninum PE=1 SV=1
2 : Q1W610_ANCCA 0.90 1.00 1 84 1 84 84 0 0 84 Q1W610 Nematode anticoagulant protein 8 (Fragment) OS=Ancylostoma caninum PE=2 SV=1
3 : Q1W611_ANCCA 0.90 0.99 1 84 20 103 84 0 0 103 Q1W611 Nematode anticoagulant protein 7 OS=Ancylostoma caninum PE=4 SV=2
4 : Q6XGY3_ANCCA 0.80 0.90 1 84 1 84 84 0 0 84 Q6XGY3 Anticoagulant protein c3 (Fragment) OS=Ancylostoma caninum PE=2 SV=1
5 : A4ZY75_ANCCA 0.69 0.79 6 83 4 80 78 1 1 80 A4ZY75 Anticoagulant protein 11 (Fragment) OS=Ancylostoma caninum PE=2 SV=1
6 : Q6X631_ANCCA 0.63 0.75 1 83 18 99 83 1 1 99 Q6X631 Anticoagulant protein c4 OS=Ancylostoma caninum GN=APc4 PE=4 SV=1
7 : A4ZY74_ANCCA 0.61 0.73 1 83 9 90 83 1 1 90 A4ZY74 Anticoagulant protein 10 OS=Ancylostoma caninum PE=4 SV=2
8 : A4ZY73_ANCCA 0.60 0.77 5 83 16 94 80 2 2 94 A4ZY73 Anticoagulant protein 9 OS=Ancylostoma caninum PE=4 SV=1
9 : E7EAV1_ANCCA 0.48 0.66 2 78 21 96 79 3 5 100 E7EAV1 Anticoagulant peptide-12 OS=Ancylostoma caninum PE=4 SV=1
10 : Q962V8_9BILA 0.46 0.62 1 78 20 94 79 3 5 102 Q962V8 Anticoagulant peptide-1 (Precursor) OS=Ancylostoma ceylanicum GN=AceAP-1 PE=4 SV=1
11 : G4RIK2_9BILA 0.44 0.66 1 83 28 108 85 3 6 108 G4RIK2 Anticoagulant peptide 1 OS=Ancylostoma duodenale PE=4 SV=1
12 : G4RIJ7_ANCCA 0.41 0.54 1 78 15 87 82 6 13 89 G4RIJ7 Anticoagulant peptide 29 OS=Ancylostoma caninum PE=4 SV=1
13 : G4RIJ3_ANCCA 0.38 0.54 1 78 5 77 82 6 13 79 G4RIJ3 Anticoagulant peptide 30 (Fragment) OS=Ancylostoma caninum PE=2 SV=1
14 : Q16935_ANCCA 0.38 0.54 1 78 1 75 82 6 11 77 Q16935 Anticoagulant peptide (Fragment) OS=Ancylostoma caninum PE=2 SV=1
15 : Q16939_ANCCA 0.38 0.52 1 78 24 96 82 6 13 98 Q16939 Anticoagulant protein 6 (Precursor) OS=Ancylostoma caninum PE=4 SV=1
16 : Q16940_ANCCA2P3F 0.38 0.54 1 78 24 98 82 6 11 100 Q16940 Anti-coagulant protein 5 (Precursor) OS=Ancylostoma caninum PE=1 SV=1
17 : Q16947_ANCCA 0.38 0.54 1 78 15 89 82 6 11 91 Q16947 Anticoagulant peptide isoform 2 OS=Ancylostoma caninum GN=AcAp-2 PE=4 SV=1
18 : Q9U9B1_9BILA 0.38 0.57 1 78 1 71 81 4 13 76 Q9U9B1 Ascaris type serine protease inhibitor (Fragment) OS=Ancylostoma ceylanicum PE=2 SV=1
19 : G4RIJ4_ANCCA 0.37 0.52 1 78 15 89 82 6 11 91 G4RIJ4 Anticoagulant peptide 26 OS=Ancylostoma caninum PE=4 SV=1
20 : G4RIJ5_ANCCA 0.37 0.54 1 78 24 98 82 6 11 100 G4RIJ5 Anticoagulant peptide 27 OS=Ancylostoma caninum PE=4 SV=1
21 : Q8I6Q5_ANOST 0.37 0.58 5 75 24 84 71 4 10 84 Q8I6Q5 Putative salivary secreted serine protease inhibitor OS=Anopheles stephensi PE=4 SV=1
22 : C8XPZ0_9BILA 0.36 0.64 5 78 29 97 76 4 9 104 C8XPZ0 Anticoagulant peptide 4 OS=Ancylostoma duodenale PE=4 SV=1
23 : G4RIJ6_ANCCA 0.35 0.51 1 78 15 89 82 6 11 91 G4RIJ6 Anticoagulant peptide 28 OS=Ancylostoma caninum PE=4 SV=1
24 : A4ZZ77_9BILA 0.34 0.61 5 78 104 172 76 4 9 180 A4ZZ77 Anticoagulant protein 7 (Precursor) OS=Ancylostoma duodenale PE=2 SV=1
25 : G6DTM3_DANPL 0.34 0.53 4 77 109 177 74 2 5 1509 G6DTM3 Serine protease inhibitor 28 OS=Danaus plexippus GN=KGM_16810 PE=3 SV=1
26 : Q2EQ00_ANOGA 0.34 0.56 5 75 25 85 71 4 10 85 Q2EQ00 AGAP006253-PA OS=Anopheles gambiae GN=AGAP006253 PE=4 SV=1
27 : B4PXK7_DROYA 0.33 0.50 4 71 29 87 70 4 13 93 B4PXK7 GE16660 OS=Drosophila yakuba GN=Dyak\GE16660 PE=4 SV=1
28 : G0MTA3_CAEBE 0.33 0.53 2 70 21 79 70 4 12 79 G0MTA3 Putative uncharacterized protein OS=Caenorhabditis brenneri GN=CAEBREN_24494 PE=4 SV=1
29 : H9J9P2_BOMMO 0.33 0.52 5 83 179 248 81 5 13 248 H9J9P2 Uncharacterized protein OS=Bombyx mori GN=Bmo.12647 PE=4 SV=1
30 : Q17PJ2_AEDAE 0.33 0.41 2 70 60 121 70 4 9 132 Q17PJ2 AAEL000351-PA OS=Aedes aegypti GN=AAEL000351 PE=4 SV=1
31 : E5S2I0_TRISP 0.32 0.43 1 76 76 143 76 3 8 264 E5S2I0 Putative trypsin Inhibitor like cysteine rich domain protein OS=Trichinella spiralis GN=Tsp_03044 PE=4 SV=1
32 : E7D188_LATHE 0.32 0.47 6 77 85 148 73 4 10 323 E7D188 Kunitz-like protease inhibitor (Fragment) OS=Latrodectus hesperus PE=2 SV=1
33 : G0NJ06_CAEBE 0.32 0.48 6 76 26 80 73 4 20 89 G0NJ06 Putative uncharacterized protein OS=Caenorhabditis brenneri GN=CAEBREN_25960 PE=4 SV=1
34 : O45764_CAEEL 0.32 0.51 6 78 185 247 73 1 10 249 O45764 Protein T06E6.10 OS=Caenorhabditis elegans GN=CELE_T06E6.10 PE=4 SV=1
35 : Q16MT8_AEDAE 0.32 0.46 6 79 24 89 74 3 8 89 Q16MT8 AAEL012204-PA OS=Aedes aegypti GN=AAEL012204 PE=4 SV=1
36 : Q17PK3_AEDAE 0.32 0.46 6 79 24 89 74 3 8 89 Q17PK3 AAEL000323-PA OS=Aedes aegypti GN=AAEL000323 PE=4 SV=1
37 : W2SXH3_NECAM 0.32 0.43 6 78 40 102 75 5 14 172 W2SXH3 Trypsin Inhibitor like cysteine rich domain protein OS=Necator americanus GN=NECAME_04325 PE=4 SV=1
38 : EG04A_MDBVW 0.31 0.46 6 82 35 99 78 4 14 103 Q4ZJZ2 Probable protease inhibitor Egf0.4a OS=Microplitis demolitor bracovirus (isolate Webb) GN=O4 PE=3 SV=1
39 : F1LFU4_ASCSU 0.31 0.39 1 75 15 79 77 4 14 144 F1LFU4 Chymotrypsin/elastase isoinhibitors 2 to 5 OS=Ascaris suum PE=2 SV=1
40 : F5XVC3_XENTR 0.31 0.45 5 81 650 712 77 4 14 2842 F5XVC3 von Willebrand factor OS=Xenopus tropicalis GN=vwf PE=2 SV=1
41 : F6ZFC8_XENTR 0.31 0.45 5 81 633 695 77 4 14 2810 F6ZFC8 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=vwf PE=4 SV=1
42 : K1R459_CRAGI 0.31 0.50 6 85 854 920 80 4 13 932 K1R459 SCO-spondin OS=Crassostrea gigas GN=CGI_10022620 PE=4 SV=1
43 : Q6TRZ2_CULQU 0.31 0.49 4 75 24 91 74 5 8 91 Q6TRZ2 Putative cysteine-rich protease inhibitor OS=Culex quinquefasciatus PE=4 SV=1
44 : R7TVR6_CAPTE 0.31 0.42 4 80 1399 1466 77 4 9 1702 R7TVR6 Uncharacterized protein OS=Capitella teleta GN=CAPTEDRAFT_227119 PE=4 SV=1
45 : F1QK57_DANRE 0.30 0.42 5 83 996 1059 79 5 15 2153 F1QK57 Uncharacterized protein (Fragment) OS=Danio rerio GN=tecta PE=4 SV=1
46 : H3G367_PRIPA 0.30 0.55 5 84 52 121 82 5 14 626 H3G367 Uncharacterized protein OS=Pristionchus pacificus GN=WBGene00118582 PE=4 SV=1
47 : U6NLZ2_HAECO 0.30 0.43 6 85 493 552 80 3 20 553 U6NLZ2 Protease inhibitor I8 domain containing protein OS=Haemonchus contortus GN=HCOI_00482900 PE=4 SV=1
48 : W2ST73_NECAM 0.30 0.44 4 83 1152 1215 80 4 16 2001 W2ST73 Trypsin Inhibitor like cysteine rich domain protein OS=Necator americanus GN=NECAME_13761 PE=4 SV=1
## ALIGNMENTS 1 - 48
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A K 0 0 154 21 25 KKKK NN KKKKKKKKKKK K K S
2 2 A A + 0 0 15 24 37 AAAA AA ASAAPAAAAASP S P AV A
3 3 A T S S- 0 0 84 24 102 TTTT KN EVTYYYYYYMYY Y N PN R
4 4 A M S S+ 0 0 95 29 64 MMRR PP KKKPPPPPPPPP P P RK KS K KP P
5 5 A Q + 0 0 126 39 67 QQQK SSNKKNEEEEEEKKEKKKETKSKKQS PQQ NGPK E
6 6 A a - 0 0 13 49 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
7 7 A G S S- 0 0 7 49 35 GGGGGGGGGGGGGGGGGGGGGPGGGGPGtGPGgGDDPGGSSEtPPGGG
8 8 A E S S+ 0 0 175 49 39 EEEEEEEEPLDEEEEEEPEEETEPDEEEpEEEePEEPEKEENeEEKEE
9 9 A N S S+ 0 0 75 49 18 NNNNNNNNNNNNNNNNNNNNNDNDNNNNNNNNNNNNNNNGGGNNNNFN
10 10 A E E -A 52 0A 54 49 30 EEEEEQEEEEEEEEEEEEEEEEEEEEEEEEEQEEKKEEEKKFEASEEE
11 11 A K E -A 51 0A 110 49 91 KRRMRRREWRQWWWWWWRWWIWWWVITVVIVVSHVVVAVIIRIEHKSQ
12 12 A Y E +A 50 0A 93 48 38 YYYYYYHYY.FLLLLLLLLLYFLFYYYFWYFYFFFFYYWYYYSYFVFV
13 13 A D E -A 49 0A 56 49 59 DGDDDDDDDLDDDDDDDEDDQDDDDQLKDLQKQVNNKDTQQTRRDDRD
14 14 A S S S+ 0 0 78 49 87 SSSPDEEVKDKVVDVDDAVVRWVYLRTKSEEQLGQQSSEQQPVSEPDN
15 15 A b S S- 0 0 38 49 14 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCSCCCCMCCCCNCCCCC
16 16 A G - 0 0 0 49 40 GGGGNNSGGGGGGGGGGGGGGGGGPGGGPGGGKKGGGRTGGASGTPAG
17 17 A S S S+ 0 0 26 49 68 SSSRRRRNTNMttttttnttTttnSTPTSSSSANSSGRGSSSSSSNSN
18 18 A K > - 0 0 38 23 31 KKKKKKKR...kkkkkkkkk.krkP........I..............
19 19 A E T 3 - 0 0 32 41 76 EEEEEEET...PPPPPPEPPAHPKCGDA.EAA.CAA.G.PPPITA.QR
20 20 A c T > S+ 0 0 0 44 7 CCCCCCCC...CCCCCCCCCCCCCPCCC.CCCCSCCCCCCCCCCC.CC
21 21 A D T < S+ 0 0 71 46 58 DDDDDDDDLL.EEEEEEEEEEEEEPEQE.PPPEDPPEEENNPQVP.EE
22 22 A K T 3 S+ 0 0 49 47 80 KKKQPPPLKKKAAAAAAPAARLARKRTA.PELATEEPKLQQQPPL.PP
23 23 A K B < -b 58 0B 0 49 60 KKKKKKKKAAEKKKKKKTKKTKKKRTENTTTTTKTTTRKTTTTTTVTT
24 24 A d - 0 0 0 49 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
25 25 A K + 0 0 40 48 78 KKKKKKKQEEDGSNSNN.SSSDSSDDAALIDEANEELDGRRRPQSSSE
26 26 A Y - 0 0 0 48 88 YYYYYYYYDRLEGEEEE.EGNREEVNTEYNTNNETTTDQSSNLDNENN
27 27 A D S S- 0 0 43 47 86 DDDDDDD.RKrEEEEEE.EEGEEEDGLGEWVYPKIISHDLLILILSKA
28 28 A G S S+ 0 0 0 12 66 GGGGGGGDCCk.....................................
29 29 A V S > S- 0 0 55 12 81 VVVVTTTGNNI.....................................
30 30 A E T 3 S+ 0 0 84 15 35 EEEEEEEAEEE.............E...N................T..
31 31 A E T 3 S+ 0 0 147 16 66 EEEEEEEEDEQ.............Q...C..............N.C..
32 32 A E X + 0 0 102 17 73 EEEEKKKKEES.............L...N.............PG.E..
33 33 A D T 3 + 0 0 83 24 83 DDDDDD.KNSE..P.PP.PH.LPTI...D.............LK.D..
34 34 A D T 3 + 0 0 131 30 71 DDDDDDDDEEK..P.PP.PP.TPSD...VRS...EE.....GKE.G..
35 35 A E S < S- 0 0 28 41 61 EEEEEEDEEEEEEEEEEDEEEEEECD..DLSQ..HH...SSDLCDN.Y
36 36 A E - 0 0 120 42 53 EEEEKKEEREEEEEEEEGEEEKEKAT..NRGN..EEK..HHEREEE.G
37 37 A P S S+ 0 0 42 43 80 PPPPP.KRDIPEEEEEEEEEWEENPW.HPRFP..EEI.EPPPPKPK.Q
38 38 A N S >> S+ 0 0 37 46 69 NNNN.PPNEESNDDDDDDDDNEDENDGVNREP.RPPP.NDDERMERNV
39 39 A V T 34 S+ 0 0 84 48 82 VVVVVVVAEAAPPPPPPKPPKQPEPKKMVTRK.KEEVNVVVPISEMPK
40 40 A P T 34 S+ 0 0 54 48 86 PPPQEEVEAREIIIIIIIIIPAIALPPFVYPI.MPPAPPDDYQAPCTV
41 41 A c T <4 S+ 0 0 2 48 18 CCCCCCCCCCCCCCCCCCCCCCCCPCCCCCCC.CCCCTCCCCCACMCC
42 42 A L < - 0 0 91 48 90 LLLLLLLLLLLLRRRRRPRRKLRLGKLTDSTP.PPPTFTRRNITPANL
43 43 A V - 0 0 20 49 84 VVVVIITVSSASSSSSSRSSQSSSDAIMDSGAVAEEDCAEEDELLVLL
44 44 A R S S+ 0 0 184 49 88 RRRRRRRRRRRRRRFRRGFFPRFRAARQSTNMQVIIVFNFFTINPKSI
45 45 A V + 0 0 100 49 77 VVVVVVVVADVAAGSGGCSSCVSANCHCCCCCECCCCKCCCQNACMCC
46 46 A b + 0 0 39 49 68 CCCCCCCCCCCCCCCCCYCCVCCCCVIIKVIVCTVVRFRMMCRCSCAD
47 47 A H - 0 0 102 49 91 HHHYHHYYPLNSSLPLLGPPDEPTEDRVAPASLFSSQTPEEVPKERLP
48 48 A Q S S+ 0 0 125 49 73 QHHGGGGDgGPlllgllGggKKgGPKcneGGGGPGGGtPGGEREGQGP
49 49 A D E -A 13 0A 15 35 71 DDDDDDDggrgpppappsaa.sarG.d.r...EG....S..G.G.qRA
50 50 A a E -A 12 0A 1 49 0 CCCCCCCcccccccccccccCcccCCccCCCCCCCCCcCCCCCCCcCC
51 51 A V E -A 11 0A 7 49 69 VVVVIVIVMMLVVVVVVVVVFVVVRFYQVFFHVVFFQWEYYFVFQIQV
52 52 A e E S-A 10 0A 8 49 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
53 53 A E > - 0 0 75 49 69 EQQERKRRDDKEEKEKKEEEQNEKAKNKDQKQNLRRIESPPPEKEDNK
54 54 A E T 3 S+ 0 0 137 49 55 EEEEDDDKPYADDDDDDEDDEDDDDDEEEVNEENEEPKPPPEDEEDIP
55 55 A G T 3 S+ 0 0 47 49 14 GGGGGGGGGGGGGGGGGGGGGGGGNGGGNRGGGGGGGGgGGGGGGGGN
56 56 A F S < S- 0 0 83 48 14 FFFYFFFFFFYFFFFFFFFFFLFLYFFFHFYFFFYYFYmLL.FFFFFY
57 57 A Y E - C 0 65B 75 49 57 YYYFLLLYYYYYYYYYYYYYLYYYYLAFFVVVVFVVLVRYYYIIAVYY
58 58 A R E -bC 23 64B 1 49 35 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRKRRLLRRRTTVRSLRRR
59 59 A N - 0 0 11 49 44 NNNNNNNNNNNDDDDDDDDDDDGDNNNNNDGTNDDDNDTNNQNRRNNK
60 60 A K S S+ 0 0 62 49 77 KKKKKNKNKKRttttttRttGKtDEEAKNnYQAKPSSKHEERrGDtSD
61 61 A D S S- 0 0 75 46 71 DDDNNNNNNDEiiiiiiAiiNFiFNNGDDdDDQHDDDSDDDDd..nD.
62 62 A D S S+ 0 0 1 49 30 DDDDGGGGGGDGGGGGGGGGGGGGGGGKGGGGDDDDGDGGGENSGGGG
63 63 A K - 0 0 84 49 73 KKKIAAARKERDDDDDDDDDNNDNLKTKVSKRQKKKKTKEERRDIKRK
64 64 A e E +C 58 0B 3 49 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
65 65 A V E -C 57 0B 35 49 5 VVVVVVVVVVVVVVVVVVVVVVVVVVIVVIIVVVVVVIVVVVIVVVVV
66 66 A S >> - 0 0 37 49 77 SSSKKKKTSTTRRRKRRTRRRERPARPPPRPRPTLLEKPPPLPPAKTP
67 67 A A H 3> S+ 0 0 11 49 76 AAAAAAAAKEEEEEEEEDEEAKEHKAIRAPPPAQPPnVVKKALARKEQ
68 68 A E H 34 S+ 0 0 151 49 56 EEEEEGEEDDDEEEEEEEEEWDEDEWRNEREETEEEgESSSEESSNAR
69 69 A D H X> S+ 0 0 73 49 35 DNNDDDDDDDVEEEEEEDEEREEEEHQKEDDENEDDDDEDDDDDDEDD
70 70 A d H 3X S+ 0 0 0 49 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
71 71 A E H 3< S+ 0 0 47 47 66 EEEEEEEEEEQGDDDDDADDNNDNQNS P GTPDPPPPPSSPPTGPDP
72 72 A L H X4 S+ 0 0 119 46 97 LLLLLLLLYYEQQQQQQiQQPDQDTP S KsAENNPNRCCCSECAGP
73 73 A D H >< S+ 0 0 29 20 55 DDDDDDDDDDD......d........ . Pt.QNN.V....A.....
74 74 A N G >< + 0 0 12 25 46 NNNNNNNNNNF......D..N....N . NNNKAANSK...N.....
75 75 A M G < S+ 0 0 155 36 80 MMMMMMMMMMMHHHHHHMHHLMHM.L . NSRRTTQEM...I.....
76 76 A D G < S- 0 0 113 33 30 DEEEEEEEEEEEEEEEEEEE EEEE . DEEKTTQN ... .....
77 77 A F < - 0 0 119 33 52 FFFFFFFFFIFIIIIIIFII IIIL Y V SYYYL ... P....
78 78 A I + 0 0 2 36 49 IIIIIIIIIIIIIIIIIIII III V IAAVE YYY KFV..
79 79 A Y + 0 0 125 20 26 YYYYYYYY Y I YY F YYY YYD..
80 80 A P > + 0 0 7 19 66 PPPPPPPP P N S DDN PLN.P
81 81 A G T 4 S+ 0 0 50 18 61 GGGGNNNK E T E GGG GS.K
82 82 A T T 4 S- 0 0 141 17 73 TTTTRRRR Q E T Q RTNT
83 83 A R T 4 S+ 0 0 95 16 71 RQQQKKKK D R E QVTK
84 84 A N < 0 0 116 8 64 NHHH W HN
85 85 A P 0 0 24 3 0 P P
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 86 0 0 10 0 21 0 0 0.501 16 0.75
2 2 A 4 0 0 0 0 0 0 0 71 13 13 0 0 0 0 0 0 0 0 0 24 0 0 0.897 29 0.62
3 3 A 4 0 0 4 0 0 38 0 0 4 0 25 0 0 4 4 0 4 13 0 24 0 0 1.769 59 -0.02
4 4 A 0 0 0 10 0 0 0 0 0 52 3 0 0 0 10 24 0 0 0 0 29 0 0 1.270 42 0.35
5 5 A 0 0 0 0 0 0 0 3 0 5 10 3 0 0 0 31 18 23 8 0 39 0 0 1.780 59 0.32
6 6 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 0 0 0.000 0 1.00
7 7 A 0 0 0 0 0 0 0 73 0 12 4 4 0 0 0 0 0 2 0 4 49 0 3 0.955 31 0.65
8 8 A 0 2 0 0 0 0 0 0 0 12 0 2 0 0 0 4 0 73 2 4 49 0 0 0.983 32 0.61
9 9 A 0 0 0 0 2 0 0 6 0 0 0 0 0 0 0 0 0 0 88 4 49 0 0 0.496 16 0.81
10 10 A 0 0 0 0 2 0 0 0 2 0 2 0 0 0 0 8 4 82 0 0 49 0 0 0.739 24 0.70
11 11 A 18 0 12 2 0 24 0 0 2 0 4 2 0 4 16 6 4 4 0 0 49 1 0 2.140 71 0.08
12 12 A 4 21 0 0 23 4 44 0 0 0 2 0 0 2 0 0 0 0 0 0 48 0 0 1.452 48 0.62
13 13 A 2 6 0 0 0 0 0 2 0 0 0 4 0 0 6 6 12 2 4 55 49 0 0 1.598 53 0.40
14 14 A 16 4 0 0 0 2 2 2 2 6 16 2 0 0 4 6 10 12 2 12 49 0 0 2.419 80 0.13
15 15 A 0 0 0 2 0 0 0 0 0 0 2 0 94 0 0 0 0 0 2 0 49 0 0 0.298 9 0.86
16 16 A 0 0 0 0 0 0 0 71 4 6 4 4 0 0 2 4 0 0 4 0 49 0 0 1.144 38 0.59
17 17 A 0 0 0 2 0 0 0 4 2 2 37 29 0 0 10 0 0 0 14 0 49 26 12 1.606 53 0.32
18 18 A 0 0 4 0 0 0 0 0 0 4 0 0 0 0 9 83 0 0 0 0 23 0 0 0.643 21 0.69
19 19 A 0 0 2 0 0 0 0 5 17 29 0 5 5 2 2 2 2 24 0 2 41 0 0 1.991 66 0.24
20 20 A 0 0 0 0 0 0 0 0 0 2 2 0 95 0 0 0 0 0 0 0 44 0 0 0.216 7 0.92
21 21 A 2 4 0 0 0 0 0 0 0 17 0 0 0 0 0 0 4 46 4 22 46 0 0 1.486 49 0.42
22 22 A 0 11 0 0 0 0 0 0 23 21 0 4 0 0 6 19 9 6 0 0 47 0 0 1.919 64 0.19
23 23 A 2 0 0 0 0 0 0 0 4 0 0 39 0 0 4 45 0 4 2 0 49 0 0 1.277 42 0.40
24 24 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 1 0 0.000 0 1.00
25 25 A 0 4 2 0 0 0 0 4 6 2 21 0 0 0 6 17 4 13 8 13 48 0 0 2.257 75 0.22
26 26 A 2 4 0 0 0 0 21 4 0 0 4 10 0 0 4 0 2 23 19 6 48 1 0 2.078 69 0.12
27 27 A 2 11 9 0 0 2 2 6 2 2 4 0 0 2 4 6 0 26 0 21 47 36 1 2.237 74 0.13
28 28 A 0 0 0 0 0 0 0 67 0 0 0 0 17 0 0 8 0 0 0 8 12 0 0 0.983 32 0.34
29 29 A 42 0 8 0 0 0 0 8 0 0 0 25 0 0 0 0 0 0 17 0 12 0 0 1.424 47 0.18
30 30 A 0 0 0 0 0 0 0 0 7 0 0 7 0 0 0 0 0 80 7 0 15 0 0 0.720 24 0.65
31 31 A 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 13 63 6 6 16 0 0 1.160 38 0.34
32 32 A 0 6 0 0 0 0 0 6 0 6 6 0 0 0 0 24 0 47 6 0 17 1 0 1.528 51 0.27
33 33 A 0 8 4 0 0 0 0 0 0 21 4 4 0 4 0 8 0 4 4 38 24 0 0 1.903 63 0.17
34 34 A 3 0 0 0 0 0 0 7 0 20 7 3 0 0 3 7 0 17 0 33 30 0 0 1.868 62 0.28
35 35 A 0 5 0 0 0 0 2 0 0 0 7 0 5 5 0 0 2 56 2 15 41 0 0 1.511 50 0.39
36 36 A 0 0 0 0 0 0 0 7 2 0 0 2 0 5 7 12 0 60 5 0 42 1 0 1.407 46 0.46
37 37 A 0 0 5 0 2 5 0 0 0 35 0 0 0 2 5 7 2 33 2 2 43 1 0 1.784 59 0.20
38 38 A 4 0 0 2 0 0 0 2 0 13 2 0 0 0 9 0 0 15 26 26 46 0 0 1.852 61 0.30
39 39 A 27 0 2 4 0 0 0 0 6 25 2 2 0 0 2 15 2 10 2 0 48 0 0 2.006 66 0.18
40 40 A 6 2 23 2 2 0 4 0 10 27 0 2 2 0 2 0 4 8 0 4 48 0 0 2.189 73 0.13
41 41 A 0 0 0 2 0 0 0 0 2 2 0 2 92 0 0 0 0 0 0 0 48 0 0 0.402 13 0.82
42 42 A 0 35 2 0 2 0 0 2 2 13 2 10 0 0 21 4 0 0 4 2 48 0 0 1.939 64 0.09
43 43 A 16 8 6 2 0 0 0 2 10 0 29 2 2 0 2 0 2 10 0 8 49 0 0 2.176 72 0.16
44 44 A 4 0 8 2 14 0 0 2 4 4 4 4 0 0 41 2 4 0 6 0 49 0 0 2.041 68 0.11
45 45 A 22 0 0 2 0 0 0 6 10 0 8 0 37 2 0 2 2 2 4 2 49 0 0 1.919 64 0.22
46 46 A 12 0 6 4 2 0 2 0 2 0 2 2 57 0 6 2 0 0 0 2 49 0 0 1.605 53 0.31
47 47 A 4 12 0 0 2 0 6 2 4 18 10 4 0 12 4 2 2 10 2 4 49 0 0 2.512 83 0.08
48 48 A 0 10 0 0 0 0 0 47 0 10 0 2 2 4 2 6 6 6 2 2 49 14 12 1.862 62 0.27
49 49 A 0 0 0 0 0 0 0 20 14 14 9 0 0 0 11 0 3 3 0 26 35 0 18 1.889 63 0.29
50 50 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 0 0 0.000 0 1.00
51 51 A 49 2 6 4 16 2 6 0 0 0 0 0 0 2 2 0 8 2 0 0 49 0 0 1.720 57 0.30
52 52 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 0 0 0.000 0 1.00
53 53 A 0 2 2 0 0 0 0 0 2 6 2 0 0 0 10 22 10 27 8 8 49 0 0 2.051 68 0.30
54 54 A 2 0 2 0 0 0 2 0 2 12 0 0 0 0 0 4 0 35 4 37 49 0 0 1.571 52 0.44
55 55 A 0 0 0 0 0 0 0 92 0 0 0 0 0 0 2 0 0 0 6 0 49 1 1 0.329 10 0.86
56 56 A 0 8 0 2 71 0 17 0 0 0 0 0 0 2 0 0 0 0 0 0 48 0 0 0.911 30 0.85
57 57 A 16 12 4 0 8 0 53 0 4 0 0 0 0 0 2 0 0 0 0 0 49 0 0 1.434 47 0.42
58 58 A 2 6 0 0 0 0 0 0 0 0 2 4 0 0 84 2 0 0 0 0 49 0 0 0.689 22 0.65
59 59 A 0 0 0 0 0 0 0 4 0 0 0 4 0 0 4 2 2 0 49 35 49 0 0 1.267 42 0.55
60 60 A 0 0 0 0 0 0 2 4 4 2 6 20 0 2 8 27 2 8 8 6 49 3 12 2.211 73 0.23
61 61 A 0 0 20 0 4 0 0 2 2 0 2 0 0 2 0 0 2 2 22 41 46 0 0 1.652 55 0.28
62 62 A 0 0 0 0 0 0 0 69 0 0 2 0 0 0 0 2 0 2 2 22 49 0 0 0.907 30 0.70
63 63 A 2 2 4 0 0 0 0 0 6 0 2 4 0 0 12 31 2 6 6 22 49 0 0 2.047 68 0.26
64 64 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 0 0 0.000 0 1.00
65 65 A 90 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 49 0 0 0.330 11 0.94
66 66 A 0 6 0 0 0 0 0 0 4 24 10 12 0 0 24 14 0 4 0 0 49 0 0 1.889 63 0.23
67 67 A 4 2 2 0 0 0 0 0 31 10 0 0 0 2 4 12 4 24 2 2 49 0 1 1.986 66 0.23
68 68 A 0 0 0 0 0 4 0 4 2 0 10 2 0 0 6 0 0 57 4 10 49 0 0 1.507 50 0.44
69 69 A 2 0 0 0 0 0 0 0 0 0 0 0 0 2 2 2 2 35 6 49 49 0 0 1.285 42 0.65
70 70 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 49 0 0 0.000 0 1.00
71 71 A 0 0 0 0 0 0 0 6 2 23 6 4 0 0 0 0 4 23 9 21 47 0 0 1.921 64 0.33
72 72 A 0 20 2 0 0 0 4 2 4 9 7 2 9 0 2 2 20 7 7 4 46 26 2 2.422 80 0.03
73 73 A 5 0 0 0 0 0 0 0 5 5 0 5 0 0 0 0 5 0 10 65 20 0 0 1.259 42 0.45
74 74 A 0 0 0 0 4 0 0 0 8 0 4 0 0 0 0 8 0 0 72 4 25 0 0 1.027 34 0.53
75 75 A 0 6 3 44 0 0 0 0 0 0 3 6 0 25 6 0 3 3 3 0 36 0 0 1.686 56 0.20
76 76 A 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 3 3 76 3 9 33 0 0 0.916 30 0.69
77 77 A 3 6 36 0 36 0 12 0 0 3 3 0 0 0 0 0 0 0 0 0 33 0 0 1.479 49 0.47
78 78 A 8 0 69 0 3 0 8 0 6 0 0 0 0 0 0 3 0 3 0 0 36 0 0 1.127 37 0.51
79 79 A 0 0 5 0 5 0 85 0 0 0 0 0 0 0 0 0 0 0 0 5 20 0 0 0.588 19 0.73
80 80 A 0 5 0 0 0 0 0 0 0 63 5 0 0 0 0 0 0 0 16 11 19 0 0 1.129 37 0.34
81 81 A 0 0 0 0 0 0 0 50 0 0 6 6 0 0 0 11 0 11 17 0 18 0 0 1.455 48 0.39
82 82 A 0 0 0 0 0 0 0 0 0 0 0 47 0 0 29 0 12 6 6 0 17 0 0 1.300 43 0.27
83 83 A 6 0 0 0 0 0 0 0 0 0 0 6 0 0 19 31 25 6 0 6 16 0 0 1.717 57 0.29
84 84 A 0 0 0 0 0 13 0 0 0 0 0 0 0 50 0 0 0 0 38 0 8 0 0 0.974 32 0.36
85 85 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 3 0 0 0.000 0 1.00
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
8 45 60 1 gDc
9 45 65 1 gPg
9 46 67 1 gVc
10 46 65 1 rVc
11 24 51 1 rCk
11 46 74 1 gAc
12 18 32 1 tKk
12 42 57 1 lPp
12 43 59 1 pVc
12 54 71 1 tVi
13 18 22 1 tRk
13 42 47 1 lPp
13 43 49 1 pAc
13 54 61 1 tVi
14 18 18 1 tQk
14 44 45 1 lPp
14 45 47 1 pAc
14 56 59 1 tVi
15 18 41 1 tKk
15 42 66 1 gPa
15 43 68 1 aAc
15 54 80 1 tVi
16 18 41 1 tQk
16 44 68 1 lPp
16 45 70 1 pAc
16 56 82 1 tVi
17 18 32 1 tQk
17 44 59 1 lPp
17 45 61 1 pAc
17 56 73 1 tVi
18 18 18 1 nLk
18 40 41 1 sVc
18 63 65 1 iWd
19 18 32 1 tKk
19 44 59 1 gPa
19 45 61 1 aAc
19 56 73 1 tVi
20 18 41 1 tRk
20 44 68 1 gPa
20 45 70 1 aAc
20 56 82 1 tVi
22 14 42 1 tYk
22 41 70 1 sAc
23 18 32 1 tKr
23 44 59 1 gPa
23 45 61 1 aAc
23 56 73 1 tVi
24 14 117 1 nYk
24 41 145 1 rAc
27 35 63 1 cPd
27 36 65 1 dAc
28 38 58 1 nVc
29 4 182 1 tRp
29 38 217 1 ePr
30 52 111 1 nId
32 59 143 1 sNt
33 3 28 2 gGRe
37 52 91 2 nCNg
38 32 66 1 tVc
39 45 59 2 gRGm
43 5 28 1 tGe
43 52 76 1 rEd
46 39 90 1 qKc
46 50 102 1 tKn
//