Complet list of 1kgm hssp file
Complete list of 1kgm.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1KGM
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-02
HEADER HYDROLASE 28-NOV-01 1KGM
COMPND MOL_ID: 1; MOLECULE: SERINE PROTEASE INHIBITOR I; CHAIN: A; FRAGMENT:
SOURCE MOL_ID: 1; SYNTHETIC: YES; OTHER_DETAILS: SOLID-PHASE SYNTHESIS, FMOC
AUTHOR Z.GASPARI,A.PATTHY,L.GRAF,A.PERCZEL
DBREF 1KGM A 1 35 UNP O46162 SGP1_SCHGR 57 91
SEQLENGTH 35
NCHAIN 1 chain(s) in 1KGM data set
NALIGN 55
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : SGP1_SCHGR 1.00 1.00 1 35 57 91 35 0 0 92 O46162 Serine protease inhibitor I/II OS=Schistocerca gregaria PE=1 SV=1
2 : LCM_LOCMI 0.86 0.97 1 35 57 91 35 0 0 92 P80060 Protease inhibitors OS=Locusta migratoria PE=1 SV=2
3 : MSPI2_MELSA 0.83 0.94 1 35 1 36 36 1 1 36 B3A0N9 Serine protease inhibitor 2 OS=Melanoplus sanguinipes PE=1 SV=1
4 : Q5K4F7_SCHGR 0.82 0.94 1 34 109 142 34 0 0 146 Q5K4F7 Pacifastin-related peptide PP-5 (Precursor) OS=Schistocerca gregaria GN=pp-5 PE=2 SV=1
5 : Q95PM3_SCHGR 0.79 0.91 1 34 59 92 34 0 0 96 Q95PM3 Pacifastin-related 4t (Precursor) OS=Schistocerca gregaria GN=PP4t PE=4 SV=3
6 : Q4GZT5_SCHGR 0.76 0.91 1 34 59 92 34 0 0 96 Q4GZT5 Pacifastin-related 4a (Precursor) OS=Schistocerca gregaria GN=pp-4a PE=4 SV=1
7 : Q8WQ22_LOCMI 0.76 0.94 1 34 109 142 34 0 0 145 Q8WQ22 Pacifastin-related serine protease inhibitor (Precursor) OS=Locusta migratoria migratorioides GN=pp2 PE=1 SV=1
8 : D2A563_TRICA 0.73 0.80 4 33 90 119 30 0 0 120 D2A563 Putative uncharacterized protein GLEAN_15479 OS=Tribolium castaneum GN=GLEAN_15479 PE=4 SV=1
9 : Q17NZ1_AEDAE 0.70 0.80 4 33 24 53 30 0 0 351 Q17NZ1 AAEL000551-PA OS=Aedes aegypti GN=AAEL000551 PE=4 SV=1
10 : Q8MYK3_SCHGR 0.68 0.82 1 34 114 147 34 0 0 151 Q8MYK3 Pacifastin-related peptide (Precursor) OS=Schistocerca gregaria GN=pp-3p PE=2 SV=1
11 : Q8WQ21_LOCMI 0.68 0.85 1 34 160 193 34 0 0 197 Q8WQ21 Pacifastin-related serine protease inhibitor (Precursor) OS=Locusta migratoria migratorioides GN=pp3 PE=2 SV=1
12 : R4UK43_COPFO 0.67 0.83 4 33 73 102 30 0 0 159 R4UK43 Pacifastin-related peptide OS=Coptotermes formosanus PE=2 SV=1
13 : Q7QHR7_ANOGA 0.65 0.74 1 34 27 59 34 1 1 158 Q7QHR7 AGAP011319-PA (Fragment) OS=Anopheles gambiae GN=AGAP011319 PE=4 SV=4
14 : Q8MYK4_SCHGR 0.65 0.82 1 34 114 147 34 0 0 151 Q8MYK4 Pacifastin-related peptide (Precursor) OS=Schistocerca gregaria GN=pp-3a PE=2 SV=1
15 : T1EB95_ANOAQ 0.62 0.72 4 34 21 50 32 2 3 220 T1EB95 Putative serine protease inhibitor i/ii (Fragment) OS=Anopheles aquasalis PE=2 SV=1
16 : W5JP84_ANODA 0.62 0.72 4 34 21 50 32 2 3 595 W5JP84 Uncharacterized protein OS=Anopheles darlingi GN=AND_002015 PE=4 SV=1
17 : K7IMM2_NASVI 0.61 0.71 4 34 37 67 31 0 0 67 K7IMM2 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
18 : R4FJ57_9MOLL 0.61 0.71 7 34 123 149 28 1 1 176 R4FJ57 Paci-Abd-2 OS=Abdopus aculeatus PE=2 SV=1
19 : R4G2K7_9MOLL 0.61 0.71 7 34 123 149 28 1 1 176 R4G2K7 PACI-Abd-3 OS=Abdopus aculeatus PE=2 SV=1
20 : R4G2L8_OCTCY 0.61 0.71 7 34 126 152 28 1 1 179 R4G2L8 PACI-Oct-1 OS=Octopus cyanea PE=2 SV=1
21 : R4G7D5_OCTCY 0.61 0.71 7 34 126 152 28 1 1 179 R4G7D5 PACI-Oct-3 OS=Octopus cyanea PE=2 SV=1
22 : K7J3W0_NASVI 0.59 0.66 4 34 20 50 32 2 2 254 K7J3W0 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
23 : U5EWN5_9DIPT 0.59 0.71 1 34 561 594 34 0 0 968 U5EWN5 Putative serine protease inhibitor i/ii OS=Corethrella appendiculata PE=2 SV=1
24 : K7J3S3_NASVI 0.58 0.77 4 34 162 192 31 0 0 221 K7J3S3 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
25 : V5GNI4_ANOGL 0.58 0.73 1 33 99 131 33 0 0 136 V5GNI4 Serine protease inhibitor I/II (Fragment) OS=Anoplophora glabripennis GN=SGP1 PE=4 SV=1
26 : V5GYZ5_ANOGL 0.58 0.73 1 33 134 166 33 0 0 171 V5GYZ5 Serine protease inhibitor I/II (Fragment) OS=Anoplophora glabripennis GN=SGP1 PE=4 SV=1
27 : E0W2L7_PEDHC 0.57 0.63 1 33 56 90 35 2 2 91 E0W2L7 Serine protease inhibitor, putative OS=Pediculus humanus subsp. corporis GN=Phum_PHUM595790 PE=4 SV=1
28 : R1C921_EMIHU 0.57 0.63 4 34 916 950 35 3 4 1002 R1C921 Uncharacterized protein OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_243216 PE=4 SV=1
29 : R1EFD5_EMIHU 0.57 0.63 4 34 916 950 35 3 4 1002 R1EFD5 Uncharacterized protein OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_209153 PE=4 SV=1
30 : G6DIC0_DANPL 0.56 0.68 1 33 169 201 34 2 2 254 G6DIC0 Cysteine-rich/pacifastin venom protein 2 OS=Danaus plexippus GN=KGM_11391 PE=4 SV=1
31 : K7IZJ0_NASVI 0.56 0.75 3 34 22 53 32 0 0 295 K7IZJ0 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
32 : K7JX31_NASVI 0.56 0.69 4 35 20 51 32 0 0 563 K7JX31 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
33 : K7J3V9_NASVI 0.55 0.68 4 34 120 149 31 1 1 314 K7J3V9 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
34 : K7IZI9_NASVI 0.54 0.71 1 35 18 52 35 0 0 240 K7IZI9 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
35 : R4FJC8_RHOPR 0.54 0.69 1 35 20 53 35 1 1 170 R4FJC8 Putative pacifastin-related serine protease inhibitor OS=Rhodnius prolixus PE=2 SV=1
36 : A2CKF3_TRIIF 0.53 0.70 4 33 23 52 30 0 0 101 A2CKF3 Pacifastin-related serine protease inhibitor (Precursor) OS=Triatoma infestans GN=PP1 PE=4 SV=1
37 : K7IMM3_NASVI 0.53 0.69 4 35 48 79 32 0 0 80 K7IMM3 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
38 : Q19Q50_9DIPT 0.53 0.70 4 33 97 124 30 1 2 172 Q19Q50 Pacifastin-like (Fragment) OS=Belgica antarctica PE=2 SV=1
39 : B0WU78_CULQU 0.51 0.66 1 33 243 276 35 2 3 432 B0WU78 Pacifastin light chain OS=Culex quinquefasciatus GN=CpipJ_CPIJ010990 PE=4 SV=1
40 : E0W2L8_PEDHC 0.51 0.65 1 35 169 205 37 2 2 208 E0W2L8 Major latex allergen Hev b, putative OS=Pediculus humanus subsp. corporis GN=Phum_PHUM595800 PE=4 SV=1
41 : H2BIV8_PORTR 0.51 0.63 1 35 72 104 35 1 2 365 H2BIV8 Pacifastin-related serine protease inhibitor OS=Portunus trituberculatus PE=2 SV=1
42 : R1FR25_EMIHU 0.51 0.57 3 33 204 238 35 3 4 373 R1FR25 Uncharacterized protein OS=Emiliania huxleyi CCMP1516 GN=EMIHUDRAFT_193909 PE=4 SV=1
43 : K7JFB7_NASVI 0.50 0.66 4 33 64 94 32 3 3 289 K7JFB7 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
44 : R4FNR7_RHOPR 0.50 0.53 1 34 72 107 36 1 2 211 R4FNR7 Putative pacifastin inhibitor lcmii with vwf domain protein OS=Rhodnius prolixus PE=2 SV=1
45 : T1E1X3_9DIPT 0.50 0.61 1 33 147 180 36 3 5 483 T1E1X3 Putative serine protease inhibitor i/ii (Fragment) OS=Psorophora albipes PE=2 SV=1
46 : T1IGC1_RHOPR 0.50 0.53 1 34 43 78 36 1 2 195 T1IGC1 Uncharacterized protein OS=Rhodnius prolixus PE=4 SV=1
47 : C3Z6Y1_BRAFL 0.48 0.65 5 35 1577 1605 31 1 2 2085 C3Z6Y1 Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_117488 PE=4 SV=1
48 : K7JGE4_NASVI 0.48 0.68 3 33 77 106 31 1 1 341 K7JGE4 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
49 : R4G3S0_RHOPR 0.48 0.61 4 34 60 92 33 1 2 172 R4G3S0 Putative pacifastin inhibitor lcmii with vwf adomain-containing protein OS=Rhodnius prolixus PE=2 SV=1
50 : R4G3U6_RHOPR 0.48 0.68 4 34 32 61 31 1 1 234 R4G3U6 Protein with 4 pacifastin inhibitor domains lcmii OS=Rhodnius prolixus PE=2 SV=1
51 : SGP3_SCHGR 0.48 0.64 2 34 22 53 33 1 1 63 O46163 Serine protease inhibitor 3 OS=Schistocerca gregaria PE=1 SV=1
52 : T1I1K9_RHOPR 0.48 0.61 4 34 60 92 33 1 2 172 T1I1K9 Uncharacterized protein OS=Rhodnius prolixus PE=4 SV=1
53 : V8NI45_OPHHA 0.48 0.61 1 33 682 712 33 1 2 1502 V8NI45 Mucin-5AC (Fragment) OS=Ophiophagus hannah GN=MUC5AC PE=4 SV=1
54 : G6DP22_DANPL 0.47 0.69 4 35 69 100 32 0 0 151 G6DP22 Cysteine-rich/pacifastin venom protein 2 OS=Danaus plexippus GN=KGM_11689 PE=4 SV=1
55 : W6MNY0_9EUCA 0.46 0.60 1 34 80 112 35 2 3 159 W6MNY0 Pacifastin-like-3 protein (Fragment) OS=Coenobita clypeatus GN=pacifastin-like-3 PE=4 SV=1
## ALIGNMENTS 1 - 55
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A E 0 0 233 27 12 EEEQEEQ EE EE D EEE E EE QEE EEE E E
2 2 A V - 0 0 37 28 91 VIIVVVV VV EV F FFE S FK QDP KKK L A R
3 3 A T + 0 0 133 31 78 TSSNNNN SS KS S SSR GS HQ QSQD IQI K A T E
4 4 A a S S- 0 0 27 51 1 CCCCCCCCCCCCCCCCC CCCCCCCCCCCCCCCCCCCCCCCCC CCCCCYCC
5 5 A E > - 0 0 145 52 72 EEETTTTQETTEETVVP TLPIIEEETLTFTTEPDVTEEETETQTIETIKTE
6 6 A P T 4 S+ 0 0 62 52 22 PPPPPPPPPPPPPPPPP PPAPPPPPPPPPPPPAKPPGPPPPPVPPVPPPPA
7 7 A G T 4 S+ 0 0 57 56 16 GGGGGGGGGGGGGGGGNGGGGGNGGGGGGGGGGGGNNGGGSGGGGGGGGGGGGKG
8 8 A T T 4 + 0 0 86 56 71 TKTTTATTSAASTATTEAATTTEEQQQKKSSSASTTQQSEPTRKTKELTTSTEAA
9 9 A T < - 0 0 49 56 70 TTTTTTTTTTTSTTTTSSSSSYTFTTSDDVVYVTTRPSTTTIMRSRTVKTRKTTK
10 10 A F - 0 0 62 56 18 FFFFFFFFFYYFFYFFFFFFFFFFFFFFFFFFFFWFFYFFWFFFFFYFVWKVIYW
11 11 A K E +A 16 0A 188 56 67 KKQKKKQKKKKKMKKKMKKKKKKHAAAAATLKQQKKKFKKKASLKLYKPKYPKKR
12 12 A D E > -A 15 0A 69 56 71 DDDDNNDKLEEKEEEEDDDDDTFDKKKSSQQKDMEQWDDKDAPSSSDNSEDSDIR
13 13 A K T 3 S- 0 0 106 56 65 KKKKKKRDDGDDDGDDKKKRREDKEEEaaDDDDDDENGaLDldeaeDKgEGgKEG
14 14 A b T 3 S+ 0 0 54 56 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcCCcccccCCcCCcCCC
15 15 A N E < S-A 12 0A 19 56 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDNHNNNNN
16 16 A T E -A 11 0A 60 56 72 TTTTTTTTTIISKITTYTTTTTSVTTTTTTAMGSTWYTDDTTLWDWRTDSWDTTR
17 17 A c - 0 0 6 56 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
18 18 A R - 0 0 136 56 82 RRRRRRRVRRRRRRRRRRRRRVRHTTTFFRTSITFTTVFTRTKVFVTITFTTTLM
19 19 A a B -B 26 0B 0 56 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
20 20 A G - 0 0 20 56 64 GGGSGGSNGRRSGRGGGGGGGANSTTPPPTSSGSSSGGTPAPSNTNTGTTSTKST
21 21 A S S S+ 0 0 117 56 70 SAKSSSSKASSEPSEEPSSSSKDAAADDDENMSNSAPNAPGSNRDRPDDKSDNDA
22 22 A D S S- 0 0 107 56 53 DDDNNNNDDDDDDDNNEKKRRDDNDDSNNDDDDDTNEGDTGSDDNDYDEEGERDN
23 23 A G S S+ 0 0 24 56 24 GGGGGGGGGGGGGGggGNNNNGGGGGgggGGGGGGGGVGgRggGgGGAGGGGKGg
24 24 A K S S+ 0 0 126 47 58 KKKRRRRTKKKKQKkkK..RR.KFKKkkk.LKKKQEK.Rk.pkS.S.RNKANWE.
25 25 A S - 0 0 61 51 77 SSsSSSSNVSSTKSAADNNVVsTSNNsaagSTATIYD.As.aAN.N.AIVWIIS.
26 26 A A B -B 19 0B 36 43 40 AAaAAAAAMGGA.G..AVV..sAAAAaaaaAA.A.AAY.aAa.AaAG.G..G.Fa
27 27 A A - 0 0 67 53 67 AAGASSAASAAVAA..AMMMMIAAIIGAATAATMGTSAATVPVAAARAPSIP.VA
28 28 A c - 0 0 19 56 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
29 29 A T - 0 0 64 56 11 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLTSTTSTT
30 30 A L + 0 0 111 56 79 LLLLLLLLRRKLRRRRKKKKKQLLLLLEELDDNGLLRLELAEMLMLLGLKLLNMR
31 31 A K - 0 0 160 56 57 KKKKMMKKKKKKKKKKMMMMMKMMNNKKKKMMMIMLMKKLKIKMRMKTKEKKKMK
32 32 A A + 0 0 72 56 79 AAAAAAAAFISFMIMMNLLLLQLGPPLAAQALDAAYAALTHALLFLGGAAYAPGL
33 33 A b - 0 0 45 56 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
34 34 A P 0 0 124 41 35 PPPPPPP PP PATTPPPPPPPP PP PPNIH P GD P PP PPPP SP
35 35 A Q 0 0 204 12 41 QNQ N QH Q EQ Q E
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 85 0 4 27 0 0 0.503 16 0.87
2 2 A 32 4 7 0 14 0 0 0 4 4 4 0 0 0 4 14 4 7 0 4 28 0 0 2.131 71 0.09
3 3 A 0 0 6 0 0 0 0 3 3 0 32 10 0 3 3 6 13 3 13 3 31 0 0 2.138 71 0.21
4 4 A 0 0 0 0 0 0 2 0 0 0 0 0 98 0 0 0 0 0 0 0 51 0 0 0.097 3 0.98
5 5 A 6 4 8 0 2 0 0 0 0 6 0 35 0 0 0 2 4 33 0 2 52 0 0 1.738 58 0.27
6 6 A 4 0 0 0 0 0 0 2 6 87 0 0 0 0 0 2 0 0 0 0 52 0 0 0.567 18 0.77
7 7 A 0 0 0 0 0 0 0 89 0 0 2 0 0 0 0 2 0 0 7 0 56 0 0 0.433 14 0.83
8 8 A 0 2 0 0 0 0 0 0 16 2 14 36 0 0 2 9 9 11 0 0 56 0 0 1.826 60 0.28
9 9 A 7 0 2 2 2 0 4 0 0 2 16 50 0 0 7 5 0 0 0 4 56 0 0 1.700 56 0.29
10 10 A 4 0 2 0 75 7 11 0 0 0 0 0 0 0 0 2 0 0 0 0 56 0 0 0.906 30 0.82
11 11 A 0 5 0 4 2 0 4 0 11 4 2 2 0 2 2 57 7 0 0 0 56 0 0 1.621 54 0.33
12 12 A 0 2 2 2 2 2 0 0 2 2 13 2 0 0 2 13 5 14 5 34 56 0 0 2.125 70 0.28
13 13 A 0 4 0 0 0 0 0 13 7 0 0 0 0 0 5 23 0 16 2 30 56 0 10 1.791 59 0.34
14 14 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 56 0 0 0.000 0 1.00
15 15 A 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 96 2 56 0 0 0.179 5 0.95
16 16 A 2 2 5 2 0 7 4 2 2 0 7 54 0 0 4 2 0 0 0 9 56 0 0 1.753 58 0.27
17 17 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 56 0 0 0.000 0 1.00
18 18 A 9 2 4 2 11 0 0 0 0 0 2 25 0 2 43 2 0 0 0 0 56 0 0 1.643 54 0.18
19 19 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 56 0 0 0.000 0 1.00
20 20 A 0 0 0 0 0 0 0 34 4 9 21 18 0 0 5 2 0 0 7 0 56 0 0 1.756 58 0.36
21 21 A 0 0 0 2 0 0 0 2 14 9 30 0 0 0 4 7 0 7 9 16 56 0 0 2.005 66 0.29
22 22 A 0 0 0 0 0 0 2 5 0 0 4 4 0 0 5 4 0 9 21 46 56 0 0 1.645 54 0.46
23 23 A 2 0 0 0 0 0 0 86 2 0 0 0 0 0 2 2 0 0 7 0 56 9 8 0.608 20 0.75
24 24 A 0 2 0 0 2 2 0 0 2 2 4 2 0 0 17 53 4 4 4 0 47 0 0 1.666 55 0.41
25 25 A 8 0 8 0 0 2 2 2 18 0 33 8 0 0 0 2 0 0 14 4 51 13 8 1.979 66 0.23
26 26 A 5 0 0 2 2 0 2 14 72 0 2 0 0 0 0 0 0 0 0 0 43 0 0 1.003 33 0.60
27 27 A 8 0 8 9 0 0 0 6 45 6 9 8 0 0 2 0 0 0 0 0 53 0 0 1.789 59 0.33
28 28 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 56 0 0 0.000 0 1.00
29 29 A 0 2 0 0 0 0 0 0 0 0 4 95 0 0 0 0 0 0 0 0 56 0 0 0.243 8 0.89
30 30 A 0 46 0 5 0 0 0 4 2 0 0 0 0 0 14 13 2 7 4 4 56 0 0 1.740 58 0.20
31 31 A 0 4 4 30 0 0 0 0 0 0 0 2 0 0 2 54 0 2 4 0 56 0 0 1.269 42 0.43
32 32 A 0 21 4 5 5 0 4 7 36 5 2 2 0 2 0 0 4 0 2 2 56 0 0 2.073 69 0.20
33 33 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 56 0 0 0.000 0 1.00
34 34 A 0 0 2 0 0 0 0 2 2 78 2 5 0 2 0 0 0 0 2 2 41 0 0 0.975 32 0.64
35 35 A 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 58 17 17 0 12 0 0 1.119 37 0.59
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
3 26 26 1 sAa
15 21 41 1 gLk
16 21 41 1 gLk
22 22 41 1 sAs
27 24 79 1 gLk
27 26 82 1 sLa
28 11 926 2 aDGc
28 21 938 1 gLk
28 23 941 1 aMa
29 11 926 2 aDGc
29 21 938 1 gLk
29 23 941 1 aMa
30 25 193 1 gHa
39 14 256 2 aEGc
40 24 192 1 gHk
40 26 195 1 sAa
42 12 215 2 lDGc
42 22 227 1 gDp
42 24 230 1 aQa
43 11 74 1 dNc
43 21 85 1 gTk
44 14 85 2 eDGc
45 14 160 2 aDGc
45 24 172 1 gLa
46 14 56 2 eDGc
49 11 70 2 gDGc
52 11 70 2 gDGc
55 24 103 1 gLa
//