Complet list of 1erg hssp file
Complete list of 1erg.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1ERG
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-28
HEADER COMPLEMENT FACTOR 13-DEC-93 1ERG
COMPND MOL_ID: 1; MOLECULE: CD59; CHAIN: A; ENGINEERED: YES
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR B.KIEFFER,P.C.DRISCOLL,I.D.CAMPBELL,A.C.WILLIS,P.A.VAN DER MERWE,S.J.D
DBREF 1ERG A 1 70 UNP P13987 CD59_HUMAN 26 95
SEQLENGTH 70
NCHAIN 1 chain(s) in 1ERG data set
NALIGN 71
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : CD59_HUMAN 1.00 1.00 1 70 26 95 70 0 0 128 P13987 CD59 glycoprotein OS=Homo sapiens GN=CD59 PE=1 SV=1
2 : E9PNW4_HUMAN 1.00 1.00 1 70 26 95 70 0 0 108 E9PNW4 CD59 glycoprotein OS=Homo sapiens GN=CD59 PE=2 SV=1
3 : E9PR17_HUMAN 1.00 1.00 1 70 26 95 70 0 0 130 E9PR17 CD59 glycoprotein OS=Homo sapiens GN=CD59 PE=2 SV=1
4 : Q6FHM9_HUMAN 1.00 1.00 1 70 26 95 70 0 0 128 Q6FHM9 CD59 antigen, complement regulatory protein, isoform CRA_b OS=Homo sapiens GN=CD59 PE=2 SV=1
5 : G2HDS7_PANTR 0.96 1.00 1 70 26 95 70 0 0 144 G2HDS7 CD59 glycoprotein OS=Pan troglodytes PE=2 SV=1
6 : H2Q3D9_PANTR 0.96 1.00 1 70 26 95 70 0 0 128 H2Q3D9 CD59 molecule, complement regulatory protein OS=Pan troglodytes GN=CD59 PE=2 SV=1
7 : G3RL04_GORGO 0.89 0.97 1 70 26 95 70 0 0 128 G3RL04 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101129232 PE=4 SV=1
8 : CD59_PONAB 0.87 0.97 1 70 26 95 70 0 0 128 Q5R510 CD59 glycoprotein OS=Pongo abelii GN=CD59 PE=2 SV=1
9 : M3ZBA1_NOMLE 0.80 0.96 1 70 26 95 70 0 0 128 M3ZBA1 Uncharacterized protein OS=Nomascus leucogenys GN=CD59 PE=4 SV=1
10 : CD59_CHLAE 0.77 0.96 1 70 26 95 70 0 0 128 Q28216 CD59 glycoprotein OS=Chlorocebus aethiops GN=CD59 PE=3 SV=1
11 : CD59_MACFA 0.76 0.96 1 70 26 95 70 0 0 128 Q8SQ46 CD59 glycoprotein OS=Macaca fascicularis GN=CD59 PE=2 SV=1
12 : F6QCT1_MACMU 0.76 0.96 1 70 26 95 70 0 0 128 F6QCT1 CD59 glycoprotein preproprotein OS=Macaca mulatta GN=CD59 PE=2 SV=1
13 : G7PQF7_MACFA 0.76 0.96 1 70 26 95 70 0 0 128 G7PQF7 Macaca fascicularis brain cDNA clone: QorA-14466, similar to human CD59 antigen p18-20 (antigen identified by monoclonalantibodies 16.3A5, EJ16, EJ30, EL32 and G344) (CD59),transcript variant 2, mRNA, RefSeq: NM_000611.4 OS=Macaca fascicularis GN=EGM_05736 PE=2 SV=1
14 : CD59_PAPSP 0.74 0.94 1 70 26 95 70 0 0 126 Q28785 CD59 glycoprotein OS=Papio sp. GN=CD59 PE=3 SV=1
15 : CD59_AOTTR 0.61 0.84 1 70 26 95 70 0 0 128 P51447 CD59 glycoprotein OS=Aotus trivirgatus GN=CD59 PE=3 SV=1
16 : H0WMJ6_OTOGA 0.60 0.80 1 70 26 95 70 0 0 125 H0WMJ6 Uncharacterized protein OS=Otolemur garnettii GN=CD59 PE=4 SV=1
17 : CD59_CALSQ 0.59 0.83 1 70 26 95 70 0 0 128 P46657 CD59 glycoprotein OS=Callithrix sp. GN=CD59 PE=3 SV=1
18 : F6SUS4_CALJA 0.59 0.83 1 70 26 95 70 0 0 128 F6SUS4 CD59 glycoprotein preproprotein OS=Callithrix jacchus GN=CD59 PE=2 SV=1
19 : CD59_SHV21 0.53 0.83 1 70 20 89 70 0 0 121 Q00996 Surface glycoprotein CD59 homolog OS=Saimiriine herpesvirus 2 (strain 11) GN=15 PE=3 SV=1
20 : O40635_SHV2 0.53 0.84 1 70 20 89 70 0 0 116 O40635 CD59 protein OS=Saimiriine herpesvirus 2 GN=orf15 PE=4 SV=1
21 : CD59_SAISC 0.51 0.78 1 70 26 98 73 1 3 131 P47777 CD59 glycoprotein OS=Saimiri sciureus GN=CD59 PE=2 SV=1
22 : L5LDS1_MYODS 0.51 0.70 1 70 26 95 70 0 0 122 L5LDS1 CD59 glycoprotein OS=Myotis davidii GN=MDA_GLEAN10000382 PE=4 SV=1
23 : S7PFA6_MYOBR 0.51 0.71 1 70 26 95 70 0 0 122 S7PFA6 CD59 glycoprotein OS=Myotis brandtii GN=D623_10030910 PE=4 SV=1
24 : CD59_RABIT 0.50 0.76 1 70 26 95 70 0 0 124 O77541 CD59 glycoprotein OS=Oryctolagus cuniculus GN=CD59 PE=1 SV=1
25 : CD59_PIG 0.49 0.72 1 70 26 96 71 1 1 123 O62680 CD59 glycoprotein OS=Sus scrofa GN=CD59 PE=1 SV=1
26 : G1PM32_MYOLU 0.49 0.74 1 70 26 95 70 0 0 116 G1PM32 Uncharacterized protein OS=Myotis lucifugus PE=4 SV=1
27 : G1THU6_RABIT 0.49 0.74 1 70 27 96 70 0 0 125 G1THU6 CD59 glycoprotein OS=Oryctolagus cuniculus GN=CD59 PE=4 SV=2
28 : I3MH91_SPETR 0.49 0.73 1 70 26 95 70 0 0 131 I3MH91 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=CD59 PE=4 SV=1
29 : L8IWY0_9CETA 0.49 0.72 1 70 25 94 71 2 2 121 L8IWY0 CD59 glycoprotein OS=Bos mutus GN=M91_01234 PE=4 SV=1
30 : S7MI50_MYOBR 0.49 0.66 1 70 26 95 70 0 0 122 S7MI50 CD59 glycoprotein OS=Myotis brandtii GN=D623_10002505 PE=4 SV=1
31 : W5Q927_SHEEP 0.49 0.69 1 70 26 95 71 2 2 122 W5Q927 CD59 glycoprotein OS=Ovis aries GN=CD59 PE=4 SV=1
32 : Q32PA1_BOVIN 0.48 0.69 1 70 25 94 71 2 2 121 Q32PA1 CD59 molecule, complement regulatory protein OS=Bos taurus GN=CD59 PE=2 SV=1
33 : T0ML30_9CETA 0.48 0.70 1 70 29 99 71 1 1 127 T0ML30 CD59 glycoprotein OS=Camelus ferus GN=CB1_056579031 PE=4 SV=1
34 : CD59_RAT 0.47 0.74 1 70 23 92 70 0 0 126 P27274 CD59 glycoprotein OS=Rattus norvegicus GN=Cd59 PE=1 SV=2
35 : G1QBH4_MYOLU 0.47 0.64 1 70 26 95 70 0 0 124 G1QBH4 Uncharacterized protein OS=Myotis lucifugus PE=4 SV=1
36 : G5BKJ2_HETGA 0.47 0.77 1 70 4 73 70 0 0 103 G5BKJ2 CD59 glycoprotein (Fragment) OS=Heterocephalus glaber GN=GW7_18981 PE=4 SV=1
37 : K9IWC5_DESRO 0.47 0.67 1 70 25 94 70 0 0 121 K9IWC5 Putative ly-6 antigen / upa receptor-like domain protein OS=Desmodus rotundus PE=2 SV=1
38 : S7MCJ0_MYOBR 0.47 0.66 1 70 26 95 70 0 0 122 S7MCJ0 CD59 glycoprotein OS=Myotis brandtii GN=D623_10011086 PE=4 SV=1
39 : CD59B_MOUSE 0.46 0.69 1 70 24 93 70 0 0 129 P58019 CD59B glycoprotein OS=Mus musculus GN=Cd59b PE=2 SV=2
40 : G1QDS6_MYOLU 0.46 0.64 1 70 26 95 70 0 0 124 G1QDS6 Uncharacterized protein OS=Myotis lucifugus PE=4 SV=1
41 : L5M4R8_MYODS 0.46 0.76 1 70 26 95 70 0 0 122 L5M4R8 CD59 glycoprotein OS=Myotis davidii GN=MDA_GLEAN10000385 PE=4 SV=1
42 : C4N540_APOSY 0.44 0.73 1 70 24 93 70 0 0 130 C4N540 Testicular CD59 OS=Apodemus sylvaticus PE=2 SV=1
43 : L8YH07_TUPCH 0.44 0.67 1 70 72 141 70 0 0 172 L8YH07 CD59 glycoprotein OS=Tupaia chinensis GN=TREES_T100002096 PE=4 SV=1
44 : S7P8T8_MYOBR 0.44 0.73 1 70 26 95 70 0 0 122 S7P8T8 CD59 glycoprotein OS=Myotis brandtii GN=D623_10030911 PE=4 SV=1
45 : A2BI31_MOUSE 0.41 0.66 1 70 24 94 71 1 1 123 A2BI31 CD59a antigen OS=Mus musculus GN=Cd59a PE=2 SV=1
46 : CD59A_MOUSE 0.41 0.66 1 70 24 94 71 1 1 123 O55186 CD59A glycoprotein OS=Mus musculus GN=Cd59a PE=2 SV=1
47 : G5BKJ5_HETGA 0.41 0.69 1 70 15 81 70 1 3 113 G5BKJ5 CD59 glycoprotein (Fragment) OS=Heterocephalus glaber GN=GW7_18984 PE=4 SV=1
48 : Q920G6_MOUSE 0.41 0.66 1 70 24 94 71 1 1 123 Q920G6 CD59A OS=Mus musculus GN=CD59A PE=4 SV=1
49 : R0LFB4_ANAPL 0.41 0.64 1 70 4 73 70 0 0 97 R0LFB4 CD59 glycoprotein (Fragment) OS=Anas platyrhynchos GN=Anapl_05151 PE=4 SV=1
50 : U3IGH4_ANAPL 0.41 0.64 1 70 25 94 70 0 0 118 U3IGH4 Uncharacterized protein OS=Anas platyrhynchos GN=CD59 PE=4 SV=1
51 : G1NCQ9_MELGA 0.40 0.66 1 70 25 94 70 0 0 119 G1NCQ9 Uncharacterized protein OS=Meleagris gallopavo GN=CD59 PE=4 SV=1
52 : F7E3L3_MONDO 0.39 0.57 1 70 25 94 70 0 0 125 F7E3L3 Uncharacterized protein OS=Monodelphis domestica GN=CD59 PE=4 SV=1
53 : J9P4N4_CANFA 0.39 0.72 1 70 42 112 71 1 1 139 J9P4N4 Uncharacterized protein OS=Canis familiaris GN=CD59 PE=4 SV=1
54 : M3WCR6_FELCA 0.39 0.64 1 70 26 97 72 2 2 124 M3WCR6 Uncharacterized protein (Fragment) OS=Felis catus GN=CD59 PE=4 SV=1
55 : M3YJC7_MUSPF 0.39 0.61 1 70 23 92 71 2 2 117 M3YJC7 Uncharacterized protein OS=Mustela putorius furo GN=CD59 PE=4 SV=1
56 : G1LL89_AILME 0.38 0.61 1 70 24 93 71 2 2 120 G1LL89 Uncharacterized protein OS=Ailuropoda melanoleuca GN=CD59 PE=4 SV=1
57 : H0Z5A6_TAEGU 0.36 0.61 1 70 25 94 70 0 0 117 H0Z5A6 Uncharacterized protein OS=Taeniopygia guttata GN=CD59 PE=4 SV=1
58 : K7FHG9_PELSI 0.36 0.66 1 70 27 96 70 0 0 122 K7FHG9 Uncharacterized protein OS=Pelodiscus sinensis PE=4 SV=1
59 : I3KUZ0_ORENI 0.35 0.62 1 70 22 92 71 1 1 107 I3KUZ0 Uncharacterized protein OS=Oreochromis niloticus GN=LOC100703096 PE=4 SV=1
60 : V8P7N9_OPHHA 0.35 0.63 1 70 26 93 71 3 4 123 V8P7N9 CD59 glycoprotein OS=Ophiophagus hannah GN=CD59 PE=4 SV=1
61 : D9IVG0_GECJA 0.34 0.63 1 70 21 91 71 1 1 116 D9IVG0 Glycosylphosphatidylinositol-anchored protein OS=Gecko japonicus GN=CD59 PE=4 SV=1
62 : K7FIC9_PELSI 0.34 0.61 1 70 25 94 70 0 0 119 K7FIC9 Uncharacterized protein OS=Pelodiscus sinensis PE=4 SV=1
63 : U3JF27_FICAL 0.34 0.63 1 70 25 94 70 0 0 117 U3JF27 Uncharacterized protein (Fragment) OS=Ficedula albicollis GN=CD59 PE=4 SV=1
64 : B5XCG2_SALSA 0.33 0.58 1 70 22 92 72 3 3 118 B5XCG2 CD59 glycoprotein OS=Salmo salar GN=CD59 PE=4 SV=1
65 : F6TA80_ORNAN 0.33 0.59 1 70 27 96 70 0 0 122 F6TA80 Uncharacterized protein OS=Ornithorhynchus anatinus GN=CD59 PE=4 SV=1
66 : H2MWG4_ORYLA 0.33 0.60 1 70 22 92 72 3 3 117 H2MWG4 Uncharacterized protein OS=Oryzias latipes GN=LOC101156067 PE=4 SV=1
67 : Q64HX7_ONCMY 0.33 0.56 1 70 22 93 72 2 2 119 Q64HX7 CD59-like protein OS=Oncorhynchus mykiss GN=CD59 PE=4 SV=1
68 : A7RPI9_NEMVE 0.32 0.51 1 70 53 127 75 3 5 151 A7RPI9 Predicted protein OS=Nematostella vectensis GN=v1g200173 PE=4 SV=1
69 : J3SBY7_CROAD 0.32 0.65 1 70 25 92 71 3 4 122 J3SBY7 CD59 glycoprotein-like OS=Crotalus adamanteus PE=2 SV=1
70 : T1DNJ2_CROHD 0.32 0.65 1 70 25 92 71 3 4 122 T1DNJ2 CD59 glycoprotein-like protein OS=Crotalus horridus PE=2 SV=1
71 : H9G4X0_ANOCA 0.31 0.45 1 70 21 93 77 5 11 127 H9G4X0 Uncharacterized protein (Fragment) OS=Anolis carolinensis GN=LYPD2 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A L 0 0 67 72 7 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLILLILIIILL
2 2 A Q - 0 0 82 72 65 QQQQQQQQQQQQQQQKQQQQQEQMQRMQQQQQQRQTQQKQRKERTTTTRRRRTTTTRDQVKRRRKRRKVV
3 3 A a E -A 17 0A 4 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
4 4 A Y E +A 16 0A 94 72 7 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYVYFVYYY
5 5 A N - 0 0 10 72 71 NNNNNNNSDNNNNNSSSSNNSTIHNYHSSTSSSNTETTNTNHSNHHQHHHHDSSATHNSNDTHKSNKVNN
6 6 A b - 0 0 11 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
7 7 A P S S+ 0 0 89 72 83 PPPPPPPPPPPPPPPLPPSSPLLLIPLVIVIIVLVALVLVPLMPFFPFEEENPVIVDSPPIADNSKKTPP
8 8 A N S S- 0 0 108 71 67 NNNNNNNNNNNNNNYNYYHHLNNLNNLDND.NDDDSNDDDNDGNQQNQNNNVEPDDNLGVPNNDLDDDII
9 9 A P S S+ 0 0 51 72 65 PPPPPPPPPPPPPPPPSSSSpPPPPSPPQPNQPPPHPPPPAPGWppTpSSSPSpSNSTgDGSSYTYDEAA
10 10 A T + 0 0 19 67 71 TTTTTTTTTTTTTTTVTTTTsGGSAVSVVVQVPVVNGVVVVVIVvvTvPPPTEs..PVs.PPPTRTTK..
11 11 A A S S+ 0 0 86 69 72 AAAAAAAAATTTTTTAAAMMMGGPGGPSNGVNHSGQGGSGGSSGSSPSSSSQYGEKTGQ.SSTGYGGA..
12 12 A D S S+ 0 0 165 71 83 DDDDDDDDDDDDDNQDRRQQEAENSANF.SDCESSNPSSSASPASSKSLLLHTNTALPNNLLLSGSSNNN
13 13 A b + 0 0 35 72 2 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCTCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
14 14 A K - 0 0 74 72 66 KKKKKKKKKKKKKKTTTTTTTTTSTNSNTTTSTKTTTTKTTKTTNNINKKKTNTNNRKADKSRTNTTNDD
15 15 A T - 0 0 86 72 69 TTTTTTTTTTTTTTMSTTTTALVTTRMESVSITTVVHVIVYTTVMMEMTTTKNKKKTTAKTTTNTKTKRR
16 16 A A E +A 4 0A 48 71 76 AAAAAAAAAAAAAATVTTSSSTTVATVTVIV.INIPVINITNVTNNTNNNNNTTTTNNKNENNTSQTnNN
17 17 A V E -A 3 0A 62 72 77 VVVVVVVVVIIIIITATTTTTTTTMTTTVNIVVSNITNTNTSSTSSKSNNSIISTTSIQIKASQIRQvII
18 18 A N + 0 0 117 72 60 NNNNNNNNNNNNNNNNNNSSNNNNNDNRNNNNNTNVNNTNNTNNTTLTTTTTSVVVTTETNTTDTDDDTT
19 19 A c + 0 0 34 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
20 20 A S > + 0 0 27 72 65 SSSSSSSSSSSSSSTTTTTTTTTTSTTHSTTSTSTATTSTTSTTSSTSLLLRGSSSLATKSTLSPSRSKK
21 21 A S T 3 S+ 0 0 102 71 74 SSSSSSSSSSSSSSSPSSSSSVVPHAPLHVHHHPVPPVPVAPAAPPPPSSSTISVVTSQGSHTNS.VVGG
22 22 A D T 3 S+ 0 0 89 72 50 DDDDDDDGGGGGGGNNNNNNNNNNNNNDNDNNNNDDNDNDDNNNDDNDTTNENNNNTGGEEGTDEYDAEE
23 23 A F < + 0 0 62 72 87 FFFFFFFFFFFFFFLLLLLLLFFHQFHLQFQQQLSIFFLSLLFDQQLQEEEDLFHFELKQYDEDKDEGEE
24 24 A D + 0 0 31 71 6 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD.NDDEDD
25 25 A A E -B 40 0B 2 72 49 AAAAAAVAVTTTTTSASSSSSAAAAAAAAAAAAAAAAASAASATSSASAATAASATTSATSTTAAAAsTT
26 26 A a E -BC 39 64B 2 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcCC
27 27 A L E -BC 38 63B 0 72 14 LLLLLLLLLLLLLLLLLLLLLLLLILLALLLLILLLLLLLLLVLLLLLLLLLLLLLLLLLILLLLLLLVV
28 28 A I E -BC 37 62B 25 72 83 IIIIIIIIIIIIIIIHIIIIIYYTFYTIFYYFIVYYSYYYYITWYYAYQQQRFYLFQWKIWQQTKTTKII
29 29 A T - 0 0 0 72 66 TTTTTTTTTAAAAAATAAAAAVVAVVAVVIVVVAVALIAIVAVVAAAAMMMVIIVSLLLVVILLLLLVVV
30 30 A K + 0 0 125 72 68 KKKKKKKKKRRRRRKIKKKKKKKVeKVQkKkkkVKKYKVKKVKKVVKVRRRMkeklKKTysKKtSsAKyy
31 31 A A B > S-D 34 0C 29 68 70 AAAAAAAAAAAAAAAAAAAAAAASvTSAvAvvvSAAGAAATSAVAA.AFFFTdedeFVS.gLFtVreT..
32 32 A G T 3 S- 0 0 75 71 39 GGGGGGGGGGGGGGGGGGGGGEEGPEGGPEPPPGEGEEGEDGEDGG.GGGGGPPPPGGGgEHGGGGgGgg
33 33 A L T 3 S+ 0 0 155 71 88 LLLLLLLLLLLLLLSPLLSSSTTPPTPPLKLLSKKQVKRKKRLTMM.MKKKTLKKEKEGLQNKGEGGDVV
34 34 A Q B < S-D 31 0C 96 72 74 QQQQQQQRRQQQQQRRRRGKGRRRKRRRKRKKKQRRHRQRQQLRQQSQLLLDRSLVLRKKRLLQTMQAKK
35 35 A V - 0 0 21 72 75 VVVVVVVVVVVVVVVVVVVVVTKVTTVIFTVFSVTVTTVTTVTTVVVVRRRYYYFFRTTNEKRTHTTVNN
36 36 A Y + 0 0 50 72 66 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYHYYYYHYYHYHAYYSYYHYTTTRYYYQTLYTSTTTSYTTTT
37 37 A N E +B 28 0B 3 72 86 NNNNNNNNNNNNNNYHYYYYYYYRYYRHHYRHYQYQYYQYYQYYQQRQSSFYYYRHFHTSYYFRYRRKSS
38 38 A K E -B 27 0B 101 72 58 KKKKKKKQRQQQQQRQRRRRRQQQQQQQQQQQQQQQQQQQGQHQRRQRSSSLQQQQSDERNTSQDQQSQQ
39 39 A c E -B 26 0B 27 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
40 40 A W E -B 25 0B 36 72 22 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWYLWWWWIYLIAWW
41 41 A K > - 0 0 103 71 24 KKKKKKKKKKKKKKKKKKKKKKKRRKRKKMSKKRMRTMKMKKRKKKRKKKKKRKKKKKRKKKKKKKKKKK
42 42 A F G > S+ 0 0 86 71 48 FFFFLLLFSFFFFFFLFFFFFFFYFFYYYFLYIFFYFFLFFFFFQQYQMMLYQLFFAYYYYHAYFYVQYY
43 43 A E G 3 S+ 0 0 164 72 62 EEEEEEEEEAAAAAEEEEDDDEEEDEEKEEEEASEAAESEQSDESSASSSSAKADDSSASSASSPSSTSS
44 44 A H G < + 0 0 85 72 50 HHHHHHHHNNNNNNDNDDDDDNNDENDDENHEDDNDDNDNNDDNDDNDQQQNDDDDQKDDQRQDLDDQDD
45 45 A d < + 0 0 33 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
46 46 A N S S- 0 0 118 72 49 NNNNNNNTDNNNNNTNTTSSSNNNNNNTSNNSKNNNNNNNKTTNHHHHNNNNNSSNNNDNDNSDDEDDNN
47 47 A F S > S- 0 0 127 72 62 FFFFFFFFFFYYYFFFFFFFFYYFFYFFFYFFFAYSFYSYYAYYGGLGVVVMYFYYVIFIGMVFDYFTVV
48 48 A N T 3> S+ 0 0 18 72 60 NNNNKKNNNNNNNNSARRKKKEEEDEEEENEEEKNQNNNNEKDKEEQENNNEQELNNHMDKNNNENNFDD
49 49 A D T 34 S+ 0 0 61 72 94 DDDDDDDENDDDDDRDQQRRRYYFFYFEFYFFVFYTYYYYDFYDIIFIDDDKYRSYEATFNEERDRRKFF
50 50 A V T <4 S+ 0 0 0 72 27 VVVVLLLVIIIIIIVILLIIILLIILIFILIIIILILLILLIILIILIIIIIIIIIIILIIIILILLSII
51 51 A T T 4 S+ 0 0 0 72 64 TTTTTTATSSSSSSSSSSSSSSSSSSSSASAAALSMSSMSSLSAMMKMAAAGSSSSAEASEAAAEAATSS
52 52 A T S >< S+ 0 0 66 72 77 TTTTTTATTTTTTTNRNNNNNKKNRKNKKKNKKSKNQNSNKSKKDDEDVVVENAKKDEVLKDDQKQQCNN
53 53 A R T 3 S+ 0 0 226 72 87 RRRRRRHRHLLLLLQLQQQQQNNRNNRRALAASRLRHNRNKRAQQQRQFFFISSATSDRNHDSMEMMDHH
54 54 A L T 3 S- 0 0 29 72 26 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFILIFLLFLLLLYYYLFLLLFYYFFFFFFFFKFF
55 55 A R < - 0 0 190 72 78 RRRRRRKKKKKKKKSGSSSSSGGEAGEGGGGGGEGRGGDGVEGVEEEEQQQGQGGGLKSSNGLPKPPSSS
56 56 A E + 0 0 71 72 65 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEIEEEEVEGLEGEEEELLLIILLLLTLTTVLsVqsDTT
57 57 A N S S- 0 0 126 71 74 NNNNNNNSNSSSSSNNNNTTTNKNKNNPKKKKKAKNRKAKSADSTTTTDDDAKQKRDSPKGDDpFspGKK
58 58 A E S S+ 0 0 179 71 65 EEEEEEEEEEEEEEEEEEQQQTTSKTSEETEEENTQKTGTNNKNKKQKNNNKKKKKNNDDTNNKSRKKNN
59 59 A L - 0 0 25 72 30 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLILLLVIVLIIILVLLLLVLFFFALLLLFFFFFFFFFFFCFF
60 60 A T + 0 0 110 72 75 TTTTTTTTTQRRRQKRKKKKKQQKKQKTHQRHKQQLWEQEKQTKKKLKEEDTEQEEENTRTHETKTTVRR
61 61 A Y - 0 0 46 72 13 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYSYYYYYFFYFYYYYYYYYFFFYFFFFFFFAYY
62 62 A Y E -C 28 0B 136 72 92 YYYYYYSHNFFFFFYYHHHHHEENNENRDEDDSRERNEKEERLNRRRRFFFHYRSHFKSRRFFDKKDARR
63 63 A d E -C 27 0B 36 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
64 64 A e E -C 26 0B 9 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
65 65 A K + 0 0 95 72 51 KKKKKKKKKKKKKKKSRRKKKQQRRQRRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQKQQTQNSnQQ
66 66 A K S S- 0 0 175 72 59 KKKKKKKKKKKKKEKKEEKKKKKKKKKKKKKKKAKKKKWKKVKKFFKFQQQEKQKQHKKWSKQSWSSkWW
67 67 A D S S- 0 0 60 72 25 DDDDDDDDDDDDDDNDNNNNNDDDDDDKNDNNDDDDDDDDDDDDNNDNDDDDDDDDDNTNDDDDDDHDNN
68 68 A L - 0 0 46 72 4 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLMRLLLLLLLLLLLLLLLLLLFLLLLLLLL
69 69 A e 0 0 37 72 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
70 70 A N 0 0 126 72 0 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
## ALIGNMENTS 71 - 71
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 1 A L 0 0 67 72 7 L
2 2 A Q - 0 0 82 72 65 Q
3 3 A a E -A 17 0A 4 72 0 C
4 4 A Y E +A 16 0A 94 72 7 Y
5 5 A N - 0 0 10 72 71 S
6 6 A b - 0 0 11 72 0 C
7 7 A P S S+ 0 0 89 72 83 K
8 8 A N S S- 0 0 108 71 67 E
9 9 A P S S+ 0 0 51 72 65 l
10 10 A T + 0 0 19 67 71 a
11 11 A A S S+ 0 0 86 69 72 S
12 12 A D S S+ 0 0 165 71 83 Q
13 13 A b + 0 0 35 72 2 C
14 14 A K - 0 0 74 72 66 M
15 15 A T - 0 0 86 72 69 T
16 16 A A E +A 4 0A 48 71 76 I
17 17 A V E -A 3 0A 62 72 77 V
18 18 A N + 0 0 117 72 60 N
19 19 A c + 0 0 34 72 0 C
20 20 A S > + 0 0 27 72 65 S
21 21 A S T 3 S+ 0 0 102 71 74 A
22 22 A D T 3 S+ 0 0 89 72 50 E
23 23 A F < + 0 0 62 72 87 D
24 24 A D + 0 0 31 71 6 T
25 25 A A E -B 40 0B 2 72 49 M
26 26 A a E -BC 39 64B 2 72 0 C
27 27 A L E -BC 38 63B 0 72 14 K
28 28 A I E -BC 37 62B 25 72 83 T
29 29 A T - 0 0 0 72 66 T
30 30 A K + 0 0 125 72 68 m
31 31 A A B > S-D 34 0C 29 68 70 s
32 32 A G T 3 S- 0 0 75 71 39 d
33 33 A L T 3 S+ 0 0 155 71 88 S
34 34 A Q B < S-D 31 0C 96 72 74 T
35 35 A V - 0 0 21 72 75 V
36 36 A Y + 0 0 50 72 66 T
37 37 A N E +B 28 0B 3 72 86 R
38 38 A K E -B 27 0B 101 72 58 A
39 39 A c E -B 26 0B 27 72 0 C
40 40 A W E -B 25 0B 36 72 22 S
41 41 A K > - 0 0 103 71 24 .
42 42 A F G > S+ 0 0 86 71 48 .
43 43 A E G 3 S+ 0 0 164 72 62 T
44 44 A H G < + 0 0 85 72 50 K
45 45 A d < + 0 0 33 72 0 C
46 46 A N S S- 0 0 118 72 49 E
47 47 A F S > S- 0 0 127 72 62 P
48 48 A N T 3> S+ 0 0 18 72 60 S
49 49 A D T 34 S+ 0 0 61 72 94 D
50 50 A V T <4 S+ 0 0 0 72 27 V
51 51 A T T 4 S+ 0 0 0 72 64 D
52 52 A T S >< S+ 0 0 66 72 77 G
53 53 A R T 3 S+ 0 0 226 72 87 I
54 54 A L T 3 S- 0 0 29 72 26 G
55 55 A R < - 0 0 190 72 78 T
56 56 A E + 0 0 71 72 65 T
57 57 A N S S- 0 0 126 71 74 .
58 58 A E S S+ 0 0 179 71 65 .
59 59 A L - 0 0 25 72 30 R
60 60 A T + 0 0 110 72 75 P
61 61 A Y - 0 0 46 72 13 V
62 62 A Y E -C 28 0B 136 72 92 T
63 63 A d E -C 27 0B 36 72 0 C
64 64 A e E -C 26 0B 9 72 0 C
65 65 A K + 0 0 95 72 51 F
66 66 A K S S- 0 0 175 72 59 T
67 67 A D S S- 0 0 60 72 25 D
68 68 A L - 0 0 46 72 4 L
69 69 A e 0 0 37 72 0 C
70 70 A N 0 0 126 72 0 N
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 93 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 72 0 0 0.252 8 0.93
2 2 A 4 0 0 3 0 0 0 0 0 0 0 13 0 0 19 8 49 3 0 1 72 0 0 1.527 50 0.35
3 3 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
4 4 A 3 0 0 0 1 0 96 0 0 0 0 0 0 0 0 0 0 0 0 0 72 0 0 0.200 6 0.93
5 5 A 1 0 1 0 0 0 1 0 1 0 24 11 0 15 0 3 1 1 35 4 72 0 0 1.828 61 0.29
6 6 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
7 7 A 11 13 8 1 4 0 0 0 3 39 6 1 0 0 0 4 0 4 3 3 72 1 0 2.054 68 0.17
8 8 A 3 7 3 0 0 0 4 3 0 3 1 0 0 3 0 0 4 3 45 21 71 0 0 1.805 60 0.32
9 9 A 0 1 0 0 0 1 3 4 4 53 18 4 0 1 0 0 3 1 3 3 72 5 7 1.679 56 0.35
10 10 A 27 0 1 0 0 0 0 4 3 12 7 37 0 0 1 1 1 1 1 0 67 0 0 1.789 59 0.28
11 11 A 1 0 0 4 0 0 3 23 20 4 20 12 0 1 0 1 4 1 3 0 69 1 0 2.096 69 0.27
12 12 A 0 10 0 0 1 0 0 1 7 4 20 3 1 1 3 1 6 4 14 23 71 0 0 2.278 76 0.17
13 13 A 0 0 0 0 0 0 0 0 0 0 0 1 99 0 0 0 0 0 0 0 72 0 0 0.073 2 0.98
14 14 A 0 0 1 1 0 0 0 0 1 0 6 38 0 0 3 32 0 0 14 4 72 0 0 1.577 52 0.34
15 15 A 10 1 3 7 0 0 1 0 3 0 4 50 0 1 4 10 0 3 3 0 72 1 0 1.826 60 0.30
16 16 A 10 0 8 0 0 0 0 0 23 1 6 23 0 0 0 1 1 1 25 0 71 0 1 1.859 62 0.23
17 17 A 21 0 19 1 0 0 0 0 3 0 14 25 0 0 1 3 4 0 8 0 72 0 0 1.923 64 0.22
18 18 A 6 1 0 0 0 0 0 0 0 0 4 25 0 0 1 0 0 1 54 7 72 0 0 1.335 44 0.39
19 19 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
20 20 A 0 7 0 0 0 0 0 1 3 1 44 35 0 1 3 4 0 0 0 0 72 1 0 1.423 47 0.35
21 21 A 14 1 1 0 0 0 0 4 7 17 39 4 0 8 0 0 1 0 1 0 71 0 0 1.847 61 0.25
22 22 A 0 0 0 0 0 0 1 14 1 0 0 6 0 0 0 0 0 10 40 28 72 0 0 1.502 50 0.49
23 23 A 0 21 1 0 33 0 1 1 0 0 3 0 0 4 0 3 13 11 0 8 72 1 0 1.914 63 0.13
24 24 A 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 1 96 71 0 0 0.221 7 0.94
25 25 A 3 0 0 1 0 0 0 0 56 0 21 19 0 0 0 0 0 0 0 0 72 0 1 1.131 37 0.51
26 26 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
27 27 A 4 89 4 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 72 0 0 0.488 16 0.85
28 28 A 1 1 38 0 7 4 21 0 1 0 1 10 0 1 1 4 8 0 0 0 72 0 0 1.935 64 0.17
29 29 A 28 13 8 4 0 0 0 0 29 0 1 17 0 0 0 0 0 0 0 0 72 0 0 1.673 55 0.33
30 30 A 11 1 1 3 0 0 6 0 1 0 4 3 0 0 11 54 1 3 0 0 72 4 13 1.650 55 0.31
31 31 A 12 1 0 0 7 0 0 3 51 0 9 7 0 0 1 0 0 4 0 3 68 0 2 1.661 55 0.29
32 32 A 0 0 0 0 0 0 0 68 0 13 0 0 0 1 0 0 0 14 0 4 71 0 0 0.996 33 0.60
33 33 A 4 32 0 4 0 0 0 6 0 7 8 7 0 0 3 18 3 4 1 1 71 0 0 2.143 71 0.12
34 34 A 1 11 0 1 0 0 0 3 1 0 3 3 0 1 29 14 31 0 0 1 72 0 0 1.836 61 0.26
35 35 A 49 0 1 0 6 0 4 0 0 0 1 22 0 1 7 3 0 1 4 0 72 0 0 1.633 54 0.24
36 36 A 0 1 0 0 0 0 65 0 1 0 4 18 0 7 1 0 1 0 0 0 72 0 0 1.143 38 0.33
37 37 A 0 0 0 0 4 0 35 0 0 0 7 1 0 8 13 1 10 0 21 0 72 0 0 1.824 60 0.13
38 38 A 0 1 0 0 0 0 0 1 1 0 8 1 0 1 15 11 53 1 1 3 72 0 0 1.591 53 0.41
39 39 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
40 40 A 0 3 3 0 0 89 3 0 1 0 1 0 0 0 0 0 0 0 0 0 72 1 0 0.522 17 0.78
41 41 A 0 0 0 6 0 0 0 0 0 0 1 1 0 0 13 79 0 0 0 0 71 0 0 0.731 24 0.76
42 42 A 1 11 1 3 48 0 23 0 3 0 1 0 0 1 0 0 7 0 0 0 71 0 0 1.562 52 0.52
43 43 A 0 0 0 0 0 0 0 0 18 1 26 3 0 0 0 3 1 38 0 10 72 0 0 1.573 52 0.38
44 44 A 0 1 0 0 0 0 0 0 0 0 0 0 0 14 1 3 10 4 25 42 72 0 0 1.563 52 0.50
45 45 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
46 46 A 0 0 0 0 0 0 0 0 0 0 11 10 0 6 0 3 0 3 58 10 72 0 0 1.371 45 0.51
47 47 A 10 1 3 3 44 0 24 6 3 1 3 1 0 0 0 0 0 0 0 1 72 0 0 1.724 57 0.37
48 48 A 0 1 0 1 1 0 0 0 1 0 3 0 0 1 3 13 4 24 40 7 72 0 0 1.781 59 0.39
49 49 A 1 0 4 0 17 0 17 0 1 0 1 3 0 0 11 3 3 7 3 29 72 0 0 2.095 69 0.05
50 50 A 11 28 58 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 72 0 0 1.033 34 0.73
51 51 A 0 3 0 7 0 0 0 1 22 0 47 13 0 0 0 1 0 4 0 1 72 0 0 1.544 51 0.35
52 52 A 6 1 0 0 0 0 0 1 3 0 4 19 1 0 3 22 6 4 21 8 72 0 0 2.150 71 0.22
53 53 A 0 11 3 4 4 0 0 0 7 0 7 1 0 8 24 1 14 1 10 4 72 0 0 2.338 78 0.12
54 54 A 0 67 3 0 21 0 7 1 0 0 0 0 0 0 0 1 0 0 0 0 72 0 0 1.001 33 0.73
55 55 A 3 3 0 0 0 0 0 28 1 4 15 1 0 0 11 14 6 11 1 1 72 0 0 2.135 71 0.22
56 56 A 4 14 4 0 0 0 0 3 0 0 3 8 0 0 0 0 1 61 0 1 72 1 3 1.365 45 0.35
57 57 A 0 0 0 0 1 0 0 3 6 6 14 10 0 0 3 21 1 0 25 10 71 0 0 2.055 68 0.26
58 58 A 0 0 0 0 0 0 0 1 0 0 4 11 0 0 1 20 7 34 18 3 71 0 0 1.785 59 0.35
59 59 A 6 61 7 0 22 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 72 0 0 1.159 38 0.70
60 60 A 1 3 0 0 0 1 0 0 0 1 0 25 0 4 11 22 15 13 1 1 72 0 0 2.001 66 0.24
61 61 A 1 0 0 0 18 0 78 0 1 0 1 0 0 0 0 0 0 0 0 0 72 0 0 0.683 22 0.87
62 62 A 0 1 0 0 15 0 14 0 1 0 6 1 0 11 18 6 0 11 8 7 72 0 0 2.250 75 0.07
63 63 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
64 64 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
65 65 A 0 0 0 0 1 0 0 0 0 0 3 1 0 0 8 28 56 0 3 0 72 0 1 1.207 40 0.49
66 66 A 1 0 0 0 4 7 0 0 1 0 6 1 0 1 0 64 8 6 0 0 72 0 0 1.370 45 0.40
67 67 A 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 1 0 0 22 74 72 0 0 0.738 24 0.74
68 68 A 0 96 0 1 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 72 0 0 0.219 7 0.95
69 69 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 72 0 0 0.000 0 1.00
70 70 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 72 0 0 0.000 0 1.00
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
21 10 35 3 pTMEs
25 31 56 1 eAv
29 30 54 1 kAv
31 30 55 1 kAv
32 30 54 1 kAv
33 31 59 1 kAv
45 10 33 1 pVv
46 10 33 1 pVv
48 10 33 1 pVv
53 31 72 1 kAd
54 10 35 1 pVs
54 31 57 1 eAe
55 30 52 1 kAd
56 30 53 1 lTe
59 10 31 1 gSs
60 29 54 1 yHg
61 31 51 1 sLg
64 30 51 1 tEt
64 56 78 1 sVp
66 30 51 1 sEr
66 56 78 1 qVs
67 32 53 1 eRg
67 57 79 1 sVp
68 17 69 1 nEv
68 26 79 3 sYKYc
68 66 122 1 nDk
69 29 53 1 yHg
70 29 53 1 yHg
71 10 30 2 lTTa
71 31 53 1 mYs
71 32 55 4 sLEDGd
//