Complet list of 2alc hssp file
Complete list of 2alc.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2ALC
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-16
HEADER DNA BINDING PROTEIN 20-JAN-99 2ALC
COMPND MOL_ID: 1; MOLECULE: PROTEIN (ETHANOL REGULON TRANSCRIPTIONAL ACTIVATO
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: EMERICELLA NIDULANS; ORGANISM_TAXID: 1
AUTHOR R.CERDAN,B.CAHUZAC,B.FELENBOK,E.GUITTET
DBREF 2ALC A 1 63 UNP P21228 ALCR_EMENI 1 63
SEQLENGTH 65
NCHAIN 1 chain(s) in 2ALC data set
NALIGN 65
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : ALCR_EMENI 1F5E 0.95 0.97 3 65 1 63 63 0 0 821 P21228 Regulatory protein alcR OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=alcR PE=1 SV=2
2 : B6HNY2_PENCW 0.52 0.68 3 65 1 55 63 1 8 805 B6HNY2 Pc21g22800 protein OS=Penicillium chrysogenum (strain ATCC 28089 / DSM 1075 / Wisconsin 54-1255) GN=Pc21g22800 PE=4 SV=1
3 : W6Q4H2_PENRO 0.52 0.73 3 65 1 60 63 1 3 800 W6Q4H2 Regulatory protein alcR OS=Penicillium roqueforti GN=alcR PE=4 SV=1
4 : K9GCT0_PEND2 0.51 0.70 3 65 1 60 63 1 3 806 K9GCT0 Regulatory protein alcR OS=Penicillium digitatum (strain PHI26 / CECT 20796) GN=PDIG_06570 PE=4 SV=1
5 : B0Y3Q7_ASPFC 0.47 0.63 3 65 1 73 73 1 10 866 B0Y3Q7 C6 transcription factor AlcR OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) GN=AFUB_055060 PE=4 SV=1
6 : Q4WU81_ASPFU 0.47 0.63 3 65 1 73 73 1 10 866 Q4WU81 C6 transcription factor AlcR OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=AFUA_5G07510 PE=4 SV=1
7 : V5FVH6_BYSSN 0.47 0.62 7 64 37 108 72 3 14 957 V5FVH6 C6 zinc finger domain protein OS=Byssochlamys spectabilis (strain No. 5 / NBRC 109023) GN=PVAR5_4717 PE=4 SV=1
8 : A1CA83_ASPCL 0.46 0.65 3 65 1 74 74 1 11 1308 A1CA83 C6 zinc finger domain protein OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1) GN=ACLA_010770 PE=4 SV=1
9 : K9H238_PEND1 0.43 0.65 3 65 1 60 63 1 3 806 K9H238 Regulatory protein alcR OS=Penicillium digitatum (strain Pd1 / CECT 20795) GN=PDIP_11210 PE=4 SV=1
10 : I8TPH2_ASPO3 0.41 0.70 7 65 4 69 66 3 7 826 I8TPH2 Uncharacterized protein OS=Aspergillus oryzae (strain 3.042) GN=Ao3042_07751 PE=4 SV=1
11 : Q2UDT6_ASPOR 0.41 0.70 7 65 4 69 66 3 7 826 Q2UDT6 Predicted protein OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=AO090012000035 PE=4 SV=1
12 : V5HYH2_BYSSN 0.41 0.54 3 56 1 74 74 3 20 966 V5HYH2 C6 zinc finger domain protein OS=Byssochlamys spectabilis (strain No. 5 / NBRC 109023) GN=PVAR5_3660 PE=4 SV=1
13 : W6PYQ3_PENRO 0.41 0.55 4 59 13 74 64 3 10 849 W6PYQ3 Zn(2)-C6 fungal-type DNA-binding domain OS=Penicillium roqueforti GN=PROQFM164_S01g003210 PE=4 SV=1
14 : M2SSG1_COCSN 0.39 0.52 6 65 33 109 77 2 17 931 M2SSG1 Uncharacterized protein OS=Cochliobolus sativus (strain ND90Pr / ATCC 201652) GN=COCSADRAFT_181150 PE=4 SV=1
15 : N4WRU9_COCH4 0.39 0.52 6 65 33 109 77 2 17 913 N4WRU9 Uncharacterized protein OS=Cochliobolus heterostrophus (strain C4 / ATCC 48331 / race T) GN=COCC4DRAFT_51704 PE=4 SV=1
16 : W6XXZ5_COCCA 0.39 0.52 6 65 33 109 77 2 17 913 W6XXZ5 Uncharacterized protein OS=Bipolaris zeicola 26-R-13 GN=COCCADRAFT_7380 PE=4 SV=1
17 : W6Z947_COCMI 0.39 0.52 6 65 33 109 77 2 17 937 W6Z947 Uncharacterized protein OS=Bipolaris oryzae ATCC 44560 GN=COCMIDRAFT_4668 PE=4 SV=1
18 : W7ERS0_COCVI 0.39 0.52 6 65 33 109 77 2 17 913 W7ERS0 Uncharacterized protein OS=Bipolaris victoriae FI3 GN=COCVIDRAFT_26958 PE=4 SV=1
19 : W9WC41_9EURO 0.39 0.56 7 65 30 106 77 2 18 850 W9WC41 Uncharacterized protein OS=Cladophialophora yegresii CBS 114405 GN=A1O7_01854 PE=4 SV=1
20 : B6HLT5_PENCW 0.37 0.58 14 65 29 80 52 0 0 154 B6HLT5 Pc21g21320 protein OS=Penicillium chrysogenum (strain ATCC 28089 / DSM 1075 / Wisconsin 54-1255) GN=Pc21g21320 PE=4 SV=1
21 : Q3HUR7_PENCH 0.37 0.58 14 65 29 80 52 0 0 154 Q3HUR7 Putative uncharacterized protein OS=Penicillium chrysogenum PE=2 SV=1
22 : R1ECJ7_BOTPV 0.37 0.50 9 63 10 68 68 3 22 942 R1ECJ7 Putative c6 transcription factor protein OS=Botryosphaeria parva (strain UCR-NP2) GN=UCRNP2_7812 PE=4 SV=1
23 : W9K485_FUSOX 0.37 0.54 8 54 7 71 65 3 18 879 W9K485 Uncharacterized protein OS=Fusarium oxysporum Fo47 GN=FOZG_11954 PE=4 SV=1
24 : W9MUV4_FUSOX 0.37 0.54 8 54 7 71 65 3 18 879 W9MUV4 Uncharacterized protein OS=Fusarium oxysporum f. sp. lycopersici MN25 GN=FOWG_03339 PE=4 SV=1
25 : W9XH21_9EURO 0.37 0.57 7 65 8 83 76 2 17 836 W9XH21 Uncharacterized protein OS=Cladophialophora psammophila CBS 110553 GN=A1O5_01148 PE=4 SV=1
26 : X0KH65_FUSOX 0.37 0.52 3 54 1 49 52 1 3 205 X0KH65 Uncharacterized protein OS=Fusarium oxysporum f. sp. vasinfectum 25433 GN=FOTG_18575 PE=4 SV=1
27 : C1GKT2_PARBD 0.36 0.52 4 59 47 96 56 1 6 156 C1GKT2 Uncharacterized protein OS=Paracoccidioides brasiliensis (strain Pb18) GN=PADG_07939 PE=4 SV=1
28 : E4ZYL9_LEPMJ 0.36 0.51 8 64 35 108 74 2 17 909 E4ZYL9 Putative uncharacterized protein OS=Leptosphaeria maculans (strain JN3 / isolate v23.1.3 / race Av1-4-5-6-7-8) GN=LEMA_P108100.1 PE=4 SV=1
29 : S3BYJ6_OPHP1 0.36 0.48 3 59 1 77 77 3 20 1026 S3BYJ6 Uncharacterized protein OS=Ophiostoma piceae (strain UAMH 11346) GN=F503_03618 PE=4 SV=1
30 : W9YZJ3_9EURO 0.36 0.51 7 56 12 81 70 3 20 904 W9YZJ3 Uncharacterized protein OS=Capronia coronata CBS 617.96 GN=A1O1_03118 PE=4 SV=1
31 : S0EJ52_GIBF5 0.35 0.52 8 54 7 71 65 3 18 878 S0EJ52 Related to regulatory protein alcR OS=Gibberella fujikuroi (strain CBS 195.34 / IMI 58289 / NRRL A-6831) GN=FFUJ_09271 PE=4 SV=1
32 : U7PU59_SPOS1 0.35 0.47 8 59 2 79 78 3 26 1052 U7PU59 Uncharacterized protein OS=Sporothrix schenckii (strain ATCC 58251 / de Perez 2211183) GN=HMPREF1624_05078 PE=4 SV=1
33 : W9HY92_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 W9HY92 Uncharacterized protein OS=Fusarium oxysporum FOSC 3-a GN=FOYG_12872 PE=4 SV=1
34 : W9NLF7_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 W9NLF7 Uncharacterized protein OS=Fusarium oxysporum f. sp. pisi HDV247 GN=FOVG_15307 PE=4 SV=1
35 : W9ZE30_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 W9ZE30 Uncharacterized protein OS=Fusarium oxysporum f. sp. melonis 26406 GN=FOMG_14090 PE=4 SV=1
36 : X0CEY7_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 X0CEY7 Uncharacterized protein OS=Fusarium oxysporum f. sp. raphani 54005 GN=FOQG_07504 PE=4 SV=1
37 : X0FWT3_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 X0FWT3 Uncharacterized protein OS=Fusarium oxysporum f. sp. radicis-lycopersici 26381 GN=FOCG_06277 PE=4 SV=1
38 : X0HV93_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 X0HV93 Uncharacterized protein OS=Fusarium oxysporum f. sp. conglutinans race 2 54008 GN=FOPG_06044 PE=4 SV=1
39 : X0ISM4_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 X0ISM4 Uncharacterized protein OS=Fusarium oxysporum f. sp. cubense tropical race 4 54006 GN=FOIG_14951 PE=4 SV=1
40 : X0LL68_FUSOX 0.35 0.52 8 54 7 71 65 3 18 879 X0LL68 Uncharacterized protein OS=Fusarium oxysporum f. sp. vasinfectum 25433 GN=FOTG_10197 PE=4 SV=1
41 : C7Z9E1_NECH7 0.34 0.47 8 59 2 78 77 3 25 906 C7Z9E1 Predicted protein OS=Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI) GN=NECHADRAFT_100913 PE=4 SV=1
42 : F9G2N7_FUSOF 0.34 0.47 8 59 2 78 77 3 25 921 F9G2N7 Uncharacterized protein OS=Fusarium oxysporum (strain Fo5176) GN=FOXB_12919 PE=4 SV=1
43 : J9N541_FUSO4 0.34 0.47 8 59 2 78 77 3 25 921 J9N541 Uncharacterized protein OS=Fusarium oxysporum f. sp. lycopersici (strain 4287 / CBS 123668 / FGSC 9935 / NRRL 34936) GN=FOXG_10302 PE=4 SV=1
44 : K3VI46_FUSPC 0.34 0.50 8 59 2 69 68 3 16 824 K3VI46 ZBC1 OS=Fusarium pseudograminearum (strain CS3096) GN=ZBC1 PE=4 SV=1
45 : K3VXN7_FUSPC 0.34 0.47 8 59 2 78 77 3 25 920 K3VXN7 Uncharacterized protein OS=Fusarium pseudograminearum (strain CS3096) GN=FPSE_10603 PE=4 SV=1
46 : N1RZV4_FUSC4 0.34 0.47 8 59 2 78 77 3 25 920 N1RZV4 Regulatory protein alcR OS=Fusarium oxysporum f. sp. cubense (strain race 4) GN=FOC4_g10009416 PE=4 SV=1
47 : N4TTV5_FUSC1 0.34 0.47 8 59 2 78 77 3 25 921 N4TTV5 Regulatory protein alcR OS=Fusarium oxysporum f. sp. cubense (strain race 1) GN=FOC1_g10008310 PE=4 SV=1
48 : S0E228_GIBF5 0.34 0.47 8 59 2 78 77 3 25 930 S0E228 Related to regulatory protein alcR OS=Gibberella fujikuroi (strain CBS 195.34 / IMI 58289 / NRRL A-6831) GN=FFUJ_07661 PE=4 SV=1
49 : W7MP93_GIBM7 0.34 0.47 8 59 2 78 77 3 25 924 W7MP93 Uncharacterized protein OS=Gibberella moniliformis (strain M3125 / FGSC 7600) GN=FVEG_08953 PE=4 SV=1
50 : W9IDF4_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 W9IDF4 Uncharacterized protein OS=Fusarium oxysporum FOSC 3-a GN=FOYG_08471 PE=4 SV=1
51 : W9K5F0_FUSOX 0.34 0.47 8 59 2 78 77 3 25 920 W9K5F0 Uncharacterized protein OS=Fusarium oxysporum Fo47 GN=FOZG_09759 PE=4 SV=1
52 : W9LLU3_FUSOX 0.34 0.47 8 59 2 78 77 3 25 920 W9LLU3 Uncharacterized protein OS=Fusarium oxysporum f. sp. lycopersici MN25 GN=FOWG_13455 PE=4 SV=1
53 : W9Q0Y1_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 W9Q0Y1 Uncharacterized protein OS=Fusarium oxysporum f. sp. pisi HDV247 GN=FOVG_06530 PE=4 SV=1
54 : X0AHG1_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 X0AHG1 Uncharacterized protein OS=Fusarium oxysporum f. sp. melonis 26406 GN=FOMG_07053 PE=4 SV=1
55 : X0BY56_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 X0BY56 Uncharacterized protein OS=Fusarium oxysporum f. sp. raphani 54005 GN=FOQG_09409 PE=4 SV=1
56 : X0H774_FUSOX 0.34 0.47 8 59 2 78 77 3 25 920 X0H774 Uncharacterized protein OS=Fusarium oxysporum f. sp. radicis-lycopersici 26381 GN=FOCG_04625 PE=4 SV=1
57 : X0HEM7_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 X0HEM7 Uncharacterized protein OS=Fusarium oxysporum f. sp. conglutinans race 2 54008 GN=FOPG_10339 PE=4 SV=1
58 : X0JKQ7_FUSOX 0.34 0.47 8 59 2 78 77 3 25 920 X0JKQ7 Uncharacterized protein OS=Fusarium oxysporum f. sp. cubense tropical race 4 54006 GN=FOIG_10672 PE=4 SV=1
59 : X0LN83_FUSOX 0.34 0.47 8 59 2 78 77 3 25 921 X0LN83 Uncharacterized protein OS=Fusarium oxysporum f. sp. vasinfectum 25433 GN=FOTG_09845 PE=4 SV=1
60 : G8XZQ9_PICSO 0.33 0.60 1 65 84 146 70 3 12 974 G8XZQ9 Piso0_005711 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0_005711 PE=4 SV=1
61 : K3V9R3_FUSPC 0.33 0.44 8 59 2 73 72 3 20 845 K3V9R3 Uncharacterized protein OS=Fusarium pseudograminearum (strain CS3096) GN=FPSE_09446 PE=4 SV=1
62 : B6QQE4_PENMQ 0.32 0.48 7 57 3 73 71 3 20 1571 B6QQE4 Putative uncharacterized protein OS=Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333) GN=PMAA_041210 PE=4 SV=1
63 : C7ZD87_NECH7 0.32 0.54 7 59 2 70 69 3 16 797 C7ZD87 Putative uncharacterized protein OS=Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI) GN=NECHADRAFT_105211 PE=4 SV=1
64 : W9YGY5_9EURO 0.32 0.53 7 64 12 89 78 3 20 912 W9YGY5 Uncharacterized protein OS=Capronia epimyces CBS 606.96 GN=A1O3_01969 PE=4 SV=1
65 : H6C1L0_EXODN 0.31 0.47 7 58 14 93 80 3 28 925 H6C1L0 Putative uncharacterized protein OS=Exophiala dermatitidis (strain ATCC 34100 / CBS 525.76 / NIH/UT8656) GN=HMPREF1120_06608 PE=4 SV=1
## ALIGNMENTS 1 - 65
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 -1 A G 0 0 95 2 53 A
2 0 A S + 0 0 79 2 73 A
3 1 A M S S+ 0 0 193 13 11 MMMMMM MM M M M V
4 2 A A S S- 0 0 95 15 59 AEEEEE DE EG AA P S
5 3 A D + 0 0 55 15 73 DDDDAA TD AS GA V R
6 4 A T S S+ 0 0 92 20 87 TSSFHH QF PRTTTTT TP G K
7 5 A R - 0 0 138 30 49 RRRRRRRRRRRLSSSSSSR RRR MR R KKRR
8 6 A R + 0 0 195 63 9 RRRRRRRRRRRRRRRRRRR RRRRQRRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRRSKRRRR
9 7 A R + 0 0 89 64 14 RRRRRRRRRRRRRKKKKKR RRRRRRKRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRKRKQRR
10 8 A Q + 0 0 148 64 18 QQQQQQQQQQQQQQQQQQQ QQQQRTQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQVSQTQQ
11 9 A N - 0 0 26 64 54 NLLLHHNHLHHHYNNNNNN HNNNRANNHNNNNNNNNNNNNNQNNNNNNNNNNNNNNNSHFKHH
12 10 A H + 0 0 63 64 58 HHHHHHRHHRRRRRRRRRK RHHKRIRHSHHHHHHHHHHRRRYRRRRRRRRRRRRRRRRYHLSS
13 11 A S S S- 0 0 6 63 22 SSSSSSSS.SSSSCCCCCS SSSSSACSSSSSSSSSSSSSSSASSSSSSSSSSSSSSSAASASS
14 12 A C >> - 0 0 0 65 4 CCCCCCCC.CCCCCCCCCCCCCCCCNCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
15 13 A D H 3> S+ 0 0 11 65 7 DDDDDDDD.DDDDDDDDDDDDDDDDPRDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
16 14 A P H 3> S+ 0 0 22 66 54 PPPPPPQPSQQQQQQQQQQCCQHHQRYQQQHQHHHHHHHHQQQQQQQQQQQQQQQQQQQSQRQQQ
17 15 A C H <4>S+ 0 0 0 66 2 CCCCCCCCCCCCCCCCCCCCCCCCCSCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
18 16 A R H ><5S+ 0 0 87 66 3 RRRRRRRRDRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
19 17 A K H 3<5S+ 0 0 132 65 25 KKK.KKKKPKKKKKKKKKFSSKKKAKRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRQRKK
20 18 A G T 3<5S- 0 0 37 65 51 GGG.GGGGCGGGSGGGGGGAAGSSGGRGAGSASSSSSSSSAAASAAAAAAAAAAAAAAAKSSSAA
21 19 A K T < 5S+ 0 0 192 65 9 KKK.KKKKRKKKKKKKKKKKKKKKKCKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKK
22 20 A R < - 0 0 83 66 28 RRRKRRRRKRRRRRRRRRRRRRKKREIRRRKRKKKKKKKKRRRRRRRRRRRRRRRRRRRVRRRRR
23 21 A R - 0 0 226 66 33 RGGGAAAAGAAAAAAAAARAAAAARTRAAAAAAAAAAAAAAAAGAAAAAAAAAAAAAAARGAAAA
24 22 A C - 0 0 24 66 9 CCCKCCCCKCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
25 23 A D S S+ 0 0 102 63 10 D..RDDDDRDDDDDDDDDDDDDDDDK.DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
26 24 A A - 0 0 1 63 40 A..GAAaAGaagAAAAAAVLLvggVR.aaagaggggggggaaaaaaaaaaaaaaaaaaaAaaaaa
27 25 A P > - 0 0 50 60 81 P..CPPiPCaap.AAAAAQPP.rrER.llmrsrrrrrrrrrqqprqqqqqqqqqqrqqq.pkpqh
28 26 A E T 3 S+ 0 0 98 61 89 E.DDAAVADDDD.IIIIIVLL.NNVH.DRRNVNNNNNNNNNRRENRRRRRRRRRRNRRR.ARDAQ
29 27 A N T 3> + 0 0 92 62 81 N.AARRGRAEEDPLLLLLSSS.EEAM.TEEEREEEEEEEESNNDSNNNNNNNNNNSNNN.SGHEH
30 28 A R H <> S+ 0 0 46 62 87 R.PPRRKRPLLICEEEEEITT.KKVR.NAQKRKKKKKKKKESSMESSSSSSSSSSESSS.SIDHH
31 29 A N H > S+ 0 0 111 63 81 N.KKDDTDKEELLDDDDDQII.PPHCSKATPSPPPPPPPPREEDREEEEEEEEEEREEE.ANSEQ
32 30 A E H > S+ 0 0 86 64 90 E.DERRSRERRADTTTTTHII.AAEDGPRHALAAAAAAAALRRPLRRRRRRRRRRLRRREGPFHR
33 31 A A H X>S+ 0 0 11 65 85 ADRRHHADRSSHPLLLLLLSS.AAIEFCRLVLVVVVVVVVRLLIRLLLLLLLLLLRLLLLNFSFS
34 32 A N H <5S+ 0 0 65 65 86 NAQQAASRQSSKTLLLLLRNN.SSPNEGGDSASSSSSSSSNRRRNRRRRRRRRRRNRRRNNSNHI
35 33 A E H <5S+ 0 0 135 65 77 EPKTDDvhTnnpreeeeegNN.ppaFSHstpdppppppppgnndgnnnnnnnnnngnnnnraghn
36 34 A N H <5S- 0 0 121 65 78 NSSSAAnsSqqndpppppsLLpsssPSPsdssssssssssleepleeeeeeeeeeeeee.spphl
37 35 A G T <5S+ 0 0 62 65 66 GTTAgggDAaaagtttttkPPgggdQQtdpgnggggggggaeepaeeeeeeeeeeeeee.pgata
38 36 A W < + 0 0 53 65 86 WFFFnnaTFnnprlllllpLLllls.Dlyalyllllllllaaasaaaaaaaaaaaaaaavhtlvv
39 37 A V + 0 0 70 65 94 VSSSIIGISHHDSAAAAASSSGKKS.GTRAKRKKKKKKKKLLLSLLLLLLLLLLLLLLLKKLQAT
40 38 A S S S- 0 0 23 65 69 STTSPPPTSSSPPSSSSSPPPPSSP.RASKSPSSSSSSSSRRRPRRRRRRRRRRRRRRRISAPRR
41 39 A C > - 0 0 0 66 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
42 40 A S H > S+ 0 0 62 66 18 SSSSSSSSSSSSSGGGGGSSSSSSTHSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSST
43 41 A N H >>S+ 0 0 17 66 65 NNNNNNNNNNNNYNNNNNNMMNYYNNNNYNYYYYYYYYYYYYYYYYYYYYYYYYYYYYYNYLYNN
44 42 A C I >>S+ 0 0 7 66 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
45 43 A K I <5S+ 0 0 133 66 88 KTTTRRRRTRRKVEEEEEKKKKIIKTVDLQILIIIIIIIILLLILLLLLLLLLLLLLLLVAETQQ
46 44 A R I <5S+ 0 0 183 66 31 RRRRKKRKRKKKRKKKKKKLLRRRKRRKKKRKRRRRRRRRRRRKRRRRRRRRRRRRRRRKKKKKR
47 45 A W I <5S- 0 0 145 66 71 WWWWYYWYWYYWTTTTTTWRRTTTWHFTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTATAA
48 46 A N I << + 0 0 135 66 62 NKKKNNRNKKKENKKKKKKGGGKKKENKNGKSKKKKKKKKRRRHRRRRRRRRRRRRRRRNNNKGG
49 47 A K < - 0 0 74 66 27 KKKKRRKRKRRKKKKKKKKTTKKKKFQKKRKKKKKKKKKKKKKKKKKKKKKKKKKKKKKSKKKRR
50 48 A D - 0 0 99 66 82 DEEDEEEEDKKTRTTTTTTDDNVVARESHNVHVVVVVVVVQQQSQQQQQQQQQQQQQQQDTKQNT
51 49 A C + 0 0 16 66 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
52 50 A T - 0 0 19 66 37 TTTTTTTTTTTTTTTTTTTIITSSTSMTTTSTSSSSSSSSTTTTTTTTTTTTTTTTTTTTTSTTT
53 51 A F > + 0 0 11 66 22 FFFFFFFFFFFFMFFFFFIVVLLLIYFFLFLLLLLLLLLLFFFMFFFFFFFFFFFFFFFFMFMFF
54 52 A N H > + 0 0 70 66 61 NNNNNNKNNDDNEEEEEEDIIQNNENTDDDNNNNNNNNNNHHHNHHHHHHHHHHHHHHHSNTNED
55 53 A W H 4 S+ 0 0 17 54 7 WWWWWWWWWWWWWWWWWWWWWW W PWWW W WWWWWWWWWWWWWWWWWWWRWWWWW
56 54 A L H >4 S+ 0 0 5 54 53 LIIILLLLILLLALLLLLIRRL I VLAL A VVVAVVVVVVVVVVVVVVVVALALL
57 55 A S H 3< S+ 0 0 1 52 68 SSSSVVHSSLL RRRRRRRAAR K SRR R RRRWRRRRRRRRRRRRRRRPWTWEE
58 56 A S T 3< S+ 0 0 85 51 54 SSSSEESESSS SSSSSSSSSA S SSS S AAATAAAAAAAAAAAAAAALS SCS
59 57 A Q S < S- 0 0 61 50 48 QKKKNNVNKHH QQQQQQRKKV H QQQ Q QQQQQQQQQQQQQQQQQQQKQ QA
60 58 A R S S- 0 0 185 25 44 RRRRRRQRRKK RRRRRKQQH Q R R L
61 59 A S S S- 0 0 86 25 81 SVVAAATAAEE VVVVVSSSR R V G P
62 60 A K + 0 0 135 25 70 KDDDAAKADSS SSSSSKSSA R S P K
63 61 A N S S+ 0 0 91 25 77 ASSSAASASRR QQQQQGRRS Q Q S A
64 62 A S 0 0 69 24 82 KNRRRRARRHH AAAAATNN S A K N
65 63 A S 0 0 155 21 50 GGGSAA GSAA SSSSSSAA T G
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
165535 A 0 0 0 0 0 0 0 50 50 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0.693 23 0.47
2 0 A 0 0 0 0 0 0 0 0 50 0 50 0 0 0 0 0 0 0 0 0 2 0 0 0.693 23 0.27
3 1 A 8 0 0 92 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0.271 9 0.88
4 2 A 0 0 0 0 0 0 0 7 27 7 7 0 0 0 0 0 0 47 0 7 15 0 0 1.430 47 0.41
5 3 A 7 0 0 0 0 0 0 7 27 0 7 7 0 0 7 0 0 0 0 40 15 0 0 1.622 54 0.27
6 4 A 0 0 0 0 10 0 0 5 0 10 10 40 0 10 5 5 5 0 0 0 20 0 0 1.887 62 0.12
7 5 A 0 3 0 3 0 0 0 0 0 0 20 0 0 0 67 7 0 0 0 0 30 0 0 0.999 33 0.51
8 6 A 0 0 0 0 0 0 0 0 0 0 2 0 0 0 94 3 2 0 0 0 63 0 0 0.302 10 0.90
9 7 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 86 13 2 0 0 0 64 0 0 0.455 15 0.86
10 8 A 2 0 0 0 0 0 0 0 0 0 2 3 0 0 2 0 92 0 0 0 64 0 0 0.378 12 0.81
11 9 A 0 6 0 0 2 0 2 0 2 0 2 0 0 17 2 2 2 0 66 0 64 0 0 1.207 40 0.46
12 10 A 0 2 2 0 0 0 3 0 0 0 5 0 0 36 50 3 0 0 0 0 64 1 0 1.204 40 0.42
13 11 A 0 0 0 0 0 0 0 0 8 0 83 0 10 0 0 0 0 0 0 0 63 0 0 0.583 19 0.77
14 12 A 0 0 0 0 0 0 0 0 0 0 0 0 98 0 0 0 0 0 2 0 65 0 0 0.079 2 0.95
15 13 A 0 0 0 0 0 0 0 0 0 2 0 0 0 0 2 0 0 0 0 97 65 0 0 0.159 5 0.92
16 14 A 0 0 0 0 0 0 2 0 0 12 3 0 3 17 3 0 61 0 0 0 66 0 0 1.239 41 0.46
17 15 A 0 0 0 0 0 0 0 0 0 0 2 0 98 0 0 0 0 0 0 0 66 0 0 0.079 2 0.98
18 16 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 98 0 0 0 0 2 66 1 0 0.079 2 0.96
19 17 A 0 0 0 0 2 0 0 0 2 2 3 0 0 0 5 86 2 0 0 0 65 0 0 0.634 21 0.75
20 18 A 0 0 0 0 0 0 0 34 37 0 25 0 2 0 2 2 0 0 0 0 65 0 0 1.272 42 0.48
21 19 A 0 0 0 0 0 0 0 0 0 0 0 0 2 0 3 95 0 0 0 0 65 0 0 0.216 7 0.91
22 20 A 2 0 2 0 0 0 0 0 0 0 0 0 0 0 76 20 0 2 0 0 66 0 0 0.721 24 0.72
23 21 A 0 0 0 0 0 0 0 9 80 0 0 2 0 0 9 0 0 0 0 0 66 0 0 0.676 22 0.66
24 22 A 0 0 0 0 0 0 0 0 0 0 0 0 97 0 0 3 0 0 0 0 66 3 0 0.136 4 0.90
25 23 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 2 0 0 0 95 63 0 0 0.222 7 0.89
26 24 A 5 3 0 0 0 0 0 22 68 0 0 0 0 0 2 0 0 0 0 0 63 3 43 0.915 30 0.59
27 25 A 0 3 2 2 0 0 0 0 12 18 2 0 3 2 25 2 28 2 0 0 60 0 0 1.902 63 0.19
28 26 A 7 3 8 0 0 0 0 0 8 0 0 0 0 2 30 0 2 5 23 13 61 0 0 1.948 65 0.11
29 27 A 0 8 0 2 0 0 0 3 6 2 11 2 0 3 6 0 0 26 27 3 62 0 0 2.039 68 0.19
30 28 A 2 3 5 2 0 0 0 0 2 5 26 3 2 3 11 19 2 13 2 2 62 0 0 2.269 75 0.12
31 29 A 0 3 3 0 0 0 0 0 3 17 5 3 2 2 5 6 3 29 5 14 63 0 0 2.230 74 0.19
32 30 A 0 6 3 0 2 0 0 3 19 5 2 8 0 5 34 0 0 9 0 5 64 0 0 2.052 68 0.09
33 31 A 14 37 3 0 5 0 0 0 8 2 9 0 2 5 11 0 0 2 2 3 65 0 0 2.054 68 0.14
34 32 A 0 8 2 0 0 0 0 3 6 2 23 2 0 2 28 2 5 2 17 2 65 0 0 2.062 68 0.14
35 33 A 2 0 0 0 2 0 0 8 3 20 3 5 0 5 3 2 0 11 32 6 65 1 51 2.094 69 0.22
36 34 A 0 8 0 0 0 0 0 0 3 17 34 0 0 2 0 0 3 25 6 3 65 0 0 1.767 58 0.22
37 35 A 0 0 0 0 0 0 0 29 14 8 0 14 0 0 0 2 3 25 2 5 65 1 54 1.827 60 0.34
38 36 A 5 32 0 0 6 3 3 0 31 3 3 3 0 2 2 0 0 0 6 2 65 0 0 1.941 64 0.13
39 37 A 3 29 5 0 0 0 0 5 11 0 15 3 0 3 3 20 2 0 0 2 65 0 0 2.050 68 0.06
40 38 A 0 0 2 0 0 0 0 0 3 20 37 5 0 0 32 2 0 0 0 0 65 0 0 1.432 47 0.31
41 39 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 66 0 0 0.000 0 1.00
42 40 A 0 0 0 0 0 0 0 8 0 0 88 3 0 2 0 0 0 0 0 0 66 0 0 0.478 15 0.81
43 41 A 0 2 0 3 0 0 53 0 0 0 0 0 0 0 0 0 0 0 42 0 66 0 0 0.870 29 0.34
44 42 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 66 0 0 0.000 0 1.00
45 43 A 5 30 18 0 0 0 0 0 2 0 0 9 0 0 9 12 5 9 0 2 66 0 0 1.989 66 0.12
46 44 A 0 3 0 0 0 0 0 0 0 0 0 0 0 0 62 35 0 0 0 0 66 0 0 0.769 25 0.68
47 45 A 0 0 0 0 2 15 8 0 6 0 0 64 0 2 3 0 0 0 2 0 66 0 0 1.235 41 0.29
48 46 A 0 0 0 0 0 0 0 9 0 0 2 0 0 2 29 39 0 3 17 0 66 0 0 1.475 49 0.38
49 47 A 0 0 0 0 2 0 0 0 0 0 2 3 0 0 12 80 2 0 0 0 66 0 0 0.728 24 0.72
50 48 A 17 0 0 0 0 0 0 0 2 0 3 14 0 3 3 5 29 11 5 11 66 0 0 2.067 69 0.17
51 49 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 66 0 0 0.000 0 1.00
52 50 A 0 0 3 2 0 0 0 0 0 0 20 76 0 0 0 0 0 0 0 0 66 0 0 0.700 23 0.62
53 51 A 3 21 3 6 65 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 66 0 0 1.053 35 0.77
54 52 A 0 0 3 0 0 0 0 0 0 0 2 3 0 27 0 2 2 12 39 11 66 0 0 1.617 53 0.39
55 53 A 0 0 0 0 0 96 0 0 0 2 0 0 0 0 2 0 0 0 0 0 54 0 0 0.184 6 0.92
56 54 A 37 37 11 0 0 0 0 0 11 0 0 0 0 0 4 0 0 0 0 0 54 0 0 1.346 44 0.46
57 55 A 4 4 0 0 0 6 0 0 4 2 15 2 0 2 56 2 0 4 0 0 52 0 0 1.583 52 0.31
58 56 A 0 2 0 0 0 0 0 0 37 0 51 2 2 0 0 0 0 6 0 0 51 0 0 1.109 37 0.46
59 57 A 4 0 0 0 0 0 0 0 2 0 0 0 0 6 2 14 66 0 6 0 50 0 0 1.172 39 0.51
60 58 A 0 4 0 0 0 0 0 0 0 0 0 0 0 4 64 12 16 0 0 0 25 0 0 1.091 36 0.55
61 59 A 32 0 0 0 0 0 0 4 20 4 20 4 0 0 8 0 0 8 0 0 25 0 0 1.799 60 0.18
62 60 A 0 0 0 0 0 0 0 0 16 4 40 0 0 0 4 20 0 0 0 16 25 0 0 1.532 51 0.29
63 61 A 0 0 0 0 0 0 0 4 20 0 28 0 0 0 16 0 28 0 4 0 25 0 0 1.585 52 0.22
64 62 A 0 0 0 0 0 0 0 0 29 0 8 4 0 8 25 8 0 0 17 0 24 0 0 1.758 58 0.17
65 63 A 0 0 0 0 0 0 0 24 29 0 43 5 0 0 0 0 0 0 0 0 21 0 0 1.208 40 0.49
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
5 36 36 10 gSRRVLAESNLn
6 36 36 10 gSRRVLAESNLn
7 21 57 1 aAi
7 30 67 5 vSSSTSn
7 32 74 8 gSPETAIPAa
8 34 34 11 hTGEVSSSNQASs
10 21 24 2 aLLa
10 30 35 4 nTAARq
10 32 41 1 aYn
11 21 24 2 aLLa
11 30 35 4 nTAARq
11 32 41 1 aYn
12 25 25 2 gKAp
12 34 36 8 pSSQPPPGLn
12 36 46 10 aAGHGELDIRAp
13 31 43 7 rRTVVNNFd
13 33 52 1 gQr
14 31 63 7 eTSRTGASp
14 33 72 10 tVFHYSDVYGPl
15 31 63 7 eTSRTGASp
15 33 72 10 tVFHYSDVYGPl
16 31 63 7 eTSRTGASp
16 33 72 10 tVFHYSDVYGPl
17 31 63 7 eTSRTGASp
17 33 72 10 tVFHYSDVYGPl
18 31 63 7 eTSRTGASp
18 33 72 10 tVFHYSDVYGPl
19 30 59 8 gIDPLAFGQs
19 32 69 10 kDTETSTVFSHp
22 19 28 5 vAFAKEp
22 21 35 8 gANSDDIAAl
23 20 26 2 gYIr
23 29 37 6 pDATSPAs
23 31 45 10 gSQIYGGSEPGl
24 20 26 2 gYIr
24 29 37 6 pDATSPAs
24 31 45 10 gSQIYGGSEPGl
25 30 37 8 aVSGLGFPSs
25 32 47 9 dEHEATNIFAs
28 20 54 7 aAILEDTLl
28 31 72 10 tVFHYSDVFGPl
29 25 25 2 aPSl
29 34 36 8 sVPESWRSDs
29 36 46 10 dNNDFLPGEFEy
30 21 32 2 aAFm
30 30 43 10 tVAGTASIARYd
30 32 55 8 pHQGKPRAPa
31 20 26 2 gYIr
31 29 37 6 pDATSPTs
31 31 45 10 gSQIYGGSEPGl
32 20 21 5 aPSLREs
32 29 35 11 dSDKSDALLGRDs
32 31 48 10 nANGASIPETEy
33 20 26 2 gYIr
33 29 37 6 pDATSPNs
33 31 45 10 gSQIYGGSEPGl
34 20 26 2 gYIr
34 29 37 6 pDATSPTs
34 31 45 10 gSQIYGGSEPGl
35 20 26 2 gYIr
35 29 37 6 pDATSPAs
35 31 45 10 gSQIYGGSEPGl
36 20 26 2 gYIr
36 29 37 6 pDATSPTs
36 31 45 10 gSQIYGGSEPGl
37 20 26 2 gYIr
37 29 37 6 pDATSPTs
37 31 45 10 gSQIYGGSEPGl
38 20 26 2 gYIr
38 29 37 6 pDAMSPTs
38 31 45 10 gSQIYGGSEPGl
39 20 26 2 gYIr
39 29 37 6 pDATSPAs
39 31 45 10 gSQIYGGSEPGl
40 20 26 2 gYIr
40 29 37 6 pDATSPAs
40 31 45 10 gSQIYGGSEPGl
41 20 21 7 aPSLWDIQr
41 29 37 8 gADGANGVSl
41 31 47 10 aEEHLDEIDSRa
42 20 21 6 aPSLWDIq
42 29 36 11 nGGDSTSGTSLAe
42 31 49 8 eHLDEIDSRa
43 20 21 6 aPSLWDIq
43 29 36 11 nGGDSTSGTSLAe
43 31 49 8 eHIDEIDSRa
44 20 21 5 aPPLEYp
44 29 35 8 dGKLIVGEKp
44 31 45 3 pLVQs
45 20 21 7 aPSLWDIQr
45 29 37 8 gSESASGASl
45 31 47 10 aEEHFDEIDSRa
46 20 21 6 aPSLWDIq
46 29 36 11 nSGDSTSGTSLAe
46 31 49 8 eHLDEIDSRa
47 20 21 6 aPSLWDIq
47 29 36 11 nGGDSTSGTSLAe
47 31 49 8 eHLDEIDSRa
48 20 21 6 aPSLWDIq
48 29 36 11 nGGDSTSGTSLAe
48 31 49 8 eHLDEIDSRa
49 20 21 6 aPSLWDIq
49 29 36 11 nGGDSTSGTSLAe
49 31 49 8 eHLDEIDSRa
50 20 21 6 aPSLWDIq
50 29 36 11 nGGDSTSGTSLAe
50 31 49 8 eHIDEIDSRa
51 20 21 6 aPSLWDIq
51 29 36 11 nGGDSTSGTSLAe
51 31 49 8 eHIDEIDSRa
52 20 21 6 aPSLWDIq
52 29 36 11 nGGDSTSGTSLAe
52 31 49 8 eHIDEIDSRa
53 20 21 6 aPSLWDIq
53 29 36 11 nGGDSTSGTSLAe
53 31 49 8 eHLDEIDSRa
54 20 21 6 aPSLWDIq
54 29 36 11 nGGDSTSGTSLAe
54 31 49 8 eHIDEIDSRa
55 20 21 6 aPSLWDIq
55 29 36 11 nGGDSTSGTSLAe
55 31 49 8 eHLDEIDSRa
56 20 21 7 aPSLWDIQr
56 29 37 10 gGDSTSGTSLAe
56 31 49 8 eHIDEIDSRa
57 20 21 6 aPSLWDIq
57 29 36 11 nGGDSTSGTSLAe
57 31 49 8 eHLDEIDSRa
58 20 21 6 aPSLWDIq
58 29 36 11 nSGDSTSGTSLAe
58 31 49 8 eHLDEIDSRa
59 20 21 6 aPSLWDIq
59 29 36 11 nGGDSTSGTSLAe
59 31 49 8 eHLDEIDSRa
60 31 114 5 nTLQTVv
61 20 21 5 aPPLLGp
61 29 35 8 rVNGDDSVSs
61 31 45 7 pASASESDh
62 21 23 1 aAk
62 30 33 10 aTNQQLPVLLPp
62 32 45 9 gGSSGEALPPt
63 21 22 5 aPPLELp
63 30 36 7 gTIVLGKRp
63 32 45 4 aSETEl
64 21 32 2 aAFq
64 30 43 9 hASPVASTPRh
64 32 54 9 tAHQGKPQIPv
65 21 34 7 aAFKSEHQh
65 30 50 11 nHSNGKTTPQRRl
65 32 63 10 aHHHQDKLQVPv
//