Complet list of 1vd4 hssp file
Complete list of 1vd4.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1VD4
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-11
HEADER TRANSCRIPTION 18-MAR-04 1VD4
COMPND MOL_ID: 1; MOLECULE: TRANSCRIPTION INITIATION FACTOR IIE, ALPHA SUBUNI
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR M.OKUDA,A.TANAKA,Y.ARAI,M.SATOH,H.OKAMURA,A.NAGADOI, F.HANAOKA,Y.OHKUM
DBREF 1VD4 A 113 174 UNP P29083 T2EA_HUMAN 113 174
SEQLENGTH 62
NCHAIN 1 chain(s) in 1VD4 data set
NALIGN 156
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : D2HTQ9_AILME 1.00 1.00 1 62 113 174 62 0 0 438 D2HTQ9 Uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=LOC100475747 PE=4 SV=1
2 : E2R3P2_CANFA 1.00 1.00 1 62 113 174 62 0 0 460 E2R3P2 Uncharacterized protein OS=Canis familiaris GN=GTF2E1 PE=4 SV=2
3 : F6WDT3_HORSE 1.00 1.00 1 62 113 174 62 0 0 436 F6WDT3 Uncharacterized protein OS=Equus caballus GN=GTF2E1 PE=4 SV=1
4 : F7GNV1_CALJA 1.00 1.00 1 62 113 174 62 0 0 439 F7GNV1 General transcription factor IIE subunit 1 OS=Callithrix jacchus GN=GTF2E1 PE=2 SV=1
5 : G1QZ47_NOMLE 1.00 1.00 1 62 113 174 62 0 0 439 G1QZ47 Uncharacterized protein OS=Nomascus leucogenys GN=GTF2E1 PE=4 SV=1
6 : G1SIB1_RABIT 1.00 1.00 1 62 113 174 62 0 0 438 G1SIB1 Uncharacterized protein OS=Oryctolagus cuniculus GN=GTF2E1 PE=4 SV=1
7 : G3S0U4_GORGO 1.00 1.00 1 62 113 174 62 0 0 439 G3S0U4 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101131077 PE=4 SV=1
8 : G5AU87_HETGA 1.00 1.00 1 62 113 174 62 0 0 440 G5AU87 General transcription factor IIE subunit 1 OS=Heterocephalus glaber GN=GW7_00974 PE=4 SV=1
9 : G9K3K9_MUSPF 1.00 1.00 1 62 113 174 62 0 0 441 G9K3K9 Proteinral transcription factor IIE, polypeptide 1, alpha 56kDa (Fragment) OS=Mustela putorius furo PE=2 SV=1
10 : H0XJE8_OTOGA 1.00 1.00 1 62 113 174 62 0 0 439 H0XJE8 Uncharacterized protein OS=Otolemur garnettii GN=GTF2E1 PE=4 SV=1
11 : H2QN66_PANTR 1.00 1.00 1 62 113 174 62 0 0 439 H2QN66 General transcription factor IIE, polypeptide 1, alpha 56kDa OS=Pan troglodytes GN=GTF2E1 PE=2 SV=1
12 : I3LMA9_PIG 1.00 1.00 1 62 113 174 62 0 0 438 I3LMA9 Uncharacterized protein OS=Sus scrofa GN=LOC100624061 PE=4 SV=1
13 : L9L2F8_TUPCH 1.00 1.00 1 62 113 174 62 0 0 438 L9L2F8 General transcription factor IIE subunit 1 OS=Tupaia chinensis GN=TREES_T100020387 PE=4 SV=1
14 : M3WU46_FELCA 1.00 1.00 1 62 113 174 62 0 0 438 M3WU46 Uncharacterized protein OS=Felis catus GN=GTF2E1 PE=4 SV=1
15 : M3YA33_MUSPF 1.00 1.00 1 62 113 174 62 0 0 439 M3YA33 Uncharacterized protein OS=Mustela putorius furo GN=GTF2E1 PE=4 SV=1
16 : Q53F88_HUMAN 1.00 1.00 1 62 113 174 62 0 0 357 Q53F88 General transcription factor IIE, polypeptide 1 (Alpha subunit, 56kD) variant (Fragment) OS=Homo sapiens PE=2 SV=1
17 : S9XYS8_9CETA 1.00 1.00 1 62 113 174 62 0 0 438 S9XYS8 General transcription factor IIE subunit 1 isoform 1 OS=Camelus ferus GN=CB1_001373028 PE=4 SV=1
18 : T2EA_HUMAN 1VD4 1.00 1.00 1 62 113 174 62 0 0 439 P29083 General transcription factor IIE subunit 1 OS=Homo sapiens GN=GTF2E1 PE=1 SV=2
19 : T2EA_PONAB 1.00 1.00 1 62 113 174 62 0 0 439 Q5R8H5 General transcription factor IIE subunit 1 OS=Pongo abelii GN=GTF2E1 PE=2 SV=1
20 : U6CSZ9_NEOVI 1.00 1.00 1 62 113 174 62 0 0 438 U6CSZ9 General transcription factor IIE subunit 1 OS=Neovison vison GN=T2EA PE=2 SV=1
21 : F1MGT5_BOVIN 0.98 1.00 1 62 113 174 62 0 0 439 F1MGT5 General transcription factor IIE subunit 1 OS=Bos taurus GN=GTF2E1 PE=4 SV=2
22 : F7GV34_MACMU 0.98 0.98 1 62 113 174 62 0 0 439 F7GV34 General transcription factor IIE subunit 1 OS=Macaca mulatta GN=GTF2E1 PE=2 SV=1
23 : G7MKF6_MACMU 0.98 0.98 1 62 113 174 62 0 0 439 G7MKF6 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_11338 PE=4 SV=1
24 : G7NXQ1_MACFA 0.98 0.98 1 62 113 174 62 0 0 439 G7NXQ1 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_10388 PE=4 SV=1
25 : H0VF39_CAVPO 0.98 0.98 1 62 113 174 62 0 0 431 H0VF39 Uncharacterized protein OS=Cavia porcellus GN=GTF2E1 PE=4 SV=1
26 : L8HL99_9CETA 0.98 1.00 1 62 113 174 62 0 0 445 L8HL99 General transcription factor IIE subunit 1 OS=Bos mutus GN=M91_19056 PE=4 SV=1
27 : T2EA_BOVIN 0.98 1.00 1 62 113 174 62 0 0 438 A6QLI8 General transcription factor IIE subunit 1 OS=Bos taurus GN=GTF2E1 PE=2 SV=1
28 : W5QFC9_SHEEP 0.98 1.00 1 62 113 174 62 0 0 438 W5QFC9 Uncharacterized protein OS=Ovis aries GN=GTF2E1 PE=4 SV=1
29 : G3HAI6_CRIGR 0.97 0.98 1 62 113 174 62 0 0 438 G3HAI6 General transcription factor IIE subunit 1 OS=Cricetulus griseus GN=I79_007440 PE=4 SV=1
30 : G3T4L1_LOXAF 0.97 0.98 1 62 113 174 62 0 0 438 G3T4L1 Uncharacterized protein OS=Loxodonta africana GN=GTF2E1 PE=4 SV=1
31 : K9J6F6_DESRO 0.97 0.98 1 62 113 174 62 0 0 434 K9J6F6 Putative transcription initiation factor iie alpha subunit (Fragment) OS=Desmodus rotundus PE=2 SV=1
32 : L5L477_PTEAL 0.97 1.00 1 62 113 174 62 0 0 438 L5L477 General transcription factor IIE subunit 1 OS=Pteropus alecto GN=PAL_GLEAN10006529 PE=4 SV=1
33 : Q8BV40_MOUSE 0.97 0.98 1 62 94 155 62 0 0 421 Q8BV40 Putative uncharacterized protein OS=Mus musculus GN=Gtf2e1 PE=2 SV=1
34 : T2EA_MOUSE 0.97 0.98 1 62 113 174 62 0 0 440 Q9D0D5 General transcription factor IIE subunit 1 OS=Mus musculus GN=Gtf2e1 PE=2 SV=1
35 : F7CJV3_ORNAN 0.95 0.97 1 62 113 175 63 1 1 298 F7CJV3 Uncharacterized protein OS=Ornithorhynchus anatinus GN=GTF2E1 PE=4 SV=2
36 : G1PC90_MYOLU 0.94 0.97 1 62 113 174 62 0 0 438 G1PC90 Uncharacterized protein OS=Myotis lucifugus GN=GTF2E1 PE=4 SV=1
37 : H0ZT21_TAEGU 0.94 0.97 1 62 113 174 62 0 0 442 H0ZT21 Uncharacterized protein OS=Taeniopygia guttata GN=GTF2E1 PE=4 SV=1
38 : L5MDW3_MYODS 0.94 0.97 1 62 113 174 62 0 0 438 L5MDW3 General transcription factor IIE subunit 1 OS=Myotis davidii GN=MDA_GLEAN10018592 PE=4 SV=1
39 : Q6DFT2_XENTR 0.94 0.95 1 62 113 174 62 0 0 435 Q6DFT2 General transcription factor IIE, polypeptide 1, alpha 56kDa OS=Xenopus tropicalis GN=gtf2e1 PE=2 SV=1
40 : Q7ZTQ1_XENLA 0.94 0.95 1 62 113 174 62 0 0 263 Q7ZTQ1 Tfiiealpha protein OS=Xenopus laevis GN=tfiiealpha PE=2 SV=1
41 : Q91859_XENLA 0.94 0.95 1 62 113 174 62 0 0 433 Q91859 Transcription factor IIE OS=Xenopus laevis GN=gtf2e1 PE=4 SV=1
42 : S7N861_MYOBR 0.94 0.97 1 62 113 174 62 0 0 438 S7N861 General transcription factor IIE subunit 1 OS=Myotis brandtii GN=D623_10031800 PE=4 SV=1
43 : U3IFR3_ANAPL 0.94 0.97 1 62 113 174 62 0 0 437 U3IFR3 Uncharacterized protein OS=Anas platyrhynchos GN=GTF2E1 PE=4 SV=1
44 : U3JRU8_FICAL 0.94 0.97 1 62 113 174 62 0 0 442 U3JRU8 Uncharacterized protein OS=Ficedula albicollis GN=GTF2E1 PE=4 SV=1
45 : W5MGT1_LEPOC 0.94 0.97 1 62 115 176 62 0 0 442 W5MGT1 Uncharacterized protein (Fragment) OS=Lepisosteus oculatus PE=4 SV=1
46 : C0H900_SALSA 0.92 0.97 1 62 113 174 62 0 0 454 C0H900 General transcription factor IIE subunit 1 OS=Salmo salar GN=T2EA PE=2 SV=1
47 : G3V992_RAT 0.92 0.98 1 62 113 174 62 0 0 438 G3V992 General transcription factor II E, polypeptide 1 (Alpha subunit) OS=Rattus norvegicus GN=Gtf2e1 PE=4 SV=1
48 : K7G0N8_PELSI 0.92 0.95 1 62 113 174 62 0 0 439 K7G0N8 Uncharacterized protein OS=Pelodiscus sinensis GN=GTF2E1 PE=4 SV=1
49 : Q28GU2_XENTR 0.92 0.94 1 62 113 174 62 0 0 325 Q28GU2 General transcription factor IIE, polypeptide 1, alpha 56kDa (Fragment) OS=Xenopus tropicalis GN=gtf2e1 PE=2 SV=1
50 : Q4FZQ9_RAT 0.92 0.98 3 62 1 60 60 0 0 324 Q4FZQ9 Gtf2e1 protein (Fragment) OS=Rattus norvegicus GN=Gtf2e1 PE=2 SV=1
51 : V9KTS0_CALMI 0.92 0.97 1 62 113 174 62 0 0 429 V9KTS0 General transcription factor IIE subunit 1 OS=Callorhynchus milii PE=2 SV=1
52 : F6PGB5_MONDO 0.90 0.94 1 62 122 183 62 0 0 446 F6PGB5 Uncharacterized protein OS=Monodelphis domestica GN=GTF2E1 PE=4 SV=2
53 : Q5SQE1_DANRE 0.90 0.98 1 62 113 174 62 0 0 447 Q5SQE1 Novel protein similar to vertebrate general transcription factor IIE polypeptide 1 alpha 56kDa (GTF2E1) OS=Danio rerio GN=gtf2e1 PE=4 SV=1
54 : V8P5M0_OPHHA 0.90 0.97 1 62 113 174 62 0 0 442 V8P5M0 General transcription factor IIE subunit 1 (Fragment) OS=Ophiophagus hannah GN=GTF2E1 PE=4 SV=1
55 : W5L681_ASTMX 0.90 0.97 1 62 113 174 62 0 0 442 W5L681 Uncharacterized protein OS=Astyanax mexicanus PE=4 SV=1
56 : F1NTL4_CHICK 0.89 0.95 1 62 113 174 62 0 0 441 F1NTL4 Uncharacterized protein OS=Gallus gallus GN=GTF2E1 PE=4 SV=2
57 : G1NN89_MELGA 0.89 0.95 1 62 113 174 62 0 0 441 G1NN89 Uncharacterized protein OS=Meleagris gallopavo GN=GTF2E1 PE=4 SV=2
58 : G3NWX0_GASAC 0.89 0.93 1 55 113 167 55 0 0 444 G3NWX0 Uncharacterized protein OS=Gasterosteus aculeatus PE=4 SV=1
59 : H2T3F2_TAKRU 0.88 0.91 1 58 90 147 58 0 0 414 H2T3F2 Uncharacterized protein (Fragment) OS=Takifugu rubripes GN=LOC101079645 PE=4 SV=1
60 : I3J965_ORENI 0.88 0.91 1 58 113 170 58 0 0 446 I3J965 Uncharacterized protein OS=Oreochromis niloticus GN=LOC100699283 PE=4 SV=1
61 : H9GHD6_ANOCA 0.87 0.92 1 62 38 99 62 0 0 365 H9GHD6 Uncharacterized protein OS=Anolis carolinensis GN=GTF2E1 PE=4 SV=2
62 : H2T3F1_TAKRU 0.85 0.88 1 58 113 172 60 1 2 439 H2T3F1 Uncharacterized protein OS=Takifugu rubripes GN=LOC101079645 PE=4 SV=1
63 : Q4T5L9_TETNG 0.84 0.88 1 58 113 170 58 0 0 439 Q4T5L9 Chromosome 2 SCAF9198, whole genome shotgun sequence. (Fragment) OS=Tetraodon nigroviridis GN=GTF2E1 PE=4 SV=1
64 : H2N1N7_ORYLA 0.83 0.88 1 58 113 170 58 0 0 441 H2N1N7 Uncharacterized protein OS=Oryzias latipes GN=LOC101173377 PE=4 SV=1
65 : M4AAW7_XIPMA 0.83 0.88 1 58 121 178 58 0 0 455 M4AAW7 Uncharacterized protein OS=Xiphophorus maculatus PE=4 SV=1
66 : B9EKG9_MOUSE 0.81 0.92 1 62 113 174 62 0 0 439 B9EKG9 Expressed sequence AU015228 OS=Mus musculus GN=AU015228 PE=2 SV=1
67 : Q3TQQ1_MOUSE 0.81 0.92 1 62 113 174 62 0 0 439 Q3TQQ1 MCG52242 OS=Mus musculus GN=AU015228 PE=2 SV=1
68 : S4RLL6_PETMA 0.77 0.89 1 62 113 174 62 0 0 261 S4RLL6 Uncharacterized protein OS=Petromyzon marinus PE=4 SV=1
69 : S4RLL9_PETMA 0.77 0.89 1 62 113 174 62 0 0 308 S4RLL9 Uncharacterized protein OS=Petromyzon marinus PE=4 SV=1
70 : S4RLM1_PETMA 0.77 0.89 1 62 113 174 62 0 0 306 S4RLM1 Uncharacterized protein OS=Petromyzon marinus PE=4 SV=1
71 : G1NG37_MELGA 0.76 0.89 1 62 113 174 62 0 0 429 G1NG37 Uncharacterized protein OS=Meleagris gallopavo GN=LOC100550389 PE=4 SV=1
72 : H0Z9Q4_TAEGU 0.76 0.89 1 62 113 174 62 0 0 425 H0Z9Q4 Uncharacterized protein OS=Taeniopygia guttata PE=4 SV=1
73 : R0LH27_ANAPL 0.76 0.89 1 62 113 174 62 0 0 421 R0LH27 General transcription factor IIE subunit 1 (Fragment) OS=Anas platyrhynchos GN=Anapl_04571 PE=4 SV=1
74 : E1BY73_CHICK 0.74 0.89 1 62 113 174 62 0 0 426 E1BY73 Uncharacterized protein OS=Gallus gallus GN=LOC428778 PE=4 SV=2
75 : U3K9S1_FICAL 0.74 0.85 1 62 113 174 62 0 0 426 U3K9S1 Uncharacterized protein (Fragment) OS=Ficedula albicollis PE=4 SV=1
76 : K7GHN9_PELSI 0.73 0.87 1 62 113 174 62 0 0 419 K7GHN9 Uncharacterized protein OS=Pelodiscus sinensis PE=4 SV=1
77 : M7BM62_CHEMY 0.73 0.87 1 62 113 174 62 0 0 422 M7BM62 General transcription factor IIE subunit 1 OS=Chelonia mydas GN=UY3_09739 PE=4 SV=1
78 : Q16MT1_AEDAE 0.73 0.85 1 62 113 174 62 0 0 423 Q16MT1 AAEL012202-PA OS=Aedes aegypti GN=AAEL012202 PE=4 SV=1
79 : B0XL91_CULQU 0.71 0.87 1 62 113 174 62 0 0 425 B0XL91 Transcription initiation factor IIE subunit alpha OS=Culex quinquefasciatus GN=CpipJ_CPIJ020226 PE=4 SV=1
80 : B3NGY8_DROER 0.71 0.85 1 62 127 188 62 0 0 429 B3NGY8 GG15520 OS=Drosophila erecta GN=Dere\GG15520 PE=4 SV=1
81 : B4HEW9_DROSE 0.71 0.85 1 62 127 188 62 0 0 429 B4HEW9 GM25289 OS=Drosophila sechellia GN=Dsec\GM25289 PE=4 SV=1
82 : B4PFM4_DROYA 0.71 0.85 1 62 127 188 62 0 0 429 B4PFM4 GE21836 OS=Drosophila yakuba GN=Dyak\GE21836 PE=4 SV=1
83 : B4QQ94_DROSI 0.71 0.85 1 62 127 188 62 0 0 429 B4QQ94 GD14320 OS=Drosophila simulans GN=Dsim\GD14320 PE=4 SV=1
84 : C3Z2E0_BRAFL 0.71 0.89 1 62 113 174 62 0 0 429 C3Z2E0 Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_260843 PE=4 SV=1
85 : E9J600_SOLIN 0.71 0.87 1 62 149 210 62 0 0 454 E9J600 Putative uncharacterized protein (Fragment) OS=Solenopsis invicta GN=SINV_01444 PE=4 SV=1
86 : F4X6I2_ACREC 0.71 0.87 1 62 114 175 62 0 0 418 F4X6I2 General transcription factor IIE subunit 1 OS=Acromyrmex echinatior GN=G5I_14130 PE=4 SV=1
87 : G6DD83_DANPL 0.71 0.89 1 62 113 174 62 0 0 413 G6DD83 Uncharacterized protein OS=Danaus plexippus GN=KGM_03878 PE=4 SV=1
88 : O96880_DROME 0.71 0.85 1 62 127 188 62 0 0 429 O96880 GH11150p OS=Drosophila melanogaster GN=TfIIEalpha PE=2 SV=1
89 : Q0P4H8_XENTR 0.71 0.85 1 62 113 174 62 0 0 421 Q0P4H8 General transcription factor IIE, polypeptide 1, alpha 56kDa, gene 2 OS=Xenopus tropicalis GN=gtf2e1.2 PE=2 SV=1
90 : S4PX37_9NEOP 0.71 0.87 1 62 113 174 62 0 0 420 S4PX37 General transcription factor IIE subunit 1 OS=Pararge aegeria PE=4 SV=1
91 : W8ARR2_CERCA 0.71 0.84 1 62 116 177 62 0 0 428 W8ARR2 General transcription factor IIE subunit 1 OS=Ceratitis capitata GN=T2EA PE=2 SV=1
92 : B3M4A6_DROAN 0.69 0.87 1 62 127 188 62 0 0 429 B3M4A6 GF25281 OS=Drosophila ananassae GN=Dana\GF25281 PE=4 SV=1
93 : B4HAH7_DROPE 0.69 0.87 2 62 1 61 61 0 0 303 B4HAH7 GL16279 OS=Drosophila persimilis GN=Dper\GL16279 PE=4 SV=1
94 : B4KUS0_DROMO 0.69 0.85 1 62 127 188 62 0 0 431 B4KUS0 GI13717 OS=Drosophila mojavensis GN=Dmoj\GI13717 PE=4 SV=1
95 : B4LCQ5_DROVI 0.69 0.85 1 62 127 188 62 0 0 431 B4LCQ5 GJ14060 OS=Drosophila virilis GN=Dvir\GJ14060 PE=4 SV=1
96 : B4N4M3_DROWI 0.69 0.89 1 62 130 191 62 0 0 432 B4N4M3 GK10470 OS=Drosophila willistoni GN=Dwil\GK10470 PE=4 SV=1
97 : B7T4I0_DROAI 0.69 0.87 1 62 85 146 62 0 0 307 B7T4I0 CG104150-like protein (Fragment) OS=Drosophila affinis PE=4 SV=1
98 : D6WNN0_TRICA 0.69 0.85 1 62 114 175 62 0 0 404 D6WNN0 Putative uncharacterized protein OS=Tribolium castaneum GN=TcasGA2_TC013870 PE=4 SV=1
99 : E2BAG8_HARSA 0.69 0.85 1 62 114 175 62 0 0 419 E2BAG8 General transcription factor IIE subunit 1 OS=Harpegnathos saltator GN=EAI_06101 PE=4 SV=1
100 : H3ALB9_LATCH 0.69 0.90 1 61 113 173 61 0 0 426 H3ALB9 Uncharacterized protein OS=Latimeria chalumnae PE=4 SV=1
101 : H9KKE8_APIME 0.69 0.85 1 62 114 175 62 0 0 419 H9KKE8 Uncharacterized protein OS=Apis mellifera GN=TfIIEalpha PE=4 SV=1
102 : N6UBP6_DENPD 0.69 0.85 1 62 113 174 62 0 0 404 N6UBP6 Uncharacterized protein (Fragment) OS=Dendroctonus ponderosae GN=YQE_07439 PE=4 SV=1
103 : Q2LZ72_DROPS 0.69 0.87 1 62 126 187 62 0 0 429 Q2LZ72 GA10304 OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA10304 PE=4 SV=1
104 : R4FRA4_RHOPR 0.69 0.90 1 62 114 175 62 0 0 408 R4FRA4 Putative transcription initiation factor iie (Fragment) OS=Rhodnius prolixus PE=2 SV=1
105 : T1IC26_RHOPR 0.69 0.90 1 62 114 175 62 0 0 426 T1IC26 Uncharacterized protein OS=Rhodnius prolixus PE=4 SV=1
106 : U4U9V1_DENPD 0.69 0.85 1 62 113 174 62 0 0 404 U4U9V1 Uncharacterized protein OS=Dendroctonus ponderosae GN=D910_08061 PE=4 SV=1
107 : B4J2J0_DROGR 0.68 0.85 1 62 129 190 62 0 0 438 B4J2J0 GH14864 OS=Drosophila grimshawi GN=Dgri\GH14864 PE=4 SV=1
108 : E0VIK4_PEDHC 0.68 0.82 1 62 112 173 62 0 0 408 E0VIK4 Predicted protein OS=Pediculus humanus subsp. corporis GN=Phum_PHUM228670 PE=4 SV=1
109 : E2AHX6_CAMFO 0.68 0.87 1 62 114 175 62 0 0 420 E2AHX6 General transcription factor IIE subunit 1 OS=Camponotus floridanus GN=EAG_15323 PE=4 SV=1
110 : K7IUT8_NASVI 0.68 0.82 1 62 114 175 62 0 0 411 K7IUT8 Uncharacterized protein OS=Nasonia vitripennis PE=4 SV=1
111 : T1DT40_ANOAQ 0.68 0.87 1 62 75 136 62 0 0 318 T1DT40 Putative transcription initiation factor (Fragment) OS=Anopheles aquasalis PE=2 SV=1
112 : W5JJE3_ANODA 0.68 0.87 1 62 114 175 62 0 0 436 W5JJE3 Transcription initiation factor IIE, alpha subunit OS=Anopheles darlingi GN=AND_003765 PE=4 SV=1
113 : H9ISX7_BOMMO 0.66 0.85 1 62 113 174 62 0 0 421 H9ISX7 Uncharacterized protein OS=Bombyx mori PE=4 SV=1
114 : Q5TR07_ANOGA 0.66 0.84 1 62 113 174 62 0 0 434 Q5TR07 AGAP006355-PA OS=Anopheles gambiae GN=AGAP006355 PE=4 SV=3
115 : T1J924_STRMM 0.66 0.85 1 62 112 173 62 0 0 389 T1J924 Uncharacterized protein OS=Strigamia maritima PE=4 SV=1
116 : W4WX21_ATTCE 0.66 0.82 1 62 105 169 65 1 3 412 W4WX21 Uncharacterized protein OS=Atta cephalotes PE=4 SV=1
117 : T1GWN2_MEGSC 0.65 0.85 1 62 59 120 62 0 0 284 T1GWN2 Uncharacterized protein OS=Megaselia scalaris PE=4 SV=1
118 : V4BKR8_LOTGI 0.65 0.87 1 62 113 174 62 0 0 404 V4BKR8 Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_106483 PE=4 SV=1
119 : H2ZEG5_CIOSA 0.62 0.78 1 60 115 174 60 0 0 384 H2ZEG5 Uncharacterized protein OS=Ciona savignyi PE=4 SV=1
120 : F6RRS1_CIOIN 0.61 0.81 1 62 63 124 62 0 0 343 F6RRS1 Uncharacterized protein (Fragment) OS=Ciona intestinalis GN=Cin.31714 PE=4 SV=2
121 : M7BZU2_CHEMY 0.61 0.71 1 62 113 166 62 1 8 224 M7BZU2 General transcription factor IIE subunit 1 OS=Chelonia mydas GN=UY3_09173 PE=4 SV=1
122 : E4X5Q3_OIKDI 0.60 0.85 1 62 122 183 62 0 0 404 E4X5Q3 Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_7 OS=Oikopleura dioica GN=GSOID_T00002630001 PE=4 SV=1
123 : J9JQN6_ACYPI 0.58 0.77 1 62 113 174 62 0 0 412 J9JQN6 Uncharacterized protein OS=Acyrthosiphon pisum GN=LOC100161371 PE=4 SV=1
124 : L7M8Y0_9ACAR 0.58 0.87 1 62 114 175 62 0 0 411 L7M8Y0 Putative transcription factor iiealpha OS=Rhipicephalus pulchellus PE=2 SV=1
125 : T1G7S2_HELRO 0.58 0.71 1 62 113 174 62 0 0 217 T1G7S2 Uncharacterized protein OS=Helobdella robusta GN=HELRODRAFT_90493 PE=4 SV=1
126 : E9G1T1_DAPPU 0.56 0.73 1 62 111 174 64 1 2 413 E9G1T1 Putative uncharacterized protein OS=Daphnia pulex GN=DAPPUDRAFT_44338 PE=4 SV=1
127 : T1II69_STRMM 0.56 0.73 1 54 134 188 55 1 1 388 T1II69 Uncharacterized protein OS=Strigamia maritima PE=4 SV=1
128 : B7P1M1_IXOSC 0.55 0.84 1 62 114 175 62 0 0 425 B7P1M1 Transcription initiation factor iie, alpha subunit, putative OS=Ixodes scapularis GN=IscW_ISCW015578 PE=4 SV=1
129 : T1JQC6_TETUR 0.55 0.74 1 62 108 169 62 0 0 403 T1JQC6 Uncharacterized protein OS=Tetranychus urticae PE=4 SV=1
130 : V5IIZ1_IXORI 0.55 0.84 1 62 114 175 62 0 0 406 V5IIZ1 Proteinral transcription factor iie subunit 1 apis mellifera OS=Ixodes ricinus PE=2 SV=1
131 : T1JQE5_TETUR 0.52 0.71 7 62 8 63 56 0 0 146 T1JQE5 Uncharacterized protein OS=Tetranychus urticae PE=4 SV=1
132 : T2MGU0_HYDVU 0.52 0.78 1 54 121 174 54 0 0 482 T2MGU0 General transcription factor IIE subunit 1 (Fragment) OS=Hydra vulgaris GN=GTF2E1 PE=2 SV=1
133 : W4Z4F9_STRPU 0.52 0.80 1 61 117 177 61 0 0 436 W4Z4F9 Uncharacterized protein OS=Strongylocentrotus purpuratus GN=Sp-Gtf2E1 PE=4 SV=1
134 : R7TXU0_CAPTE 0.48 0.82 1 62 117 178 62 0 0 419 R7TXU0 Uncharacterized protein OS=Capitella teleta GN=CAPTEDRAFT_176545 PE=4 SV=1
135 : A8WMX3_CAEBR 0.44 0.68 1 62 903 964 62 0 0 1226 A8WMX3 Protein CBG00366 OS=Caenorhabditis briggsae GN=CBG00366 PE=3 SV=2
136 : C1LH57_SCHJA 0.44 0.75 1 62 155 217 63 1 1 510 C1LH57 Transcription factor IIE OS=Schistosoma japonicum GN=TfIIEalpha PE=2 SV=1
137 : C1LH58_SCHJA 0.44 0.75 1 62 155 217 63 1 1 510 C1LH58 Transcription factor IIE OS=Schistosoma japonicum GN=TfIIEalpha PE=2 SV=1
138 : U6HLG1_ECHMU 0.44 0.81 1 62 126 187 62 0 0 477 U6HLG1 General transcription factor iie subunit 1 OS=Echinococcus multilocularis GN=EmuJ_000470300 PE=4 SV=1
139 : U6J7Y2_ECHGR 0.44 0.81 1 62 126 187 62 0 0 477 U6J7Y2 General transcription factor IIE subunit OS=Echinococcus granulosus GN=EGR_01033 PE=4 SV=1
140 : J0WXG4_AURDE 0.43 0.67 2 55 115 168 54 0 0 489 J0WXG4 Uncharacterized protein OS=Auricularia delicata (strain TFB10046) GN=AURDEDRAFT_116120 PE=4 SV=1
141 : E3NBB7_CAERE 0.42 0.61 1 62 127 188 62 0 0 439 E3NBB7 Putative uncharacterized protein OS=Caenorhabditis remanei GN=CRE_12281 PE=4 SV=1
142 : U6IRC9_HYMMI 0.42 0.71 1 62 125 186 62 0 0 476 U6IRC9 General transcription factor iie subunit 1 OS=Hymenolepis microstoma GN=HmN_000353900 PE=4 SV=1
143 : I1FC00_AMPQE 0.41 0.76 1 54 67 120 54 0 0 374 I1FC00 Uncharacterized protein OS=Amphimedon queenslandica PE=4 SV=1
144 : I1FC01_AMPQE 0.41 0.76 1 54 117 170 54 0 0 424 I1FC01 Uncharacterized protein OS=Amphimedon queenslandica GN=LOC100634699 PE=4 SV=1
145 : A7SES2_NEMVE 0.40 0.71 1 62 120 181 62 0 0 421 A7SES2 Predicted protein OS=Nematostella vectensis GN=v1g115691 PE=4 SV=1
146 : B3S9K5_TRIAD 0.40 0.61 1 62 106 167 62 0 0 394 B3S9K5 Putative uncharacterized protein (Fragment) OS=Trichoplax adhaerens GN=TRIADDRAFT_14507 PE=4 SV=1
147 : H2KP13_CLOSI 0.40 0.75 1 62 71 133 63 1 1 423 H2KP13 Transcription initiation factor TFIIE subunit alpha OS=Clonorchis sinensis GN=CLF_101185 PE=4 SV=1
148 : R7SM49_DICSQ 0.39 0.63 2 55 114 167 54 0 0 484 R7SM49 Uncharacterized protein OS=Dichomitus squalens (strain LYAD-421) GN=DICSQDRAFT_112614 PE=4 SV=1
149 : E5SGT5_TRISP 0.38 0.64 1 53 78 128 53 1 2 355 E5SGT5 General transcription factor IIE subunit 1 OS=Trichinella spiralis GN=Tsp_03661 PE=4 SV=1
150 : G0MRB8_CAEBE 0.37 0.60 1 62 125 185 62 1 1 450 G0MRB8 Putative uncharacterized protein OS=Caenorhabditis brenneri GN=CAEBREN_20920 PE=4 SV=1
151 : G5EG49_CAEEL 0.35 0.66 1 62 126 187 62 0 0 433 G5EG49 Protein ZK550.4 OS=Caenorhabditis elegans GN=TFIIE-alpha PE=4 SV=1
152 : M2RA39_CERS8 0.35 0.69 1 62 112 173 62 0 0 484 M2RA39 Uncharacterized protein OS=Ceriporiopsis subvermispora (strain B) GN=CERSUDRAFT_116392 PE=4 SV=1
153 : F1L4D0_ASCSU 0.34 0.66 1 62 124 185 62 0 0 404 F1L4D0 Cre-flp-8 protein OS=Ascaris suum GN=ASU_08670 PE=2 SV=1
154 : G4TQY8_PIRID 0.34 0.65 1 62 123 184 62 0 0 559 G4TQY8 Related to TFA1-TFIIE subunit (Transcription initiation factor), 66 kD OS=Piriformospora indica (strain DSM 11827) GN=PIIN_07686 PE=4 SV=1
155 : J9EKF6_WUCBA 0.34 0.63 1 62 126 187 62 0 0 200 J9EKF6 Uncharacterized protein OS=Wuchereria bancrofti GN=WUBG_06144 PE=4 SV=1
156 : S8E0I0_FOMPI 0.34 0.69 2 62 113 173 61 0 0 484 S8E0I0 Uncharacterized protein OS=Fomitopsis pinicola (strain FP-58527) GN=FOMPIDRAFT_1126886 PE=4 SV=1
## ALIGNMENTS 1 - 70
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 113 A R 0 0 305 151 21 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR RRRRRRRRRRRRRRRRRKKK
2 114 A I - 0 0 148 155 30 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIILLL
3 115 A E + 0 0 138 156 6 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
4 116 A T + 0 0 81 156 47 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTMMTTT
5 117 A D + 0 0 99 156 43 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
6 118 A E S S- 0 0 160 156 22 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
7 119 A R S S+ 0 0 207 157 14 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
8 120 A D S S+ 0 0 113 157 37 DDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDDNNDDDDDDDDDDDDNDDNDDDDDDDDDDDDDDDNNDDD
9 121 A S - 0 0 59 157 59 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
10 122 A T - 0 0 120 157 30 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
11 123 A N S S+ 0 0 122 157 59 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSSSNNN
12 124 A R S S+ 0 0 111 157 13 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
13 125 A A S S+ 0 0 10 157 29 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
14 126 A S E +A 25 0A 14 157 29 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
15 127 A F E -AB 24 51A 4 157 1 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
16 128 A K E -AB 23 50A 67 157 43 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKRKRRRKKRRRRRRRRKKRRR
17 129 A C - 0 0 2 157 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
18 130 A P S S+ 0 0 92 157 51 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
19 131 A V S S+ 0 0 92 157 88 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVNNNVIICCVINVTTCICIICCCICCCCVVDDD
20 132 A C S S- 0 0 41 157 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
21 133 A S + 0 0 77 157 91 SSSSSSSSSSSSSSSSSSSSSCCCCSSSCKSSCCFFFFCCCFFFFFCFCCSFTSHFFFSFFSFSFCCSSS
22 134 A S - 0 0 31 157 63 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSNSSSSNSNSSSNSSSSSSSSNSNSNSSSSSSSSSSSSKKK
23 135 A T E -A 16 0A 63 157 30 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTTTTT
24 136 A F E -A 15 0A 7 157 3 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
25 137 A T E >> -A 14 0A 51 157 31 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
26 138 A D H >> S+ 0 0 61 157 24 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
27 139 A L H 34 S+ 0 0 108 157 3 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
28 140 A E H <> S+ 0 0 47 157 8 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDDEEE
29 141 A A H > -C 39 0B 47 157 6 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDEDDDEDDDDDDDDDDDEEDDD
35 147 A P T 45S+ 0 0 116 157 80 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPLLPPPPPPPPIPSMHPPLLL
36 148 A M T 45S+ 0 0 149 157 82 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMRRRMMMRRRMMMMMMMRMNMRRMMMKMMMMTTAAA
37 149 A T T 45S- 0 0 65 157 43 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTKTTTTIISSS
38 150 A G T <5S+ 0 0 41 157 55 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGgGGGGGGGGGGGGGGGGGGGGGGGGGGgGGGGGGGG
39 151 A T E - 0 0 30 157 16 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNNDDD
53 165 A E T 3 S+ 0 0 192 157 59 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAEEAASEEEEEE
54 166 A S T 3 S+ 0 0 109 156 47 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSLSSSSSSSSSSSSSSSSPPSSS
55 167 A A < + 0 0 44 152 45 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAVVVVVAVVVVAAAAA
56 168 A M S S- 0 0 87 149 61 MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMLVMTMMLMVMVMM CCMCCCCMMVVV
57 169 A P + 0 0 116 149 27 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP PPPPPPPPPPPP
58 170 A K - 0 0 142 149 44 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK DDKDDDDKKKKK
59 171 A K - 0 0 100 143 50 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK K KKKKK
60 172 A D - 0 0 127 143 19 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD D DDDDD
61 173 A A 0 0 92 142 53 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA A AAAAA
62 174 A R 0 0 292 140 6 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR R RRRRR
## ALIGNMENTS 71 - 140
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....8....:....9....:....0....:....1....:....2....:....3....:....4
1 113 A R 0 0 305 151 21 KKKKKKKRRRRRRKRRRRKRRR RRRRRRKRRRRRRRRRRRRRRKRRKRRRKKKKKKKKK KKKKRRRR
2 114 A I - 0 0 148 155 30 IIIIIIIMMMMMMIMMMMIMMMMMMMMLMIMLMMMLMMMMMMLMIMMIIIIIMIIMIIII LIILLLLLI
3 115 A E + 0 0 138 156 6 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEEEEEED
4 116 A T + 0 0 81 156 47 AAAAAAATTTTTTTTTTTSTTTTTTTTTTATTTTTTTTTTTTTTTTTMTTTSTTTTTTTT NNTSAAAAK
5 117 A D + 0 0 99 156 43 DDDDDDDEEEEEEDEEEEDEEEEEEEEEEDEEEEEEEEEEEEEEEEEERRDQQEMSDEEE EREREEEEK
6 118 A E S S- 0 0 160 156 22 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEEQQQQL
7 119 A R S S+ 0 0 207 157 14 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRQRRQRRRRR
8 120 A D S S+ 0 0 113 157 37 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDDNDDDDQDDMQQQQN
9 121 A S - 0 0 59 157 59 SSSSSSSAAAAAASAAAASAAAAAAAAAASAAAAAAAAAAAAAASAANSSSNASMASSSSSASNDSSTTE
10 122 A T - 0 0 120 157 30 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNTSTTTTTTTTTKTTTTTSSL
11 123 A N S S+ 0 0 122 157 59 TTTTTTTSSSSSSNSSSSTSSSSSSSSSSISSSSSSSSSSSSSSSSSSNNNNSSISnSKSKNCHNSSSSD
12 124 A R S S+ 0 0 111 157 13 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRrRRRRRKRRRRRRN
13 125 A A S S+ 0 0 10 157 29 SSSSSSSAAAAAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAPPAAAAAAAASASPAAAAAAAK
14 126 A S E +A 25 0A 14 157 29 SSSSFSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSMSSSSSLSHSSLLG
15 127 A F E -AB 24 51A 4 157 1 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFYFFFFY
16 128 A K E -AB 23 50A 67 157 43 KKKKKKKKKKKKKLKKKKKKKKKKKKKKKQKKKKKKKKKKKKKKKKRKKKKIKRRHKRIRIQKQKKKRRI
17 129 A C - 0 0 2 157 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
18 130 A P S S+ 0 0 92 157 51 PPPPPPPPPSSSSPPPPSPPTSSSSSSPTPTLSPPLSPPTPPPSPPVPPPPPVTPPPTVTVETSNSSFFP
19 131 A V S S+ 0 0 92 157 88 SSSSSSSASTATAMSSSSGSNMMTTMMGTANAMKKAANSSSSANQSQSTTIMQGRCQGNGNQEQTSSSSR
20 132 A C S S- 0 0 41 157 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
21 133 A S + 0 0 77 157 91 SSSSSFFKNSSSSGLLGSLGDSSSSSSLLLLLSAALGLLLNNGKQLNQQGFKFSHKKDKDKSTQQNNNNS
22 134 A S - 0 0 31 157 63 SNSSSSSKKKKKKKKKKKSKKKKKKKKKKSKKKKKKKKKKKKKKKKKKNNSNKKKKKKKKKKRKSTTSSH
23 135 A T E -A 16 0A 63 157 30 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTTTTTTTTTTTTTLTTATTTTTSTNTSTSTTTSTTTAAC
24 136 A F E -A 15 0A 7 157 3 YYYYYYYFFFFFFFFFFFYFFFFFFFFFFYFFFFFFFFFFFFFFFFFYYYFYFFFFFFFFFFYYYYYYYY
25 137 A T E >> -A 14 0A 51 157 31 TTTTTTTTTTTTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSSTTTTTTGTTTTSNTDTTTTT
26 138 A D H >> S+ 0 0 61 157 24 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDDDDDEDLDDDDP
27 139 A L H 34 S+ 0 0 108 157 3 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFLLLLLLL
28 140 A E H <> S+ 0 0 47 157 8 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEDEEQEEEEEEEEEEDEDEEEEE
29 141 A A H > -C 39 0B 47 157 6 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDDDDDDDDDDDDD
35 147 A P T 45S+ 0 0 116 157 80 VAVVAIIFFMMMMPMMMMPMFMMMMMMYMPMFMPPFVMMMFFMFAMFFPPPPFMSPRFHFHFPFPFFPPF
36 148 A M T 45S+ 0 0 149 157 82 FFFFFFFTAAAAALRRMAFMNAAAAAATTHTMAIIMAFRANSAAMRAMIMMMQAAAKAVAVSIAETTLLA
37 149 A T T 45S- 0 0 65 157 43 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTSSSTTTTTTTTTTTTMFTSTSTDTSSSSTTSTNNTTT
38 150 A G T <5S+ 0 0 41 157 55 EGEEEEEGGLLLLGGGQLEQGQLCCCLSGGGGLAAGCQDEDDQSNGQQGGVGNGQpGGNGNGMGGppGGQ
39 151 A T E - 0 0 30 157 16 DDDEDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDEDEDDDEDIDDDGDDEDEEEEN
53 165 A E T 3 S+ 0 0 192 157 59 AGAAGAASSSSSSQQQQSSQSSSSSSSLQTQSSLLSSQQSSSMSEQMAAKSATQVEEQPQPVEEEEEEEE
54 166 A S T 3 S+ 0 0 109 156 47 SSSSSSSSSAAAASSSSASSSAAAAAASSSSSASSSASSSSSSSSSSTSESSSGYSNDDDDSSASNNDDS
55 167 A A < + 0 0 44 152 45 AAAAAAAAAAAAARAAAASAAAAAAAAAAGAAAAAAAAAAAAAASAASASPAVADA GVGV ASANNNNA
56 168 A M S S- 0 0 87 149 61 LFLLFLLLLMMMMELLLMLLMMMMMMMLLLLLMLLLMLLLLLLLLLLMSATGLLKI LLLL VEVAAAA
57 169 A P + 0 0 116 149 27 PPPPPPPPPPPPPQPPPPPPPPPPPPPPPAPPPPPPPPPPPPPPPPPPNSKPPPNP PPPP PPPSSQQ
58 170 A K - 0 0 142 149 44 KKKKKKKKKKKKKEKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKTNQKQRHK RKRK KRSRRRR
59 171 A K - 0 0 100 143 50 RHRRHRRKKKKKKKKKKKRKKKKKKKKKKRKKKKKKKKKKKKKKQKKQTQQTQQFT QAQA RRRTTSS
60 172 A D - 0 0 127 143 19 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDNDDQDDDDD DDDD DDTDDDD
61 173 A A 0 0 92 142 53 AAAAAAASSSSSSSSSSSASSSSSSSSSSASSSSSSSSSSSSSSSSSA AQSSSSS SSSS ASTAAAA
62 174 A R 0 0 292 140 6 RRRRRRRRRRRRRRRRRRRRRRRRRRRRR RRRRRRRRRRRRRRRRRR RRRRRRR RRRR RRRRRR
## ALIGNMENTS 141 - 156
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....5....:....6....:....7....:....8....:....9....:....0....:....1
1 113 A R 0 0 305 151 21 KKRRKRR RKKRRQR
2 114 A I - 0 0 148 155 30 LLIIIILILLLILILI
3 115 A E + 0 0 138 156 6 EEQQEQEDEEEDEEED
4 116 A T + 0 0 81 156 47 SAAASNAQSSSSVDVS
5 117 A D + 0 0 99 156 43 REEEDEEGQRRRKSKG
6 118 A E S S- 0 0 160 156 22 EQEEEEQLQEELDLDL
7 119 A R S S+ 0 0 207 157 14 QRRRKSRRKQQRKKKR
8 120 A D S S+ 0 0 113 157 37 MQTTQFQNSMMNDADN
9 121 A S - 0 0 59 157 59 DNVVASTEVDDEEDEQ
10 122 A T - 0 0 120 157 30 TTSSKRTLTTTLVLVL
11 123 A N S S+ 0 0 122 157 59 NCNNNNSDNNNDHDHD
12 124 A R S S+ 0 0 111 157 13 RRRRRRRSRRRNKNKN
13 125 A A S S+ 0 0 10 157 29 AATTPPAKSAAKAKAK
14 126 A S E +A 25 0A 14 157 29 HLSSSSSGSHHGSGSG
15 127 A F E -AB 24 51A 4 157 1 YFYYFFFYFYYYYYYY
16 128 A K E -AB 23 50A 67 157 43 KRNNVYRLKKRVKVRI
17 129 A C - 0 0 2 157 0 CCCCCCCCCCCCCCCC
18 130 A P S S+ 0 0 92 157 51 NQPPSPVPTLGPSPTP
19 131 A V S S+ 0 0 92 157 88 ASNNEFSQSNAQGRGQ
20 132 A C S S- 0 0 41 157 0 CCCCCCCCCCCCCCCC
21 133 A S + 0 0 77 157 91 QNQQHNARDQQQNTQH
22 134 A S - 0 0 31 157 63 SSNNNNTKKKSKYKYK
23 135 A T E -A 16 0A 63 157 30 TTTTKRTTLTSTHSQS
24 136 A F E -A 15 0A 7 157 3 YYYYYYYFYYYFYFYF
25 137 A T E >> -A 14 0A 51 157 31 DTSSSETQSDDSDTDS
26 138 A D H >> S+ 0 0 61 157 24 TDDDDDDTDMMPATAP
27 139 A L H 34 S+ 0 0 108 157 3 LLLLLTLLLLLLMLML
28 140 A E H <> S+ 0 0 47 157 8 DEDDEDEEDEEDEDEE
29 141 A A H > -C 39 0B 47 157 6 DDDDDDSNNDDDDDDD
35 147 A P T 45S+ 0 0 116 157 80 IVPPMPLPGDAPPPPF
36 148 A M T 45S+ 0 0 149 157 82 ECVVTIDAECEFLMLA
37 149 A T T 45S- 0 0 65 157 43 TTEETENLNGSTSLTA
38 150 A G T <5S+ 0 0 41 157 55 GGGGGQpGMNGNQNQG
39 151 A T E - 0 0 30 157 16 DEDDDDENNDDNDNDN
53 165 A E T 3 S+ 0 0 192 157 59 EETTTTEEEEEEEEEE
54 166 A S T 3 S+ 0 0 109 156 47 SDEESSDN STNTNTN
55 167 A A < + 0 0 44 152 45 VN EENA VVTTSAA
56 168 A M S S- 0 0 87 149 61 AG IIV AVEGEGE
57 169 A P + 0 0 116 149 27 PQ QES PPSPKPD
58 170 A K - 0 0 142 149 44 TR EAR TSVTVTV
59 171 A K - 0 0 100 143 50 KT GST KRRDQYK
60 172 A D - 0 0 127 143 19 AD NKD ATGEGEG
61 173 A A 0 0 92 142 53 TI NVA TTSTSTS
62 174 A R 0 0 292 140 6 RR KRR RRQRKRQ
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 113 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 77 22 1 0 0 0 151 0 0 0.563 18 0.78
2 114 A 0 14 65 22 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 155 0 0 0.886 29 0.69
3 115 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 96 0 3 156 0 0 0.214 7 0.94
4 116 A 1 0 0 2 0 0 0 0 10 0 6 76 0 0 0 1 1 0 2 1 156 0 0 0.921 30 0.52
5 117 A 0 0 0 1 0 0 0 1 0 0 1 0 0 0 5 2 2 34 0 54 156 0 0 1.148 38 0.56
6 118 A 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 91 0 1 156 0 0 0.391 13 0.78
7 119 A 0 0 0 0 0 0 0 0 0 0 1 0 0 0 93 3 3 0 0 0 157 0 0 0.319 10 0.85
8 120 A 0 0 0 3 1 0 0 0 1 0 1 1 0 0 0 0 5 0 8 81 157 0 0 0.775 25 0.63
9 121 A 2 0 0 1 0 0 0 0 25 0 61 2 0 0 0 0 1 3 3 3 157 0 0 1.181 39 0.40
10 122 A 1 3 0 0 0 0 0 0 0 0 3 89 0 0 1 1 0 0 1 0 157 0 0 0.521 17 0.69
11 123 A 0 0 1 0 0 0 0 0 0 0 32 5 1 2 0 1 0 0 54 3 157 0 1 1.204 40 0.41
12 124 A 0 0 0 0 0 0 0 0 0 0 1 0 0 0 95 2 0 0 3 0 157 0 0 0.251 8 0.87
13 125 A 0 0 0 0 0 0 0 0 85 3 7 1 0 0 0 3 0 0 0 0 157 0 0 0.597 19 0.70
14 126 A 0 3 0 1 1 0 0 3 0 0 90 0 0 3 0 0 0 0 0 0 157 0 0 0.452 15 0.71
15 127 A 0 0 0 0 92 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 157 0 0 0.286 9 0.98
16 128 A 2 1 3 0 0 0 1 0 0 0 0 0 0 1 17 72 2 0 1 0 157 0 0 0.976 32 0.56
17 129 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 157 0 0 0.000 0 1.00
18 130 A 3 2 0 0 1 0 0 1 0 72 12 6 0 0 0 0 1 1 1 0 157 0 0 1.061 35 0.48
19 131 A 27 0 6 4 1 0 0 4 6 0 17 6 8 0 2 1 6 1 8 2 157 0 0 2.306 76 0.12
20 132 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 157 0 0 0.000 0 1.00
21 133 A 0 8 0 0 14 0 0 4 2 0 35 2 10 3 1 5 8 0 7 3 157 0 0 2.103 70 0.09
22 134 A 0 0 0 0 0 0 1 0 0 0 48 2 0 1 1 38 0 0 10 0 157 0 0 1.139 38 0.37
23 135 A 0 1 0 0 0 0 0 0 3 0 5 87 1 1 1 1 1 0 1 0 157 0 0 0.613 20 0.69
24 136 A 0 0 0 0 79 0 21 0 0 0 0 0 0 0 0 0 0 0 0 0 157 0 0 0.514 17 0.97
25 137 A 0 0 0 0 0 0 0 1 0 0 6 87 0 0 0 0 1 1 1 4 157 0 0 0.548 18 0.68
26 138 A 0 1 0 1 0 0 0 0 1 2 0 2 0 0 0 0 0 1 0 92 157 0 0 0.429 14 0.75
27 139 A 0 97 0 1 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 157 0 0 0.145 4 0.97
28 140 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 91 0 8 157 0 0 0.324 10 0.92
29 141 A 18 0 3 1 0 0 0 0 76 0 0 1 0 0 0 0 0 0 0 0 157 0 0 0.734 24 0.54
30 142 A 0 0 0 0 0 0 0 3 0 0 1 1 0 0 0 1 0 1 54 39 157 0 1 0.987 32 0.61
31 143 A 0 2 0 1 0 0 0 0 0 0 0 0 0 1 14 3 78 0 1 0 157 0 0 0.779 26 0.59
32 144 A 0 95 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 157 0 0 0.201 6 0.92
33 145 A 0 10 4 1 75 2 8 0 0 0 0 0 0 0 0 0 0 0 0 0 157 0 0 0.905 30 0.82
34 146 A 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 2 94 157 0 0 0.273 9 0.93
35 147 A 3 4 3 17 12 0 1 1 3 54 1 0 0 2 1 0 0 0 0 1 157 0 0 1.570 52 0.19
36 148 A 3 3 3 41 6 0 0 0 19 0 1 6 1 1 8 1 1 3 2 1 157 0 0 1.972 65 0.18
37 149 A 0 1 1 1 1 0 0 1 2 0 10 78 0 0 0 1 0 2 3 1 157 0 0 0.946 31 0.57
38 150 A 1 5 0 1 0 0 0 66 1 3 1 0 3 0 0 0 8 5 4 2 157 3 6 1.373 45 0.44
39 151 A 1 4 2 3 0 0 0 0 2 0 0 51 0 0 2 3 3 28 1 0 154 0 0 1.480 49 0.27
40 152 A 0 14 0 1 85 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 155 0 0 0.502 16 0.90
41 153 A 4 1 3 2 1 0 0 0 1 0 0 0 2 4 72 6 1 1 4 0 156 0 0 1.233 41 0.39
42 154 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 156 0 0 0.000 0 1.00
43 155 A 4 1 1 0 1 4 0 0 0 0 4 80 1 0 0 0 0 1 1 3 156 0 0 0.893 29 0.44
44 156 A 1 0 2 1 56 0 34 1 1 0 0 0 0 1 3 0 0 0 0 0 156 0 0 1.085 36 0.64
45 157 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 156 0 0 0.000 0 1.00
46 158 A 0 1 0 0 0 0 1 18 0 0 4 0 1 23 15 3 17 6 13 1 156 0 0 2.004 66 0.23
47 159 A 1 1 0 0 0 0 0 4 10 0 15 50 0 1 3 0 3 10 3 0 157 0 0 1.616 53 0.32
48 160 A 7 2 8 0 0 0 0 0 2 3 10 1 0 0 0 0 0 66 1 1 157 0 0 1.274 42 0.38
49 161 A 97 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 157 0 0 0.119 3 0.96
50 162 A 3 3 1 0 0 0 0 0 1 0 0 2 1 0 0 3 3 83 0 1 157 0 0 0.793 26 0.57
51 163 A 0 0 0 0 0 0 0 0 2 1 0 1 0 0 1 0 0 93 0 3 157 0 0 0.357 11 0.84
52 164 A 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 7 5 87 157 0 0 0.527 17 0.83
53 165 A 1 2 0 1 0 0 0 1 8 1 16 4 0 0 0 1 8 57 0 0 157 0 0 1.472 49 0.41
54 166 A 0 1 0 0 0 0 1 1 9 1 74 3 0 0 0 0 0 2 4 5 156 0 0 1.056 35 0.53
55 167 A 10 0 0 0 0 0 0 2 76 1 4 1 0 0 1 0 0 1 4 1 152 0 0 0.985 32 0.55
56 168 A 7 26 2 47 1 0 0 3 5 0 1 1 4 0 0 1 0 3 0 0 149 0 0 1.630 54 0.38
57 169 A 0 0 0 0 0 0 0 0 1 89 3 0 0 0 0 1 3 1 1 1 149 0 0 0.552 18 0.72
58 170 A 2 0 0 0 0 0 0 0 1 0 1 3 0 1 7 78 1 1 1 4 149 0 0 0.972 32 0.56
59 171 A 0 0 0 0 1 0 1 1 1 0 2 5 0 1 8 73 6 0 0 1 143 0 0 1.101 36 0.49
60 172 A 0 0 0 0 0 0 0 2 1 0 0 1 0 0 0 1 1 1 1 91 143 0 0 0.476 15 0.81
61 173 A 1 0 1 0 0 0 0 0 57 0 36 4 0 0 0 0 1 0 1 0 142 0 0 0.961 32 0.46
62 174 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 97 1 1 0 0 0 140 0 0 0.150 4 0.93
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
35 39 151 1 gGt
62 39 151 2 gMGt
116 31 135 3 sMADq
126 39 149 2 pSGe
127 12 145 1 nSr
136 39 193 1 pGk
137 39 193 1 pGk
147 39 109 1 pGk
//