Complet list of 1wgx hssp file
Complete list of 1wgx.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1WGX
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-12
HEADER STRUCTURAL GENOMICS, UNKNOWN FUNCTION 28-MAY-04 1WGX
COMPND MOL_ID: 1; MOLECULE: KIAA1903 PROTEIN; CHAIN: A; FRAGMENT: MYB DNA BIN
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: HOMO SAPIENS; ORGANISM_COMMON: HUMAN;
AUTHOR Y.DOI-KATAYAMA,H.HIROTA,F.HAYASHI,S.YOKOYAMA,RIKEN STRUCTURAL GENOMICS
DBREF 1WGX A 8 67 UNP Q6P0N0 CV106_HUMAN 759 818
SEQLENGTH 73
NCHAIN 1 chain(s) in 1WGX data set
NALIGN 69
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : M18BP_HUMAN 1WGX 0.86 0.90 3 73 872 942 71 0 0 1132 Q6P0N0 Mis18-binding protein 1 OS=Homo sapiens GN=MIS18BP1 PE=1 SV=1
2 : F6WE28_MACMU 0.84 0.91 3 72 605 674 70 0 0 865 F6WE28 Uncharacterized protein OS=Macaca mulatta GN=LOC702644 PE=4 SV=1
3 : H9FFU0_MACMU 0.84 0.91 3 72 494 563 70 0 0 695 H9FFU0 Mis18-binding protein 1 (Fragment) OS=Macaca mulatta GN=C14orf106 PE=2 SV=1
4 : F6WJY1_MACMU 0.83 0.90 3 73 870 940 71 0 0 1130 F6WJY1 Uncharacterized protein OS=Macaca mulatta GN=LOC702644 PE=4 SV=1
5 : G1QWZ8_NOMLE 0.83 0.92 3 73 872 942 71 0 0 1133 G1QWZ8 Uncharacterized protein OS=Nomascus leucogenys GN=MIS18BP1 PE=4 SV=1
6 : G3QUW9_GORGO 0.83 0.90 3 73 880 950 71 0 0 1140 G3QUW9 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101123952 PE=4 SV=1
7 : G7MXP2_MACMU 0.83 0.90 3 73 871 941 71 0 0 1131 G7MXP2 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_18154 PE=4 SV=1
8 : G7PA64_MACFA 0.83 0.90 3 73 871 941 71 0 0 1131 G7PA64 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_16588 PE=4 SV=1
9 : H2Q885_PANTR 0.83 0.90 3 73 872 942 71 0 0 1132 H2Q885 MIS18 binding protein 1 OS=Pan troglodytes GN=MIS18BP1 PE=2 SV=1
10 : K7DLG2_PANTR 0.83 0.90 3 73 872 942 71 0 0 1132 K7DLG2 MIS18 binding protein 1 OS=Pan troglodytes GN=MIS18BP1 PE=2 SV=1
11 : H2NL56_PONAB 0.82 0.89 3 73 871 941 71 0 0 1132 H2NL56 Uncharacterized protein OS=Pongo abelii GN=MIS18BP1 PE=4 SV=1
12 : S9Y4R0_9CETA 0.82 0.89 3 68 219 284 66 0 0 351 S9Y4R0 Uncharacterized protein OS=Camelus ferus GN=CB1_000659009 PE=4 SV=1
13 : F7C0V1_HORSE 0.80 0.90 3 71 129 197 69 0 0 387 F7C0V1 Uncharacterized protein OS=Equus caballus GN=MIS18BP1 PE=4 SV=1
14 : F7IA26_CALJA 0.80 0.87 3 73 871 941 71 0 0 1132 F7IA26 Uncharacterized protein OS=Callithrix jacchus GN=MIS18BP1 PE=4 SV=1
15 : G9L0X2_MUSPF 0.77 0.86 3 71 216 284 69 0 0 474 G9L0X2 Kinetochore-associated protein KNL-2-like protein (Fragment) OS=Mustela putorius furo PE=2 SV=1
16 : D2I0W0_AILME 0.76 0.83 3 73 880 950 71 0 0 1139 D2I0W0 Putative uncharacterized protein (Fragment) OS=Ailuropoda melanoleuca GN=PANDA_018866 PE=4 SV=1
17 : G1L5Q2_AILME 0.76 0.83 3 73 893 963 71 0 0 1153 G1L5Q2 Uncharacterized protein OS=Ailuropoda melanoleuca GN=MIS18BP1 PE=4 SV=1
18 : U6CRZ8_NEOVI 0.76 0.85 3 73 846 916 71 0 0 1109 U6CRZ8 Mis18-binding protein 1 OS=Neovison vison GN=M18BP PE=2 SV=1
19 : L5JW52_PTEAL 0.75 0.87 3 73 761 831 71 0 0 958 L5JW52 Kinetochore-associated protein KNL-2 like protein OS=Pteropus alecto GN=PAL_GLEAN10013604 PE=4 SV=1
20 : M3WH88_FELCA 0.75 0.83 3 73 882 952 71 0 0 1142 M3WH88 Uncharacterized protein OS=Felis catus GN=MIS18BP1 PE=4 SV=1
21 : M3Y0H4_MUSPF 0.75 0.85 3 73 892 962 71 0 0 1153 M3Y0H4 Uncharacterized protein OS=Mustela putorius furo GN=MIS18BP1 PE=4 SV=1
22 : G1P2U0_MYOLU 0.74 0.87 3 71 746 814 69 0 0 1004 G1P2U0 Uncharacterized protein OS=Myotis lucifugus GN=MIS18BP1 PE=4 SV=1
23 : S7NAG4_MYOBR 0.74 0.87 3 71 867 935 69 0 0 1124 S7NAG4 Mis18-binding protein 1 OS=Myotis brandtii GN=D623_10025437 PE=4 SV=1
24 : E1BMQ3_BOVIN 0.73 0.84 10 73 763 826 64 0 0 1014 E1BMQ3 Uncharacterized protein OS=Bos taurus GN=MIS18BP1 PE=4 SV=2
25 : E2RMG8_CANFA 0.73 0.83 3 73 878 948 71 0 0 1138 E2RMG8 Uncharacterized protein OS=Canis familiaris GN=MIS18BP1 PE=4 SV=2
26 : E2RMJ1_CANFA 0.73 0.83 3 73 892 962 71 0 0 1152 E2RMJ1 Uncharacterized protein OS=Canis familiaris GN=MIS18BP1 PE=4 SV=2
27 : G1STB0_RABIT 0.73 0.87 3 73 860 930 71 0 0 1122 G1STB0 Uncharacterized protein OS=Oryctolagus cuniculus GN=MIS18BP1 PE=4 SV=2
28 : J9NTS4_CANFA 0.73 0.83 3 73 889 959 71 0 0 1149 J9NTS4 Uncharacterized protein OS=Canis familiaris GN=MIS18BP1 PE=4 SV=1
29 : F1SI12_PIG 0.71 0.89 3 72 343 412 70 0 0 603 F1SI12 Uncharacterized protein OS=Sus scrofa GN=LOC100620483 PE=4 SV=2
30 : G3SN14_LOXAF 0.71 0.86 3 72 757 826 70 0 0 1015 G3SN14 Uncharacterized protein OS=Loxodonta africana GN=MIS18BP1 PE=4 SV=1
31 : M3UZ41_PIG 0.71 0.89 3 72 799 868 70 0 0 1059 M3UZ41 MIS18 binding protein 1 OS=Sus scrofa GN=MIS18BP1 PE=2 SV=1
32 : H0WLY8_OTOGA 0.70 0.85 3 73 879 949 71 0 0 1137 H0WLY8 Uncharacterized protein OS=Otolemur garnettii GN=MIS18BP1 PE=4 SV=1
33 : Q8BS57_MOUSE 0.70 0.94 3 65 737 799 63 0 0 800 Q8BS57 Putative uncharacterized protein OS=Mus musculus GN=Mis18bp1 PE=2 SV=1
34 : H0UW87_CAVPO 0.69 0.84 1 70 757 826 70 0 0 1017 H0UW87 Uncharacterized protein OS=Cavia porcellus GN=MIS18BP1 PE=4 SV=1
35 : W5PIK5_SHEEP 0.69 0.80 3 73 794 864 71 0 0 1054 W5PIK5 Uncharacterized protein OS=Ovis aries GN=MIS18BP1 PE=4 SV=1
36 : Q8K3D2_MOUSE 0.67 0.91 3 68 126 191 66 0 0 386 Q8K3D2 C79407 protein OS=Mus musculus GN=Mis18bp1 PE=2 SV=1
37 : G5BUQ9_HETGA 0.66 0.80 3 73 819 889 71 0 0 1076 G5BUQ9 Kinetochore-associated protein KNL-2-like protein OS=Heterocephalus glaber GN=GW7_10416 PE=4 SV=1
38 : D3ZKM9_RAT 0.65 0.86 3 73 728 798 71 0 0 990 D3ZKM9 Protein Mis18bp1 OS=Rattus norvegicus GN=Mis18bp1 PE=4 SV=1
39 : I3M039_SPETR 0.65 0.80 3 73 876 946 71 0 0 1135 I3M039 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=MIS18BP1 PE=4 SV=1
40 : M18BP_MOUSE 0.62 0.86 3 73 738 808 71 0 0 998 Q80WQ8 Mis18-binding protein 1 OS=Mus musculus GN=Mis18bp1 PE=1 SV=1
41 : U3IVV4_ANAPL 0.62 0.82 11 70 372 431 60 0 0 618 U3IVV4 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=MIS18BP1 PE=4 SV=1
42 : U3IVV6_ANAPL 0.62 0.84 11 73 85 147 63 0 0 326 U3IVV6 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=MIS18BP1 PE=4 SV=1
43 : K7GFW3_PELSI 0.60 0.79 9 70 935 996 62 0 0 1180 K7GFW3 Uncharacterized protein OS=Pelodiscus sinensis GN=MIS18BP1 PE=4 SV=1
44 : H0ZSM1_TAEGU 0.58 0.81 11 72 1 62 62 0 0 123 H0ZSM1 Uncharacterized protein (Fragment) OS=Taeniopygia guttata GN=C14orf106 PE=4 SV=1
45 : R0KVM8_ANAPL 0.57 0.78 11 73 656 723 68 1 5 826 R0KVM8 Putative DNA-binding protein C14orf106 (Fragment) OS=Anas platyrhynchos GN=Anapl_16724 PE=4 SV=1
46 : U3JG84_FICAL 0.55 0.84 11 72 148 209 62 0 0 384 U3JG84 Uncharacterized protein OS=Ficedula albicollis GN=MIS18BP1 PE=4 SV=1
47 : U3JG85_FICAL 0.55 0.84 11 72 148 209 62 0 0 270 U3JG85 Uncharacterized protein OS=Ficedula albicollis GN=MIS18BP1 PE=4 SV=1
48 : F6QN82_XENTR 0.54 0.79 10 72 37 99 63 0 0 288 F6QN82 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=mis18bp1 PE=4 SV=1
49 : F1NE16_CHICK 0.52 0.72 9 72 856 919 64 0 0 1106 F1NE16 Uncharacterized protein (Fragment) OS=Gallus gallus GN=LOC100858033 PE=4 SV=2
50 : G1NM23_MELGA 0.52 0.70 9 72 391 454 64 0 0 574 G1NM23 Uncharacterized protein (Fragment) OS=Meleagris gallopavo GN=MIS18BP1 PE=4 SV=2
51 : H3AB46_LATCH 0.52 0.78 11 68 40 97 58 0 0 287 H3AB46 Uncharacterized protein OS=Latimeria chalumnae PE=4 SV=1
52 : F6T7U3_XENTR 0.49 0.76 3 72 531 600 70 0 0 704 F6T7U3 Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=mis18bp1 PE=4 SV=1
53 : H9GVZ9_ANOCA 0.49 0.71 3 71 469 537 69 0 0 689 H9GVZ9 Uncharacterized protein (Fragment) OS=Anolis carolinensis GN=MIS18BP1 PE=4 SV=1
54 : G3CQZ6_XENLA 0.47 0.72 5 72 1098 1165 68 0 0 1356 G3CQZ6 M18BP1 isoform 1 OS=Xenopus laevis PE=2 SV=1
55 : V8NCX0_OPHHA 0.47 0.65 10 71 748 806 62 1 3 1028 V8NCX0 Mis18-binding protein 1 (Fragment) OS=Ophiophagus hannah GN=MIS18BP1 PE=4 SV=1
56 : F6PIQ1_ORNAN 0.46 0.72 3 70 947 1014 68 0 0 1211 F6PIQ1 Uncharacterized protein OS=Ornithorhynchus anatinus GN=MIS18BP1 PE=4 SV=2
57 : G3CQZ7_XENLA 0.46 0.74 3 72 1126 1195 70 0 0 1387 G3CQZ7 M18BP1 isoform 2 OS=Xenopus laevis PE=2 SV=1
58 : H3AB45_LATCH 0.46 0.72 3 70 490 557 68 0 0 745 H3AB45 Uncharacterized protein (Fragment) OS=Latimeria chalumnae PE=4 SV=1
59 : V9KK85_CALMI 0.46 0.67 8 70 494 556 63 0 0 731 V9KK85 MIS18-binding protein 1 a (Fragment) OS=Callorhynchus milii PE=2 SV=1
60 : W5MZZ2_LEPOC 0.46 0.64 4 72 1074 1142 69 0 0 1323 W5MZZ2 Uncharacterized protein OS=Lepisosteus oculatus GN=MIS18BP1 PE=4 SV=1
61 : G3WFU3_SARHA 0.44 0.75 3 73 895 965 71 0 0 1156 G3WFU3 Uncharacterized protein OS=Sarcophilus harrisii GN=MIS18BP1 PE=4 SV=1
62 : W5KL44_ASTMX 0.44 0.64 8 73 857 922 66 0 0 1017 W5KL44 Uncharacterized protein OS=Astyanax mexicanus GN=MIS18BP1 PE=4 SV=1
63 : W5KL45_ASTMX 0.44 0.64 8 73 785 850 66 0 0 994 W5KL45 Uncharacterized protein OS=Astyanax mexicanus GN=MIS18BP1 PE=4 SV=1
64 : F1QFV9_DANRE 0.43 0.68 11 73 856 918 63 0 0 1011 F1QFV9 Uncharacterized protein OS=Danio rerio GN=si:dkey-11j8.1 PE=4 SV=1
65 : Q8JFV2_DANRE 0.43 0.67 11 73 856 918 63 0 0 1010 Q8JFV2 Putative uncharacterized protein OS=Danio rerio GN=si:dkey-11j8.1 PE=4 SV=2
66 : F2UNC2_SALR5 0.42 0.64 2 73 1215 1288 74 1 2 1534 F2UNC2 Putative uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_12939 PE=4 SV=1
67 : F7ADT3_MONDO 0.40 0.72 3 69 283 349 67 0 0 542 F7ADT3 Uncharacterized protein (Fragment) OS=Monodelphis domestica PE=4 SV=1
68 : D2UXL6_NAEGR 0.36 0.54 10 65 323 378 56 0 0 455 D2UXL6 Predicted protein (Fragment) OS=Naegleria gruberi GN=NAEGRDRAFT_1385 PE=4 SV=1
69 : G3P109_GASAC 0.35 0.57 11 70 156 215 60 0 0 400 G3P109 Uncharacterized protein OS=Gasterosteus aculeatus GN=MIS18BP1 PE=4 SV=1
## ALIGNMENTS 1 - 69
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 130 2 0 G
2 2 A S + 0 0 123 3 104 L G
3 3 A S + 0 0 132 48 20 PPPPPPPPPPPPPPPPPPPPPPP PPTPPPPPPPSPPPPP PP PPS P TP
4 4 A G - 0 0 77 49 39 GGGGGGGGGGGGGGGGGGGGGGG GGGGGGGDGGGGGGSG KP PKD GD SG
5 5 A S + 0 0 125 50 56 LLLLLLLLLLLLLLIIIISLILL LLSLLLLLLLFLISLL MSM LMN EL AL
6 6 A S + 0 0 126 50 76 IIIIIIIIIIIITIIVVIVIIII IIIITTTITNTTNTIT NEN NNI EI ST
7 7 A G S S+ 0 0 79 50 60 QQQQQQQQQQQQQQQQQQQQQQQ QQQQQQQQDLQDQDQD HTH VHK EK SR
8 8 A D S S- 0 0 121 53 29 DDDDDDDDDDDDDDDDDDDDDDD DDDDDDDDDNDDDDDD GNG EGEEGEDD SD
9 9 A K S S+ 0 0 165 56 51 KKKKKKKKKKKEGKEEEEEEEEE EEEEEEEEEKEEEGDE E DD EEE DEVDDDGG SD
10 10 A E - 0 0 121 60 36 EEEEEEEEEEEEEEEEEEEEEEEEEEEEDEDEEEEEEEEE D DDD DEENDEKIANKK ENE
11 11 A W - 0 0 22 70 3 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWSWWWWWWWW
12 12 A N >> - 0 0 70 70 61 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSGNSNNNSSSTSSSSTSSTTSTTTTTNSTTTTTTTNT
13 13 A E H 3>>S+ 0 0 153 70 24 EEEEEEEEEEEEEEEEEEEEEEEKKKEKEEEEEEKEEEEEEEEEEEEEQQVEEEEEEVDEEEEEEEEHE
14 14 A K H 3>5S+ 0 0 129 70 38 KKKKKKKKKKEKKEKKKKNKKKKKKKNKKKKTQKKQKQKQKKKRKKKKKKKKKKEEKKKTKKKEERKEA
15 15 A E H <>5S+ 0 0 17 70 0 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
16 16 A L H >X5S+ 0 0 30 70 11 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLSLLLLLLLLVLLLVVVLLVLLLLLLLLLLLL
17 17 A Q H 3X5S+ 0 0 102 70 35 QQQQQQQQQQQQQQQQQQQQQEEQQQHQQQQQQQQQQQEQQQQQQQQEQQDEREEQEDEGQQQQQRQSM
18 18 A K H 3XX S+ 0 0 70 70 91 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCRCCCCCCCRCCRRRRrRRKRRRKRKASKRRKGEEEEnGKE
22 22 A A H 3X S+ 0 0 4 69 14 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaAAAAAAAAA.AAAAAAAAGVhVAA
23 23 A F H 3< S+ 0 0 36 69 47 FFFFFFFFFFFFFFFFFFFFFFFVFFVFFFFLFFVFFFFFVVVIVIIIVVVIMI.FVVVVLVVVVFLIV
24 24 A A H << S+ 0 0 69 69 44 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAVTAAATAAAAAAASAAASAS.VSAASSNNNNASSS
25 25 A S H < S+ 0 0 99 70 28 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSAASSSSSSASSSSKSSAASSSTMC
26 26 A L S < S- 0 0 48 70 21 LLLLLLLLLLLLLLLLLLVFLLLLLLLLLLLFLLLLLLLLLLFFLLLLFFFLLLFLLFLLVLLLLKVFH
27 27 A P - 0 0 36 70 5 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPIPPP
28 28 A K S S+ 0 0 114 70 11 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRKKKKKKKKKKKKAKGK
29 29 A H S S+ 0 0 165 70 29 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHYHHHHHHHHHHHHHHHYHHHHHQNNHHGQGG
30 30 A K S S- 0 0 92 70 23 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRKRRRRKKKKKKKKRKKKKKKKQQKKTG
31 31 A P S S+ 0 0 120 70 63 PPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPNNNSNSSNNNSNSNSNNSDRHSSSSPHPA
32 32 A G S > S+ 0 0 32 70 6 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGEGGGGGGGGGGGGGQG
33 33 A F H > S+ 0 0 20 70 8 FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFYYYYFYRY
34 34 A W H > S+ 0 0 69 70 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
35 35 A S H 4 S+ 0 0 73 70 80 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSVVVQVQQVVVLVLVMLVLVLLVVAAKLIA
36 36 A E H >X S+ 0 0 40 70 37 EDDDDDDDDDDNDNDDDDDDDDDDDDNDEDEDDDDDDDDDEEDEEDDEDEDEEEDDDDDDNNNHHYNKK
37 37 A V H 3X S+ 0 0 0 70 5 VVVVVVVVVVVVVVIVVVVVIVVVVVVVVIVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVGVV
38 38 A A H 3< S+ 0 0 8 70 1 AAAAAAAAAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
39 39 A A H X4 S+ 0 0 78 70 59 AAAAAAAAAAAMMAMMMMMMMMMMMMMMMMMMMLMMLMKMMMMMMMMMMMMMGMRMMMMMTLLYYKSER
40 40 A A H 3< S+ 0 0 55 70 60 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAMAAVAAAAAAAAAASAATSCSTYSTICAIIVVIFYM
41 41 A V T >< + 0 0 0 70 7 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVLVVVVVVVVVVVVVVVVVMVVVVVGVV
42 42 A G T < + 0 0 48 70 12 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGRGGGGGGGGGGGSKG
43 43 A S T 3 S+ 0 0 71 70 37 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTSSSSSSSTSSSTTSTTTPTTTTTSTT
44 44 A R S < S- 0 0 83 70 13 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRHHRRRHRRRRRRHRRRRRRRRQKR
45 45 A S > - 0 0 58 70 24 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTTSTTTTTSSSSSSSSSSSSTSSSSSSSTSSSSSTTS
46 46 A P H > S+ 0 0 45 70 9 PAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPAPA
47 47 A E H > S+ 0 0 144 70 26 EEEEEEEEEEENEEEEEEEEEEEDEEEEDEDQDEDDKDKDEEEEEQQEEEMEEEEEEMEEEEEEEHEEE
48 48 A E H > S+ 0 0 48 70 2 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDDEEEEEEEEEEEEEEEEEEE
49 49 A C H X S+ 0 0 0 70 7 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCSCCCCCYVC
50 50 A Q H X S+ 0 0 72 70 20 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHHQQHQQQHHQQQQQQQQQQQQQHHCQQH
51 51 A R H X S+ 0 0 159 70 66 RRRRRRRRRRRRKRRRRRRRRRRRRRRRRKRKKKRKKKKKQQQGQEEEWQQEQEQKEQQQKEEEEHKQF
52 52 A K H >X S+ 0 0 31 70 24 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRKKKKKRKKKKKKQKQQQQMKKQ
53 53 A Y H 3< S+ 0 0 49 70 24 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYHYYHYYYYHYYYYHHTH
54 54 A M H 3< S+ 0 0 107 70 65 MMMMMMMMMMMMMMMMMMLMMMMMLLMLLMLMTMMTMTLTTTMRTLLLTTILHLMMLISTMAANNMTGT
55 55 A E H << S+ 0 0 117 70 37 EEEEEEEEEEEEEEEEEEEEEEEEEEEEKEKEEEEEEEEEEEEEEQQEEEEEAEAEEESDEAAAATEEC
56 56 A N S < S+ 0 0 84 70 54 NNNNNNNNNNNDDNDDDDDDDDDDDDDDDDDNEDDEDEDEEEDEEEEKEEEKEKEDKEEDEHHHHLEIQ
57 57 A P S S- 0 0 34 70 73 PPPPPPPPPPPPPPPPPPPPPSSPPPPPSPSPPPPPPPPPQQHQQQQQQQHQQQQWQHHCQYYQQMHQE
58 58 A R S S- 0 0 243 70 58 RRRRRRRRRRRRRRRRRRRRRQQRRRQRQRQRQQRQRQRQQQQEQQQQAAQQEQEQQQQTQQQRRDQET
59 59 A G S S- 0 0 43 70 56 GGGGGGGGGGGRGGEEEERGEGGGGGGGGGGGGDGGRGGGSSTGSGGSKKASGTGGSASSGTTKKTGES
60 60 A K S S+ 0 0 204 70 34 KKKKKKKKKKKKKKKKKKKKKNNKKKKKKKKKQKKQKRKQKKKKKKKKATKKKRKNKKKHRKKSSKKKH
61 61 A G - 0 0 46 70 62 GGGGGGGGGGGGGRGGGGGEGGGRGGGGGRGRGVRGQGGGAAGGAGGGSSGGVAKGAGLRGRRKKAGKT
62 62 A S - 0 0 101 70 63 SSSSSSSSSSSSSSSSSSSSSFFFTTSTSFSSSSFSSSPSSSSSSSSSKKSSQSRLTSIKSKKIIKSMP
63 63 A Q - 0 0 142 70 58 QQQQQQQQQQQQQQQQQQQQQQQQQQQQRRRQRQQRQRQRKKKKKKKKRRKKAKSRKKKSQQQPPQRDP
64 64 A K + 0 0 121 70 54 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKRRKQRQQAPPEAPAPPAETRKRRRRKKEK
65 65 A H S S+ 0 0 176 70 74 HHHHHHHHHHHHHHPPPPHPPHHHPPHPHHHHHNRHNYHHRRAQRQQQAAIQKQKQQITARGGQQGRQR
66 66 A V S S+ 0 0 104 68 80 VVVVVVVVVVVVVVVVVVVVVVVDVVVVVVVV TDGTVVGAAAPAPPSVTSSSSKASSKTAKKRRSA A
67 67 A T S S- 0 0 127 68 66 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTSASISTTTRTRRKKKSKTKTTKSAKVQQPPSI K
68 68 A S - 0 0 87 68 24 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK KKKKKKKKKKRKKKKKKKKAKSKKKKQKKKSSSK K
69 69 A G S S- 0 0 49 65 48 KKKKKKKKKKK EKKKKKKKKKKKKKKKKKKK RK RKKKEATTATTKTT KRKKKKKQKKEEKKGK P
70 70 A P - 0 0 139 64 63 KKKKKKKKKKK KKKKKKKKKKKKQQKQNKNK KK KKKKKTATTTTSTT SPPPAPKKPIKKKKG N
71 71 A S + 0 0 113 57 66 PPPPPPPPPPP PPTTTTPPTPPPTTPTPPPP P QQQQ T STSSGSS GGEG G SPKKPPG
72 72 A S 0 0 129 51 53 AAAAAAAAAAA A VVVVVV AVVAVAAAA A AAAA S GSGGSGG S S A SSAAAAG
73 73 A G 0 0 127 36 62 N NSNNNNNN N NNSSNS NNNNN N N NNNN G G PPPAAG
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0.000 0 1.00
2 2 A 0 33 0 0 0 0 0 33 0 0 33 0 0 0 0 0 0 0 0 0 3 0 0 1.099 36 -0.05
3 3 A 0 0 0 0 0 0 0 0 0 90 6 4 0 0 0 0 0 0 0 0 48 0 0 0.404 13 0.79
4 4 A 0 0 0 0 0 0 0 82 0 4 4 0 0 0 0 4 0 0 0 6 49 0 0 0.728 24 0.61
5 5 A 0 64 12 6 2 0 0 0 2 0 10 0 0 0 0 0 0 2 2 0 50 0 0 1.252 41 0.44
6 6 A 6 0 54 0 0 0 0 0 0 0 4 20 0 0 0 0 0 4 12 0 50 0 0 1.335 44 0.24
7 7 A 2 2 0 0 0 0 0 2 0 0 2 2 0 6 2 4 68 2 0 8 50 0 0 1.310 43 0.39
8 8 A 0 0 0 0 0 0 0 8 0 0 2 0 0 0 0 0 0 8 4 79 53 0 0 0.773 25 0.71
9 9 A 2 0 0 0 0 0 0 7 0 0 2 0 0 0 0 25 0 50 0 14 56 0 0 1.303 43 0.48
10 10 A 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 5 0 73 5 13 60 0 0 0.932 31 0.64
11 11 A 0 0 0 0 0 99 0 0 0 0 1 0 0 0 0 0 0 0 0 0 70 0 0 0.075 2 0.97
12 12 A 0 0 0 0 0 0 0 1 0 0 19 24 0 0 0 0 0 0 56 0 70 0 0 1.043 34 0.39
13 13 A 3 0 0 0 0 0 0 0 0 0 0 0 0 1 0 7 3 84 0 1 70 0 0 0.657 21 0.76
14 14 A 0 0 0 0 0 0 0 0 1 0 0 3 0 0 3 74 6 10 3 0 70 0 0 0.980 32 0.62
15 15 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 70 0 0 0.000 0 1.00
16 16 A 7 91 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 70 0 0 0.331 11 0.89
17 17 A 0 0 0 1 0 0 0 1 0 0 1 0 0 1 3 0 76 13 0 3 70 0 0 0.920 30 0.65
18 18 A 0 1 0 0 0 0 0 0 0 0 3 0 0 0 10 84 0 0 1 0 70 0 0 0.597 19 0.75
19 19 A 0 96 0 0 3 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 70 0 0 0.204 6 0.94
20 20 A 1 1 0 0 0 0 6 0 1 0 0 0 0 79 0 0 9 0 3 0 70 0 0 0.847 28 0.60
21 21 A 0 0 0 0 0 0 0 3 1 0 1 0 56 0 21 9 0 7 1 0 70 1 2 1.339 44 0.08
22 22 A 3 0 0 0 0 0 0 1 94 0 0 0 0 1 0 0 0 0 0 0 69 0 0 0.282 9 0.85
23 23 A 28 4 10 1 57 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 69 0 0 1.107 36 0.53
24 24 A 3 0 0 0 0 0 0 0 74 0 13 4 0 0 0 0 0 0 6 0 69 0 0 0.893 29 0.56
25 25 A 0 0 0 1 0 0 0 0 7 0 86 3 1 0 0 1 0 0 0 0 70 0 0 0.604 20 0.71
26 26 A 4 79 0 0 14 0 0 0 0 0 0 0 0 1 0 1 0 0 0 0 70 0 0 0.724 24 0.78
27 27 A 0 0 1 0 0 0 0 0 0 99 0 0 0 0 0 0 0 0 0 0 70 0 0 0.075 2 0.94
28 28 A 0 0 0 0 0 0 0 1 1 0 0 0 0 0 1 96 0 0 0 0 70 0 0 0.224 7 0.88
29 29 A 0 0 0 0 0 0 3 4 0 0 0 0 0 87 0 0 3 0 3 0 70 0 0 0.560 18 0.71
30 30 A 0 0 0 0 0 0 0 1 0 0 0 1 0 0 10 84 3 0 0 0 70 0 0 0.597 19 0.76
31 31 A 0 0 0 0 0 0 0 0 1 60 17 0 0 3 1 0 0 0 16 1 70 0 0 1.183 39 0.37
32 32 A 0 0 0 0 0 0 0 97 0 0 0 0 0 0 0 0 1 1 0 0 70 0 0 0.150 4 0.93
33 33 A 0 0 0 0 90 0 9 0 0 0 0 0 0 0 1 0 0 0 0 0 70 0 0 0.366 12 0.92
34 34 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 70 0 0 0.000 0 1.00
35 35 A 19 10 1 1 0 0 0 0 4 0 59 0 0 0 0 1 4 0 0 0 70 0 0 1.308 43 0.19
36 36 A 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 3 0 19 10 64 70 0 0 1.091 36 0.62
37 37 A 94 0 4 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 70 0 0 0.251 8 0.95
38 38 A 0 0 0 0 0 0 0 0 99 0 1 0 0 0 0 0 0 0 0 0 70 0 0 0.075 2 0.98
39 39 A 0 6 0 61 0 0 3 1 19 0 1 1 0 0 3 3 0 1 0 0 70 0 0 1.323 44 0.41
40 40 A 4 0 6 3 1 0 3 0 70 0 6 4 3 0 0 0 0 0 0 0 70 0 0 1.212 40 0.39
41 41 A 96 1 0 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 70 0 0 0.224 7 0.93
42 42 A 0 0 0 0 0 0 0 96 0 0 1 0 0 0 1 1 0 0 0 0 70 0 0 0.224 7 0.88
43 43 A 0 0 0 0 0 0 0 0 0 1 79 20 0 0 0 0 0 0 0 0 70 0 0 0.572 19 0.62
44 44 A 0 0 0 0 0 0 0 0 0 0 0 0 0 6 91 1 1 0 0 0 70 0 0 0.367 12 0.86
45 45 A 0 0 0 0 0 0 0 0 0 0 84 16 0 0 0 0 0 0 0 0 70 0 0 0.435 14 0.75
46 46 A 0 0 0 0 0 0 0 0 94 6 0 0 0 0 0 0 0 0 0 0 70 0 0 0.219 7 0.91
47 47 A 0 0 0 3 0 0 0 0 0 0 0 0 0 1 0 3 4 76 1 11 70 0 0 0.918 30 0.74
48 48 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 97 0 3 70 0 0 0.130 4 0.98
49 49 A 1 0 0 0 0 0 1 0 0 0 1 0 96 0 0 0 0 0 0 0 70 0 0 0.224 7 0.92
50 50 A 0 0 0 0 0 0 0 0 0 0 0 0 1 11 0 0 87 0 0 0 70 0 0 0.429 14 0.80
51 51 A 0 0 0 0 1 1 0 1 0 0 0 0 0 1 44 19 17 14 0 0 70 0 0 1.496 49 0.33
52 52 A 0 0 0 1 0 0 0 0 0 0 0 0 0 0 4 86 9 0 0 0 70 0 0 0.538 17 0.75
53 53 A 0 0 0 0 0 0 90 0 0 0 0 1 0 9 0 0 0 0 0 0 70 0 0 0.366 12 0.76
54 54 A 0 19 3 50 0 0 0 1 3 0 1 17 0 1 1 0 0 0 3 0 70 0 0 1.509 50 0.35
55 55 A 0 0 0 0 0 0 0 0 9 0 1 1 1 0 0 3 3 80 0 1 70 0 0 0.835 27 0.63
56 56 A 0 1 1 0 0 0 0 0 0 0 0 0 0 6 0 6 1 27 20 37 70 0 0 1.553 51 0.46
57 57 A 0 0 0 1 0 1 3 0 0 53 6 0 1 7 0 0 26 1 0 0 70 0 0 1.383 46 0.27
58 58 A 0 0 0 0 0 0 0 0 3 0 0 3 0 0 47 0 40 6 0 1 70 0 0 1.148 38 0.42
59 59 A 0 0 0 0 0 0 0 57 3 0 13 7 0 0 4 6 0 9 0 1 70 0 0 1.443 48 0.43
60 60 A 0 0 0 0 0 0 0 0 1 0 3 1 0 3 4 79 4 0 4 0 70 0 0 0.919 30 0.66
61 61 A 3 1 0 0 0 0 0 63 9 0 3 1 0 0 11 6 1 1 0 0 70 0 0 1.360 45 0.38
62 62 A 0 1 4 1 7 0 0 0 0 3 66 6 0 0 1 9 1 0 0 0 70 0 0 1.318 43 0.37
63 63 A 0 0 0 0 0 0 0 0 1 4 3 0 0 0 16 20 54 0 0 1 70 0 0 1.302 43 0.42
64 64 A 0 0 0 0 0 0 0 0 6 7 0 1 0 0 11 66 4 4 0 0 70 0 0 1.207 40 0.46
65 65 A 0 0 3 0 0 0 1 4 6 13 0 1 0 40 10 3 16 0 3 0 70 0 0 1.876 62 0.26
66 66 A 51 0 0 0 0 0 0 3 12 4 12 6 0 0 3 6 0 0 0 3 68 0 0 1.628 54 0.19
67 67 A 1 0 3 0 0 0 0 0 3 3 9 62 0 0 4 12 3 0 0 0 68 0 0 1.378 46 0.34
68 68 A 0 0 0 0 0 0 0 0 1 0 7 0 0 0 1 88 1 0 0 0 68 0 0 0.489 16 0.75
69 69 A 0 0 0 0 0 0 0 3 3 2 0 9 0 0 5 71 2 6 0 0 65 0 0 1.121 37 0.51
70 70 A 0 0 2 0 0 0 0 2 3 9 3 11 0 0 0 61 5 0 5 0 64 0 0 1.399 46 0.36
71 71 A 0 0 0 0 0 0 0 11 0 47 12 18 0 0 0 4 7 2 0 0 57 0 0 1.529 51 0.33
72 72 A 18 0 0 0 0 0 0 12 55 0 16 0 0 0 0 0 0 0 0 0 51 0 0 1.178 39 0.46
73 73 A 0 0 0 0 0 0 0 11 6 8 11 0 0 0 0 0 0 0 64 0 36 0 0 1.142 38 0.38
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
45 12 667 5 rQVVPLa
66 21 1235 2 nAVh
//