Complet list of 1cto hssp file
Complete list of 1cto.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1CTO
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-09
HEADER BINDING PROTEIN 25-SEP-96 1CTO
COMPND MOL_ID: 1; MOLECULE: GRANULOCYTE COLONY-STIMULATING FACTOR RECEPTOR; C
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: MUS MUSCULUS; ORGANISM_COMMON: HOUSE M
AUTHOR K.YAMASAKI,S.NAITO,H.ANAGUCHI,T.OHKUBO,Y.OTA
DBREF 1CTO A 1 109 UNP P40223 CSF3R_MOUSE 211 333
SEQLENGTH 109
NCHAIN 1 chain(s) in 1CTO data set
NALIGN 49
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : CSF3R_MOUSE 1PGR 1.00 1.00 4 109 228 333 106 0 0 837 P40223 Granulocyte colony-stimulating factor receptor OS=Mus musculus GN=Csf3r PE=1 SV=2
2 : Q14AY3_MOUSE 1.00 1.00 4 109 228 333 106 0 0 837 Q14AY3 Colony stimulating factor 3 receptor (Granulocyte) OS=Mus musculus GN=Csf3r PE=2 SV=1
3 : D4A3R0_RAT 0.88 0.96 4 109 228 333 106 0 0 838 D4A3R0 Colony stimulating factor 3 receptor (Granulocyte) (Predicted), isoform CRA_b OS=Rattus norvegicus GN=Csf3r PE=4 SV=1
4 : G3H9X4_CRIGR 0.81 0.95 4 109 228 333 106 0 0 841 G3H9X4 Granulocyte colony-stimulating factor receptor OS=Cricetulus griseus GN=I79_007205 PE=4 SV=1
5 : I3LX16_SPETR 0.65 0.73 25 106 242 323 82 0 0 817 I3LX16 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=CSF3R PE=4 SV=1
6 : G5C6G6_HETGA 0.62 0.77 4 107 228 331 104 0 0 887 G5C6G6 Granulocyte colony-stimulating factor receptor OS=Heterocephalus glaber GN=GW7_05417 PE=4 SV=1
7 : H0WL79_OTOGA 0.61 0.73 4 107 227 330 104 0 0 838 H0WL79 Uncharacterized protein OS=Otolemur garnettii GN=CSF3R PE=4 SV=1
8 : G1T0W6_RABIT 0.60 0.72 4 109 250 355 106 0 0 946 G1T0W6 Uncharacterized protein OS=Oryctolagus cuniculus GN=CSF3R PE=4 SV=2
9 : L9JC93_TUPCH 0.60 0.73 4 109 227 332 106 0 0 980 L9JC93 Granulocyte colony-stimulating factor receptor OS=Tupaia chinensis GN=TREES_T100000931 PE=3 SV=1
10 : G3TD98_LOXAF 0.59 0.75 4 109 227 332 106 0 0 840 G3TD98 Uncharacterized protein OS=Loxodonta africana PE=4 SV=1
11 : H0VHX4_CAVPO 0.59 0.75 4 109 228 333 106 0 0 842 H0VHX4 Uncharacterized protein OS=Cavia porcellus GN=LOC100717194 PE=4 SV=1
12 : S7PFH5_MYOBR 0.59 0.74 4 109 228 333 106 0 0 842 S7PFH5 Granulocyte colony-stimulating factor receptor OS=Myotis brandtii GN=D623_10028902 PE=4 SV=1
13 : F6ZM36_HORSE 0.58 0.73 4 109 227 332 106 0 0 841 F6ZM36 Uncharacterized protein OS=Equus caballus GN=CSF3R PE=4 SV=1
14 : G1P748_MYOLU 0.58 0.74 4 109 228 333 106 0 0 842 G1P748 Uncharacterized protein OS=Myotis lucifugus GN=CSF3R PE=4 SV=1
15 : S9WWB8_9CETA 0.58 0.70 4 109 227 332 106 0 0 986 S9WWB8 Granulocyte colony-stimulating factor receptor isoform 1 OS=Camelus ferus GN=CB1_000884035 PE=4 SV=1
16 : M3WIJ4_FELCA 0.57 0.69 4 109 227 332 106 0 0 839 M3WIJ4 Uncharacterized protein OS=Felis catus GN=CSF3R PE=4 SV=1
17 : M3YTJ9_MUSPF 0.57 0.67 4 109 227 332 106 0 0 839 M3YTJ9 Uncharacterized protein OS=Mustela putorius furo GN=CSF3R PE=4 SV=1
18 : E1B9W2_BOVIN 0.56 0.69 4 109 227 332 106 0 0 841 E1B9W2 Uncharacterized protein OS=Bos taurus GN=CSF3R PE=4 SV=2
19 : E2R8G9_CANFA 0.56 0.68 4 109 227 332 106 0 0 840 E2R8G9 Uncharacterized protein OS=Canis familiaris GN=CSF3R PE=4 SV=1
20 : F7EA72_CALJA 0.56 0.73 4 109 214 319 106 0 0 822 F7EA72 Uncharacterized protein OS=Callithrix jacchus GN=CSF3R PE=4 SV=1
21 : G1L3X5_AILME 0.56 0.69 4 109 226 331 106 0 0 838 G1L3X5 Uncharacterized protein OS=Ailuropoda melanoleuca GN=CSF3R PE=4 SV=1
22 : H2PYN7_PANTR 0.56 0.73 4 107 227 330 104 0 0 863 H2PYN7 Uncharacterized protein OS=Pan troglodytes GN=CSF3R PE=4 SV=1
23 : I3LQQ8_PIG 0.56 0.69 4 109 229 334 106 0 0 846 I3LQQ8 Uncharacterized protein OS=Sus scrofa GN=CSF3R PE=4 SV=1
24 : K9IWH4_PIG 0.56 0.69 4 109 228 333 106 0 0 848 K9IWH4 Granulocyte colony-stimulating factor receptor OS=Sus scrofa GN=CSF3R PE=2 SV=1
25 : M1ELE4_MUSPF 0.55 0.65 4 109 227 334 108 1 2 840 M1ELE4 Colony stimulating factor 3 receptor (Fragment) OS=Mustela putorius furo PE=2 SV=1
26 : CSF3R_HUMAN 2D9Q 0.54 0.73 4 109 227 332 106 0 0 836 Q99062 Granulocyte colony-stimulating factor receptor OS=Homo sapiens GN=CSF3R PE=1 SV=1
27 : E1B6W6_HUMAN 0.54 0.73 4 109 227 332 106 0 0 769 E1B6W6 Colony stimulating factor 3 receptor (Granulocyte), isoform CRA_a OS=Homo sapiens GN=CSF3R PE=2 SV=1
28 : G3RCL2_GORGO 0.54 0.73 4 109 227 332 106 0 0 862 G3RCL2 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101128750 PE=4 SV=1
29 : G3S1M9_GORGO 0.54 0.73 4 109 227 332 106 0 0 836 G3S1M9 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101128750 PE=4 SV=1
30 : G7NTI8_MACFA 0.54 0.73 4 109 227 332 106 0 0 863 G7NTI8 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_00486 PE=4 SV=1
31 : Q1ZYL6_HUMAN 0.54 0.73 4 109 227 332 106 0 0 836 Q1ZYL6 Granulocyte colony-stimulating factor receptor OS=Homo sapiens PE=2 SV=1
32 : G7MFW6_MACMU 0.53 0.73 4 109 227 332 106 0 0 863 G7MFW6 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_00567 PE=4 SV=1
33 : H2N7Y9_PONAB 0.53 0.73 4 109 227 332 106 0 0 834 H2N7Y9 Uncharacterized protein OS=Pongo abelii GN=CSF3R PE=4 SV=1
34 : F6TTY1_MACMU 0.52 0.70 16 109 240 333 94 0 0 865 F6TTY1 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=2 SV=1
35 : F6TTZ1_MACMU 0.52 0.70 16 109 240 333 94 0 0 837 F6TTZ1 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=2 SV=1
36 : F6TU14_MACMU 0.52 0.70 16 109 240 333 94 0 0 784 F6TU14 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=2 SV=1
37 : G1RLA1_NOMLE 0.51 0.73 4 109 227 332 106 0 0 863 G1RLA1 Uncharacterized protein OS=Nomascus leucogenys PE=4 SV=1
38 : F6ZY07_MONDO 0.42 0.58 21 106 245 333 89 1 3 866 F6ZY07 Uncharacterized protein OS=Monodelphis domestica GN=CSF3R PE=4 SV=2
39 : G3X148_SARHA 0.38 0.58 4 109 226 336 111 2 5 863 G3X148 Uncharacterized protein OS=Sarcophilus harrisii GN=CSF3R PE=4 SV=1
40 : G3X149_SARHA 0.38 0.58 4 109 234 344 111 2 5 871 G3X149 Uncharacterized protein (Fragment) OS=Sarcophilus harrisii GN=CSF3R PE=4 SV=1
41 : U3IK55_ANAPL 0.37 0.53 4 109 129 229 106 2 5 735 U3IK55 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=CSF3R PE=4 SV=1
42 : U3JDQ5_FICAL 0.36 0.57 22 109 233 321 90 2 3 702 U3JDQ5 Uncharacterized protein OS=Ficedula albicollis GN=CSF3R PE=4 SV=1
43 : G1MWV1_MELGA 0.33 0.53 6 109 243 341 104 2 5 785 G1MWV1 Uncharacterized protein OS=Meleagris gallopavo GN=CSF3R PE=4 SV=1
44 : V8NJY1_OPHHA 0.33 0.61 5 109 346 444 105 3 6 1011 V8NJY1 Granulocyte colony-stimulating factor receptor (Fragment) OS=Ophiophagus hannah GN=CSF3R PE=4 SV=1
45 : K7FS22_PELSI 0.32 0.57 2 109 227 329 108 2 5 850 K7FS22 Uncharacterized protein OS=Pelodiscus sinensis GN=CSF3R PE=4 SV=1
46 : M7BAB9_CHEMY 0.32 0.60 2 109 227 329 108 2 5 855 M7BAB9 Granulocyte colony-stimulating factor receptor OS=Chelonia mydas GN=UY3_07910 PE=4 SV=1
47 : O57519_XENLA 0.30 0.47 6 99 218 307 96 4 8 881 O57519 Gp130p1 OS=Xenopus laevis GN=il6st PE=2 SV=1
48 : Q2TAD1_XENLA 0.30 0.47 6 99 197 286 96 4 8 861 Q2TAD1 Uncharacterized protein (Fragment) OS=Xenopus laevis PE=2 SV=1
49 : Q6DD37_XENLA 0.30 0.52 4 108 219 318 105 2 5 813 Q6DD37 Csf3r-prov protein OS=Xenopus laevis GN=csf3r PE=2 SV=1
## ALIGNMENTS 1 - 49
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 97 1 0
2 2 A S + 0 0 108 3 64 AA
3 3 A S S S- 0 0 93 3 76 KK
4 4 A L - 0 0 98 40 0 LLLL LLLLLLLLLLLLLLLLLLLLLLLLLLLL L LLL LL L
5 5 A E - 0 0 76 41 11 EEEE EEEEEEEEEEEEEEEEEEEEEEEEEEEE E DDE EDD D
6 6 A P - 0 0 56 44 0 PPPP PPPPPPPPPPPPPPPPPPPPPPPPPPPP P PPP PPPPPPP
7 7 A P S S+ 0 0 68 44 0 PPPP PPPPPPPPPPPPPPPPPPPPPPPPPPPP P PPP PPPPPPP
8 8 A M E -A 31 0A 105 44 72 MMMV TITRTTTTTTTTTTMTMTTTMMMMMMMM V IIA TIVVQQV
9 9 A L E +A 30 0A 7 44 6 LLLL LLLLLLLLLLLLLLLLLLLLLLLLLLLL L LLL LILLLLI
10 10 A Q E -A 29 0A 111 44 73 QQQQ MWWWESWWWWWWWWRWRWWWRRRRRRRR R QQQ QQKKSSK
11 11 A A E -A 28 0A 41 44 58 AAAA AVAAAAAAAAAAAATATAAATTTTTTTT T AAS SDSSEEE
12 12 A L - 0 0 42 44 41 LLLV LELLLLLLLLLLLLLLMLLLMMMMLMMM M AAI IIVILLA
13 13 A D + 0 0 159 44 61 DDDD DEDDDDDDDEAAEPDADEEADDDDDDDD D KKQ HRQQIIA
14 14 A I + 0 0 58 44 56 IIII PPSPPPPPPPPPPPPPPPPPPPPPPPPP P AAS SPSSSSP
15 15 A G S > S- 0 0 44 44 60 GGGS NSSRSNSKSSSSSSNSSSSSSSSSSSSS S SSV VHVVSSV
16 16 A P T 3 S+ 0 0 129 47 43 PPHP PPPQPSPTPPPPPPPPPPPPPPPPPPPPPPPP PPP PFPPLLI
17 17 A D T 3 + 0 0 130 47 52 DDDD EEKEEEEEEEEEAEEEEEEEEEEEEEEEEEEE ggP PEFFEES
18 18 A V S < S- 0 0 17 39 67 VVIL MAGPVVVVVVVVVAAVAVVVAAAAAAAAAAAA ss. .......
19 19 A V S S+ 0 0 149 39 51 VVVI AAASAAAVAPAAAAPAAAAAAAAAAAAAAAAA SS. .......
20 20 A S S S- 0 0 75 39 52 SSSP LSPPPRPPPPPPPPPPPPPPPPPPPPPPPPPP LL. .......
21 21 A H - 0 0 147 40 53 HHRY PPPPAPPLPLPPPPPPPPPPPPPPPPPPPPPPPRR. .......
22 22 A Q - 0 0 100 49 26 QQQQ QQQQQRQQQQQQQQEQQQQQQQQQQQQQQQQEQQQQQQEQQLLR
23 23 A P S S+ 0 0 99 49 45 PPPP PPPPPPPPPPPPPPPPAPPPAAAAAAAAAAAAPPPTTTSTTPPP
24 24 A G S S+ 0 0 22 49 31 GGGG DGGGDDGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDDDNNDNNG
25 25 A a E - B 0 72A 6 50 10 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCAAC
26 26 A L E - B 0 71A 21 50 16 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVIVMVILLV
27 27 A W E + B 0 70A 24 50 83 WWWWWWWQWWWQRQRLLRLHLQRRLQQQQQQQQQQQQQRRTATESSKKR
28 28 A L E +AB 11 69A 4 50 17 LLLLLLVLLFLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVVVIII
29 29 A S E +A 10 0A 25 50 90 SSSSHRHHGRRHRHRRRRRCRCRRRCCCCCCCCCCCCQKKAATKAAEEQ
30 30 A W E -A 9 0A 10 50 8 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWT
31 31 A K E -A 8 0A 130 50 45 KKKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKKKADAEEEKKK
32 32 A P - 0 0 36 50 68 PPPPPPPPPSPSPSTLPTPPPPTTPPPPPPPPPPPPPKKKVVMAVGNNK
33 33 A W S > S+ 0 0 117 50 83 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWAAAAAAAAAPPS
34 34 A K T 3 S- 0 0 101 50 65 KKKKKKKKKKKKKKKKKRKKKQKKKQQQQQQQQQQQQRRRPWPKHPIIE
35 35 A P T 3 - 0 0 127 50 52 PPPPPPPPQPPPPSPPPPPPPPPPPPPPPPPPPPPPPQMMSGRDTTTTP
36 36 A S < - 0 0 41 50 49 SSSGSSSSSSSTGTSSSSSGSGSSsGGGGAGAGAAAGNSSSTNSSSNNG
37 37 A E S S- 0 0 147 48 86 EEEELILHLLFQLQLLLLLLLLLLlLLLLLLLLLLLLKNNAEAESS..S
38 38 A Y - 0 0 161 50 58 YYFYFYHFYFYYYYHYYYYFYHHHYHHHHHHHHHHHHHHHHHHLHHAAY
39 39 A M + 0 0 79 50 35 MMMIMMIIVMMIIIIIIIIIIIIIIIIIIIIIIIIIILMMMMMMIIFFI
40 40 A E + 0 0 139 50 37 EEEEDDSDEENDEDEEEEENENEEENNNNNNNNNNNNEEEEEEEEENNE
41 41 A Q - 0 0 59 50 29 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLQQQLLQ
42 42 A E - 0 0 97 50 61 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKTMMWQYAQQKKW
43 43 A b E -CD 62 86B 9 50 6 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCYYY
44 44 A E E - D 0 85B 17 50 14 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEDEENNQ
45 45 A L E -CD 59 84B 0 50 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLIIL
46 46 A R E +CD 58 83B 64 50 11 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRQQRRRRRHRRR
47 47 A Y E -CD 57 82B 39 50 50 YYYYLYYHHYYHHHHHHHHHHHHHHHHHHHHHHHHHHYYYYYYYYYYYY
48 48 A Q E - D 0 81B 19 50 53 QQQRRRQQQQRQQQQQQQQKQKQQQKKKKKKKKKKKKQHHRRRQWRRRR
49 49 A P E - D 0 80B 9 50 39 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPHRRPAAVTTPPK
50 50 A Q E + D 0 79B 65 50 55 QQQQQQQQRQQQQQQQQRRQQQQQQQQQQQQQQQQQQPPPPPAEPHVVA
51 51 A L S S- 0 0 73 50 95 LLLLRLPLLLLRLPLLLLLRPRRRLRRRRSRSHSSSRGGGEEEGGEKKD
52 52 A K S S- 0 0 216 50 60 KKERGRGEGRGGGGGGGGGGGAGGGGGGGGGGGGGGGEEEDDDSDDTTQ
53 53 A G - 0 0 40 50 70 GGGREEDEEEEGEGEEEEEEEEEEEEEEEEEEEEEEELLLPPPLPPQQN
54 54 A A - 0 0 49 50 50 AAAAAAADAAAATAAADAAAAASSDAAAAAAAAAAAAnttNASEGGDDQ
55 55 A N - 0 0 156 41 58 NNNNNSSNSSSSSNSSSGSSSNSSSSSSSSSSSSSSSmvv.........
56 56 A W - 0 0 88 50 12 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWSSSWWWWWWWWW
57 57 A T E -C 47 0B 56 50 68 TTTTTATTVAATATDAVDDAAADDVAAAAAAAAAAAAPPPVADSTTEEI
58 58 A L E -C 46 0B 67 50 26 LLLLQQLLLLQLLLLLLLLLLLLLLLLLLLLLLLLLLFFFLLVLLLMME
59 59 A V E -C 45 0B 37 50 14 VVVVVVEVVVVVVVVVVVVEVVVVVVVVVVVVVVVVVLLLVVVVVVVVV
60 60 A F E + 0 0B 105 50 72 FFSSAAGAEAGGGGGGGSGSGGGGGGGGGGGGGGGGGSNNTTTHPPppQ
61 61 A H E S+ 0 0B 175 50 70 HHHHTPAPPPPPPPAPPAPPPPAAPPPPPPPPPPPPPFSSGGGNDDddP
62 62 A L E -C 43 0B 19 50 49 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLSSSIIIIIITTT
63 63 A P - 0 0 55 50 55 PPPPPPPPPPPPPPPLLPLPLPPPLPPPPPPPPPPPPPTTVVPTVVAAT
64 64 A S S S+ 0 0 27 50 76 SSSSSSTSSSSSSSSPSASSSLSSSLLLLLLLLLLLLQQQGGGTSSSSK
65 65 A S S S- 0 0 62 50 77 SSSSSSEMKCRKRKREQRQEQERRQEEEEEEEEEEEENHHQQHSPPHHD
66 66 A K - 0 0 75 50 73 KKTEITTTTTTTTTTTTTTATASSTAAAAAAAAAAAAEEEAAASNNRRS
67 67 A D S S- 0 0 145 50 84 DDDDLTLVLLTLLLLLLSLFLLLLLLLLLLLLLLLLLGGGGGGQLLDDD
68 68 A Q S S- 0 0 115 50 63 QQRQQRQQEHRQQQRQQQQQQQQQQQQQQRQRRRRRREDDTTSRKKSSE
69 69 A F E -B 28 0A 12 50 64 FFFFHYYYYHYHHHYYYYYYYYYYYYYYYYYYYYYYYHNNATVITTFFL
70 70 A E E +B 27 0A 122 50 40 EEEKEEEEEKEEEEEEEEEEEEEEEEEEEEEEEEEEESDDQQQQQHTTE
71 71 A L E -B 26 0A 27 50 44 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLCRHQHHLLY
72 72 A a E +B 25 0A 68 50 17 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCQQC
73 73 A G + 0 0 48 50 19 GGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGKGGGDDN
74 74 A L > + 0 0 11 50 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFFFLLL
75 75 A H T 3 + 0 0 114 50 46 HHHHPRLLLLRLLLLLLLLLLLLLLLLLLLLLLLLLLHLLLLLLLLLLS
76 76 A Q T 3 S+ 0 0 150 50 74 QQQQPPPPPPAPPPPPPPPPPPPPPPPPPPPPPPPPPFSSFFFFFFPPP
77 77 A A < - 0 0 17 50 46 AAAAAAAAAAASSSSSSSSASASSSAAAAAAAAAAAAGAAGGGGGGNNA
78 78 A P S S+ 0 0 89 50 30 PPPPTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTMSATTTT
79 79 A V E S-D 50 0B 32 50 76 VVVVATVVVAAAAAAAAAATAAAAAAAAAAAAAAAAARRRQQLLEDVVR
80 80 A Y E -DE 49 104B 8 50 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
81 81 A T E -DE 48 103B 18 50 75 TTTTAATVTTATASTVVTATATTTVTTTTTTTTTTTTVVVLHHRHREER
82 82 A L E +DE 47 102B 15 50 13 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFFFFVVF
83 83 A Q E -D 46 0B 45 50 13 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSQ
84 84 A M E +DE 45 99B 13 50 32 MMLIVMMMMMMMMMMMMMMILIMMMIIIIIIIIIIIIMLLMMMMMMIIV
85 85 A R E -D 44 0B 57 50 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
86 86 A b E +D 43 0B 31 50 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
87 87 A I - 0 0 19 50 63 IIIIIIITTIITTTTVTTTITITTTIIIIIIIIIIIIIIIRRRRRRIII
88 88 A R - 0 0 135 50 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRHHR
89 89 A S S S+ 0 0 60 50 67 SSSSSWWWWWLWWWWWWWWWWWWWWWWWWWWWWWWWWWWWHnDLIVKKR
90 90 A S S S+ 0 0 103 49 58 SSSPHPPPPPHPPPRPPRPPPPPPPPPPPPPPPPPPPPPPSsS.TTDDF
91 91 A L + 0 0 91 50 52 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLMMMAAAHGAGGL
92 92 A P + 0 0 89 50 60 PPPPPPPPPPPPPPPPPPPPPPPPLPPPPPPPPPPPPSGGHLRLMRHHE
93 93 A G S S+ 0 0 67 50 6 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGSGGGGG
94 94 A F - 0 0 188 50 67 FFLLHHHHYHDHLHHQQHQLHHRRQHHHHHHHHHHHHHHHHYYYYYFFY
95 95 A W - 0 0 56 50 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
96 96 A S - 0 0 74 50 0 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
97 97 A P - 0 0 104 50 52 PPSSRTNANDSDNDDDDEDNDDDDDDDDDNDNDNNNDDDDEEESEEDDE
98 98 A W - 0 0 61 50 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
99 99 A S E -E 84 0B 51 50 3 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSG
100 100 A P E + 0 0B 103 48 28 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPRRAPVPQP S
101 101 A G E - 0 0B 25 48 49 GGGGSSGSSSSSSSSSSSSNSSRRSSSSSSSSSSSSSSSSAGGARE D
102 102 A L E -E 82 0B 52 48 44 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLRRRKWW A
103 103 A Q E +E 81 0B 112 48 56 QQQHEEKEEEEEEEEITETETEEETEEEEEEEEEEEEEEENNNLNN S
104 104 A L E -E 80 0B 110 48 27 LLLLLLLRLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHYYFFF V
105 105 A R - 0 0 200 48 72 RRRRRQRRRNQTTTITTTTRTRTTTRRRRRRRRRRRRTTTTTTTTI L
106 106 A P - 0 0 19 48 23 PPPPTTTTTTTSTSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT T
107 107 A T - 0 0 42 46 66 TTTT TATTATTAATTAAATATAAATTTTTTTTTTTT KKHHHDHH A
108 108 A M 0 0 197 43 47 MMIM EEEQQQQQQQQQEQ QQQEEEEEEEEEEEE EEEEEEEE E
109 109 A K 0 0 195 42 40 KKKK RQRKQRQRQRRRRR QQRRRQQRRRRRRRR RRKKKKKK
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
2 2 A 0 0 0 0 0 0 0 0 67 0 33 0 0 0 0 0 0 0 0 0 3 0 0 0.637 21 0.36
3 3 A 0 0 0 0 0 0 0 0 0 0 33 0 0 0 0 67 0 0 0 0 3 0 0 0.637 21 0.24
4 4 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 0 0 0.000 0 1.00
5 5 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 88 0 12 41 0 0 0.371 12 0.89
6 6 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 44 0 0 0.000 0 1.00
7 7 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 44 0 0 0.000 0 1.00
8 8 A 11 0 9 32 0 0 0 0 2 0 0 39 0 0 2 0 5 0 0 0 44 0 0 1.509 50 0.28
9 9 A 0 95 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 44 0 0 0.185 6 0.94
10 10 A 0 0 0 2 0 34 0 0 0 0 7 0 0 0 25 7 23 2 0 0 44 0 0 1.588 53 0.26
11 11 A 2 0 0 0 0 0 0 0 55 0 9 25 0 0 0 0 0 7 0 2 44 0 0 1.250 41 0.41
12 12 A 5 57 9 20 0 0 0 0 7 0 0 0 0 0 0 0 0 2 0 0 44 0 0 1.273 42 0.58
13 13 A 0 0 5 0 0 0 0 0 11 2 0 0 0 2 2 5 7 11 0 55 44 0 0 1.547 51 0.39
14 14 A 0 0 11 0 0 0 0 0 5 68 16 0 0 0 0 0 0 0 0 0 44 0 0 0.941 31 0.43
15 15 A 11 0 0 0 0 0 0 9 0 0 66 0 0 2 2 2 0 0 7 0 44 0 0 1.181 39 0.40
16 16 A 0 4 2 0 2 0 0 0 0 83 2 2 0 2 0 0 2 0 0 0 47 0 0 0.781 26 0.57
17 17 A 0 0 0 0 4 0 0 4 2 4 2 0 0 0 0 2 0 70 0 11 47 8 2 1.135 37 0.48
18 18 A 41 3 3 3 0 0 0 3 41 3 5 0 0 0 0 0 0 0 0 0 39 0 0 1.353 45 0.32
19 19 A 13 0 3 0 0 0 0 0 72 5 8 0 0 0 0 0 0 0 0 0 39 0 0 0.945 31 0.49
20 20 A 0 8 0 0 0 0 0 0 0 77 13 0 0 0 3 0 0 0 0 0 39 0 0 0.756 25 0.48
21 21 A 0 5 0 0 0 0 3 0 3 75 0 0 0 8 8 0 0 0 0 0 40 0 0 0.939 31 0.46
22 22 A 0 4 0 0 0 0 0 0 0 0 0 0 0 0 4 0 86 6 0 0 49 0 0 0.564 18 0.73
23 23 A 0 0 0 0 0 0 0 0 27 61 2 10 0 0 0 0 0 0 0 0 49 0 0 0.965 32 0.54
24 24 A 0 0 0 0 0 0 0 78 0 0 0 0 0 0 0 0 0 0 8 14 49 0 0 0.680 22 0.69
25 25 A 0 0 0 0 0 0 0 0 4 0 0 0 96 0 0 0 0 0 0 0 50 0 0 0.168 5 0.90
26 26 A 8 86 4 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.539 17 0.84
27 27 A 0 10 0 0 0 22 0 0 2 0 4 4 0 2 16 4 34 2 0 0 50 0 0 1.844 61 0.17
28 28 A 8 84 6 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.596 19 0.82
29 29 A 0 0 0 0 0 0 0 2 8 0 10 2 28 10 26 6 4 4 0 0 50 0 0 1.952 65 0.09
30 30 A 0 0 0 0 0 98 0 0 0 0 0 2 0 0 0 0 0 0 0 0 50 0 0 0.098 3 0.91
31 31 A 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 22 0 72 0 2 50 0 0 0.777 25 0.55
32 32 A 6 2 0 2 0 0 0 2 2 60 6 8 0 0 0 8 0 0 4 0 50 0 0 1.490 49 0.31
33 33 A 0 0 0 0 0 76 0 0 18 4 2 0 0 0 0 0 0 0 0 0 50 0 0 0.724 24 0.17
34 34 A 0 0 4 0 0 2 0 0 0 6 0 0 0 2 8 50 26 2 0 0 50 0 0 1.431 47 0.34
35 35 A 0 0 0 4 0 0 0 2 0 74 4 8 0 0 2 0 4 0 0 2 50 0 0 1.046 34 0.47
36 36 A 0 0 0 0 0 0 0 24 10 0 52 6 0 0 0 0 0 0 8 0 50 2 1 1.284 42 0.51
37 37 A 0 58 2 0 2 0 0 0 4 0 6 0 0 2 0 2 4 15 4 0 48 0 0 1.488 49 0.14
38 38 A 0 2 0 0 10 0 34 0 4 0 0 0 0 50 0 0 0 0 0 0 50 0 0 1.151 38 0.42
39 39 A 2 2 64 28 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.927 30 0.64
40 40 A 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 54 34 10 50 0 0 1.008 33 0.63
41 41 A 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0 50 0 0 0.325 10 0.70
42 42 A 0 0 0 4 0 4 2 0 2 0 0 2 0 0 0 70 6 10 0 0 50 0 0 1.141 38 0.39
43 43 A 0 0 0 0 0 0 6 0 0 0 0 0 94 0 0 0 0 0 0 0 50 0 0 0.227 7 0.94
44 44 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 90 4 4 50 0 0 0.431 14 0.85
45 45 A 0 96 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.168 5 0.94
46 46 A 0 0 0 0 0 0 0 0 0 0 0 0 0 2 94 0 4 0 0 0 50 0 0 0.265 8 0.89
47 47 A 0 2 0 0 0 0 42 0 0 0 0 0 0 56 0 0 0 0 0 0 50 0 0 0.767 25 0.49
48 48 A 0 0 0 0 0 2 0 0 0 0 0 0 0 4 22 28 44 0 0 0 50 0 0 1.258 41 0.46
49 49 A 2 0 0 0 0 0 0 0 4 82 0 4 0 2 4 2 0 0 0 0 50 0 0 0.784 26 0.61
50 50 A 4 0 0 0 0 0 0 0 4 12 0 0 0 2 6 0 70 2 0 0 50 0 0 1.087 36 0.45
51 51 A 0 34 0 0 0 0 0 10 0 6 10 0 0 2 24 4 0 8 0 2 50 0 0 1.826 60 0.05
52 52 A 0 0 0 0 0 0 0 58 2 0 2 4 0 0 6 6 2 10 0 10 50 0 0 1.478 49 0.39
53 53 A 0 8 0 0 0 0 0 12 0 10 0 0 0 0 2 0 4 60 2 2 50 0 0 1.357 45 0.30
54 54 A 0 0 0 0 0 0 0 4 66 0 6 6 0 0 0 0 2 2 4 10 50 9 3 1.256 41 0.49
55 55 A 5 0 0 2 0 0 0 2 0 0 68 0 0 0 0 0 0 0 22 0 41 0 0 0.922 30 0.42
56 56 A 0 0 0 0 0 94 0 0 0 0 6 0 0 0 0 0 0 0 0 0 50 0 0 0.227 7 0.88
57 57 A 8 0 2 0 0 0 0 0 42 6 2 24 0 0 0 0 0 4 0 12 50 0 0 1.617 53 0.32
58 58 A 2 80 0 4 6 0 0 0 0 0 0 0 0 0 0 0 6 2 0 0 50 0 0 0.801 26 0.74
59 59 A 90 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 50 0 0 0.392 13 0.85
60 60 A 0 0 0 0 6 0 0 52 8 8 10 6 0 2 0 0 2 2 4 0 50 0 2 1.676 55 0.27
61 61 A 0 0 0 0 2 0 0 6 10 56 4 2 0 10 0 0 0 0 2 8 50 0 0 1.520 50 0.30
62 62 A 0 76 12 0 0 0 0 0 0 0 6 6 0 0 0 0 0 0 0 0 50 0 0 0.801 26 0.51
63 63 A 8 10 0 0 0 0 0 0 4 70 0 8 0 0 0 0 0 0 0 0 50 0 0 1.013 33 0.45
64 64 A 0 26 0 0 0 0 0 6 2 2 52 4 0 0 0 2 6 0 0 0 50 0 0 1.391 46 0.23
65 65 A 0 0 0 2 0 0 0 0 0 4 16 0 2 10 12 6 12 32 2 2 50 0 0 2.007 67 0.23
66 66 A 0 0 2 0 0 0 0 0 34 0 8 34 0 0 4 6 0 8 4 0 50 0 0 1.642 54 0.27
67 67 A 2 60 0 0 2 0 0 12 0 0 2 4 0 0 0 0 2 0 0 16 50 0 0 1.296 43 0.16
68 68 A 0 0 0 0 0 0 0 0 0 0 6 4 0 2 24 4 50 6 0 4 50 0 0 1.491 49 0.36
69 69 A 2 2 2 0 14 0 56 0 2 0 0 6 0 12 0 0 0 0 4 0 50 0 0 1.465 48 0.36
70 70 A 0 0 0 0 0 0 0 0 0 0 2 4 0 2 0 4 10 74 0 4 50 0 0 0.996 33 0.60
71 71 A 0 86 0 0 0 0 2 0 0 0 0 0 2 6 2 0 2 0 0 0 50 0 0 0.611 20 0.55
72 72 A 0 0 0 0 0 0 0 0 0 0 0 0 96 0 0 0 4 0 0 0 50 0 0 0.168 5 0.82
73 73 A 0 0 0 0 0 0 0 90 0 0 2 0 0 0 0 2 0 0 2 4 50 0 0 0.458 15 0.80
74 74 A 0 90 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.325 10 0.94
75 75 A 0 80 0 0 0 0 0 0 0 2 2 0 0 12 4 0 0 0 0 0 50 0 0 0.718 23 0.54
76 76 A 0 0 0 0 14 0 0 0 2 70 4 0 0 0 0 0 10 0 0 0 50 0 0 0.962 32 0.25
77 77 A 0 0 0 0 0 0 0 14 58 0 24 0 0 0 0 0 0 0 4 0 50 0 0 1.062 35 0.54
78 78 A 0 0 0 2 0 0 0 0 2 10 2 84 0 0 0 0 0 0 0 0 50 0 0 0.611 20 0.69
79 79 A 20 4 0 0 0 0 0 0 56 0 0 4 0 0 8 0 4 2 0 2 50 0 0 1.391 46 0.23
80 80 A 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.000 0 1.00
81 81 A 14 2 0 0 0 0 0 0 12 0 2 54 0 6 6 0 0 4 0 0 50 0 0 1.485 49 0.25
82 82 A 4 82 0 0 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.567 18 0.86
83 83 A 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 96 0 0 0 50 0 0 0.168 5 0.87
84 84 A 4 8 34 54 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 1.030 34 0.68
85 85 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 50 0 0 0.000 0 1.00
86 86 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 50 0 0 0.000 0 1.00
87 87 A 2 0 60 0 0 0 0 0 0 0 0 26 0 0 12 0 0 0 0 0 50 0 0 0.989 33 0.36
88 88 A 0 0 0 0 0 0 0 0 0 0 0 0 0 4 96 0 0 0 0 0 50 0 0 0.168 5 0.92
89 89 A 2 4 2 0 0 68 0 0 0 0 12 0 0 2 2 4 0 0 2 2 50 1 1 1.244 41 0.33
90 90 A 0 0 0 0 2 0 0 0 0 67 14 4 0 4 4 0 0 0 0 4 49 0 0 1.146 38 0.41
91 91 A 0 78 0 6 0 0 0 6 8 0 0 0 0 2 0 0 0 0 0 0 50 0 0 0.812 27 0.47
92 92 A 0 6 0 2 0 0 0 4 0 74 2 0 0 6 4 0 0 2 0 0 50 0 0 1.053 35 0.40
93 93 A 0 0 0 0 0 0 0 96 2 0 2 0 0 0 0 0 0 0 0 0 50 0 0 0.196 6 0.93
94 94 A 0 8 0 0 10 0 14 0 0 0 0 0 0 54 4 0 8 0 0 2 50 0 0 1.449 48 0.33
95 95 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.000 0 1.00
96 96 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 50 0 0 0.000 0 1.00
97 97 A 0 0 0 0 0 0 0 0 2 6 8 2 0 0 2 0 0 14 18 48 50 0 0 1.542 51 0.48
98 98 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0.000 0 1.00
99 99 A 0 0 0 0 0 0 0 2 0 0 98 0 0 0 0 0 0 0 0 0 50 0 0 0.098 3 0.96
100 100 A 2 0 0 0 0 0 0 0 2 88 2 0 0 0 4 0 2 0 0 0 48 0 0 0.572 19 0.71
101 101 A 0 0 0 0 0 0 0 17 4 0 67 0 0 0 6 0 0 2 2 2 48 0 0 1.117 37 0.51
102 102 A 0 85 0 0 0 4 0 0 2 0 0 0 0 0 6 2 0 0 0 0 48 0 0 0.602 20 0.55
103 103 A 0 2 2 0 0 0 0 0 0 0 2 8 0 2 0 2 8 63 10 0 48 0 0 1.347 44 0.44
104 104 A 2 83 0 0 6 0 4 0 0 0 0 0 0 2 2 0 0 0 0 0 48 0 0 0.700 23 0.72
105 105 A 0 2 4 0 0 0 0 0 0 0 0 40 0 0 48 0 4 0 2 0 48 0 0 1.146 38 0.27
106 106 A 0 0 0 0 0 0 0 0 0 10 4 85 0 0 0 0 0 0 0 0 48 0 0 0.503 16 0.77
107 107 A 0 0 0 0 0 0 0 0 26 0 0 57 0 11 0 4 0 0 0 2 46 0 0 1.134 37 0.33
108 108 A 0 0 2 9 0 0 0 0 0 0 0 0 0 0 0 0 30 58 0 0 43 0 0 0.985 32 0.53
109 109 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 52 29 19 0 0 0 42 0 0 1.012 33 0.59
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
25 34 260 2 sLCl
38 35 279 3 nWVLm
39 15 240 2 gLDs
39 52 279 3 tWVLv
40 15 248 2 gLDs
40 52 287 3 tWVLv
42 68 300 2 nSAs
47 50 267 2 pEEd
48 50 246 2 pEEd
//