Complet list of 1gcf hssp file
Complete list of 1gcf.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1GCF
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-29
HEADER BINDING PROTEIN 10-APR-97 1GCF
COMPND MOL_ID: 1; MOLECULE: GRANULOCYTE COLONY-STIMULATING FACTOR RECEPTOR; C
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: MUS MUSCULUS; ORGANISM_COMMON: HOUSE M
AUTHOR K.YAMASAKI,S.NAITO,H.ANAGUCHI,T.OHKUBO,Y.OTA
DBREF 1GCF A 1 109 UNP P40223 CSF3R_MOUSE 211 333
SEQLENGTH 109
NCHAIN 1 chain(s) in 1GCF data set
NALIGN 50
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : CSF3R_MOUSE 1.00 1.00 4 109 228 333 106 0 0 837 P40223 Granulocyte colony-stimulating factor receptor OS=Mus musculus GN=Csf3r PE=1 SV=2
2 : Q14AY3_MOUSE 1.00 1.00 4 109 228 333 106 0 0 837 Q14AY3 Colony stimulating factor 3 receptor (Granulocyte) OS=Mus musculus GN=Csf3r PE=2 SV=1
3 : D4A3R0_RAT 0.88 0.96 4 109 228 333 106 0 0 838 D4A3R0 Colony stimulating factor 3 receptor (Granulocyte) (Predicted), isoform CRA_b OS=Rattus norvegicus GN=Csf3r PE=4 SV=1
4 : G3H9X4_CRIGR 0.81 0.95 4 109 228 333 106 0 0 841 G3H9X4 Granulocyte colony-stimulating factor receptor OS=Cricetulus griseus GN=I79_007205 PE=4 SV=1
5 : I3LX16_SPETR 0.65 0.73 25 106 242 323 82 0 0 817 I3LX16 Uncharacterized protein OS=Spermophilus tridecemlineatus GN=CSF3R PE=4 SV=1
6 : G5C6G6_HETGA 0.62 0.77 4 107 228 331 104 0 0 887 G5C6G6 Granulocyte colony-stimulating factor receptor OS=Heterocephalus glaber GN=GW7_05417 PE=4 SV=1
7 : H0WL79_OTOGA 0.61 0.73 4 107 227 330 104 0 0 838 H0WL79 Uncharacterized protein OS=Otolemur garnettii GN=CSF3R PE=4 SV=1
8 : G1T0W6_RABIT 0.60 0.72 4 109 250 355 106 0 0 946 G1T0W6 Uncharacterized protein OS=Oryctolagus cuniculus GN=CSF3R PE=4 SV=2
9 : L9JC93_TUPCH 0.60 0.73 4 109 227 332 106 0 0 980 L9JC93 Granulocyte colony-stimulating factor receptor OS=Tupaia chinensis GN=TREES_T100000931 PE=3 SV=1
10 : G3TD98_LOXAF 0.59 0.75 4 109 227 332 106 0 0 840 G3TD98 Uncharacterized protein OS=Loxodonta africana GN=CSF3R PE=4 SV=1
11 : H0VHX4_CAVPO 0.59 0.75 4 109 228 333 106 0 0 842 H0VHX4 Uncharacterized protein OS=Cavia porcellus GN=CSF3R PE=4 SV=1
12 : S7PFH5_MYOBR 0.59 0.74 4 109 228 333 106 0 0 842 S7PFH5 Granulocyte colony-stimulating factor receptor OS=Myotis brandtii GN=D623_10028902 PE=4 SV=1
13 : F6ZM36_HORSE 0.58 0.73 4 109 227 332 106 0 0 841 F6ZM36 Uncharacterized protein OS=Equus caballus GN=CSF3R PE=4 SV=1
14 : G1P748_MYOLU 0.58 0.74 4 109 228 333 106 0 0 842 G1P748 Uncharacterized protein OS=Myotis lucifugus GN=CSF3R PE=4 SV=1
15 : S9WWB8_9CETA 0.58 0.70 4 109 227 332 106 0 0 986 S9WWB8 Granulocyte colony-stimulating factor receptor isoform 1 OS=Camelus ferus GN=CB1_000884035 PE=4 SV=1
16 : M3WIJ4_FELCA 0.57 0.69 4 109 227 332 106 0 0 839 M3WIJ4 Uncharacterized protein OS=Felis catus GN=CSF3R PE=4 SV=1
17 : M3YTJ9_MUSPF 0.57 0.67 4 109 227 332 106 0 0 839 M3YTJ9 Uncharacterized protein OS=Mustela putorius furo GN=CSF3R PE=4 SV=1
18 : W5QEP2_SHEEP 0.57 0.69 4 109 227 332 106 0 0 843 W5QEP2 Uncharacterized protein OS=Ovis aries GN=CSF3R PE=4 SV=1
19 : E1B9W2_BOVIN 0.56 0.69 4 109 227 332 106 0 0 841 E1B9W2 Uncharacterized protein OS=Bos taurus GN=CSF3R PE=4 SV=2
20 : E2R8G9_CANFA 0.56 0.68 4 109 227 332 106 0 0 840 E2R8G9 Uncharacterized protein OS=Canis familiaris GN=CSF3R PE=4 SV=1
21 : F7EA72_CALJA 0.56 0.73 4 109 214 319 106 0 0 822 F7EA72 Uncharacterized protein OS=Callithrix jacchus GN=CSF3R PE=4 SV=1
22 : G1L3X5_AILME 0.56 0.69 4 109 226 331 106 0 0 838 G1L3X5 Uncharacterized protein OS=Ailuropoda melanoleuca GN=CSF3R PE=4 SV=1
23 : I3LQQ8_PIG 0.56 0.69 4 109 229 334 106 0 0 846 I3LQQ8 Uncharacterized protein OS=Sus scrofa GN=CSF3R PE=4 SV=1
24 : K9IWH4_PIG 0.56 0.69 4 109 228 333 106 0 0 848 K9IWH4 Granulocyte colony-stimulating factor receptor OS=Sus scrofa GN=CSF3R PE=2 SV=1
25 : H2PYN7_PANTR 0.55 0.72 4 108 227 331 105 0 0 863 H2PYN7 Uncharacterized protein OS=Pan troglodytes GN=CSF3R PE=4 SV=1
26 : M1ELE4_MUSPF 0.55 0.65 4 109 227 334 108 1 2 840 M1ELE4 Colony stimulating factor 3 receptor (Fragment) OS=Mustela putorius furo PE=2 SV=1
27 : CSF3R_HUMAN 0.54 0.73 4 109 227 332 106 0 0 836 Q99062 Granulocyte colony-stimulating factor receptor OS=Homo sapiens GN=CSF3R PE=1 SV=1
28 : E1B6W6_HUMAN 0.54 0.73 4 109 227 332 106 0 0 769 E1B6W6 Colony stimulating factor 3 receptor (Granulocyte), isoform CRA_a OS=Homo sapiens GN=CSF3R PE=4 SV=1
29 : G3RCL2_GORGO 0.54 0.73 4 109 227 332 106 0 0 862 G3RCL2 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101128750 PE=4 SV=1
30 : G3S1M9_GORGO 0.54 0.73 4 109 227 332 106 0 0 836 G3S1M9 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101128750 PE=4 SV=1
31 : G7NTI8_MACFA 0.54 0.73 4 109 227 332 106 0 0 863 G7NTI8 Putative uncharacterized protein OS=Macaca fascicularis GN=EGM_00486 PE=4 SV=1
32 : Q1ZYL6_HUMAN 0.54 0.73 4 109 227 332 106 0 0 836 Q1ZYL6 Granulocyte colony-stimulating factor receptor OS=Homo sapiens PE=2 SV=1
33 : G7MFW6_MACMU 0.53 0.73 4 109 227 332 106 0 0 863 G7MFW6 Putative uncharacterized protein OS=Macaca mulatta GN=EGK_00567 PE=4 SV=1
34 : H2N7Y9_PONAB 0.53 0.73 4 109 227 332 106 0 0 834 H2N7Y9 Uncharacterized protein OS=Pongo abelii GN=CSF3R PE=4 SV=1
35 : F6TTY1_MACMU 0.52 0.70 16 109 240 333 94 0 0 865 F6TTY1 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=4 SV=1
36 : F6TTZ1_MACMU 0.52 0.70 16 109 240 333 94 0 0 837 F6TTZ1 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=4 SV=1
37 : F6TU14_MACMU 0.52 0.70 16 109 240 333 94 0 0 784 F6TU14 Uncharacterized protein OS=Macaca mulatta GN=CSF3R PE=4 SV=1
38 : G1RLA1_NOMLE 0.51 0.73 4 109 227 332 106 0 0 863 G1RLA1 Uncharacterized protein OS=Nomascus leucogenys GN=CSF3R PE=4 SV=1
39 : F6ZY07_MONDO 0.42 0.58 21 106 245 333 89 1 3 866 F6ZY07 Uncharacterized protein OS=Monodelphis domestica GN=CSF3R PE=4 SV=2
40 : G3X148_SARHA 0.38 0.58 4 109 226 336 111 2 5 863 G3X148 Uncharacterized protein OS=Sarcophilus harrisii GN=CSF3R PE=4 SV=1
41 : G3X149_SARHA 0.38 0.58 4 109 234 344 111 2 5 871 G3X149 Uncharacterized protein (Fragment) OS=Sarcophilus harrisii GN=CSF3R PE=4 SV=1
42 : U3IK55_ANAPL 0.37 0.53 4 109 129 229 106 2 5 735 U3IK55 Uncharacterized protein (Fragment) OS=Anas platyrhynchos GN=CSF3R PE=4 SV=1
43 : U3JDQ5_FICAL 0.36 0.57 22 109 233 321 90 2 3 702 U3JDQ5 Uncharacterized protein OS=Ficedula albicollis GN=CSF3R PE=4 SV=1
44 : G1MWV1_MELGA 0.33 0.53 6 109 243 341 104 2 5 785 G1MWV1 Uncharacterized protein OS=Meleagris gallopavo GN=CSF3R PE=4 SV=1
45 : V8NJY1_OPHHA 0.33 0.61 5 109 346 444 105 3 6 1011 V8NJY1 Granulocyte colony-stimulating factor receptor (Fragment) OS=Ophiophagus hannah GN=CSF3R PE=4 SV=1
46 : K7FS22_PELSI 0.32 0.57 2 109 227 329 108 2 5 850 K7FS22 Uncharacterized protein OS=Pelodiscus sinensis GN=CSF3R PE=4 SV=1
47 : M7BAB9_CHEMY 0.32 0.60 2 109 227 329 108 2 5 855 M7BAB9 Granulocyte colony-stimulating factor receptor OS=Chelonia mydas GN=UY3_07910 PE=4 SV=1
48 : O57519_XENLA 0.30 0.47 6 99 218 307 96 4 8 881 O57519 Gp130p1 OS=Xenopus laevis GN=il6st PE=2 SV=1
49 : Q2TAD1_XENLA 0.30 0.47 6 99 197 286 96 4 8 861 Q2TAD1 Uncharacterized protein (Fragment) OS=Xenopus laevis PE=2 SV=1
50 : Q6DD37_XENLA 0.30 0.52 4 108 219 318 105 2 5 813 Q6DD37 Csf3r-prov protein OS=Xenopus laevis GN=csf3r PE=2 SV=1
## ALIGNMENTS 1 - 50
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A G 0 0 50 1 0
2 2 A S + 0 0 106 3 64 AA
3 3 A S S S- 0 0 80 3 76 KK
4 4 A L S S+ 0 0 175 41 0 LLLL LLLLLLLLLLLLLLLLLLLLLLLLLLLLL L LLL LL L
5 5 A E S S- 0 0 167 42 11 EEEE EEEEEEEEEEEEEEEEEEEEEEEEEEEEE E DDE EDD D
6 6 A P - 0 0 86 45 0 PPPP PPPPPPPPPPPPPPPPPPPPPPPPPPPPP P PPP PPPPPPP
7 7 A P - 0 0 48 45 0 PPPP PPPPPPPPPPPPPPPPPPPPPPPPPPPPP P PPP PPPPPPP
8 8 A M E -A 31 0A 136 45 72 MMMV TITRTTTTTTTTTTTMTTTMTMMMMMMMM V IIA TIVVQQV
9 9 A L E +A 30 0A 23 45 6 LLLL LLLLLLLLLLLLLLLLLLLLLLLLLLLLL L LLL LILLLLI
10 10 A Q E -A 29 0A 110 45 73 QQQQ MWWWESWWWWWWWWWRWWWRWRRRRRRRR R QQQ QQKKSSK
11 11 A A E -A 28 0A 34 45 58 AAAA AVAAAAAAAAAAAAATAAATATTTTTTTT T AAS SDSSEEE
12 12 A L - 0 0 40 45 41 LLLV LELLLLLLLLLLLLLLLLLMLMMMMLMMM M AAI IIVILLA
13 13 A D + 0 0 153 45 61 DDDD DEDDDDDDDEAAEEPDAEEDADDDDDDDD D KKQ HRQQIIA
14 14 A I + 0 0 42 45 55 IIII PPSPPPPPPPPPPPPPPPPPPPPPPPPPP P AAS SPSSSSP
15 15 A G S S+ 0 0 55 45 59 GGGS NSSRSNSKSSSSSSSNSSSSSSSSSSSSS S SSV VHVVSSV
16 16 A P S S- 0 0 108 48 42 PPHP PPPQPSPTPPPPPPPPPPPPPPPPPPPPPPPPP PPP PFPPLLI
17 17 A D - 0 0 95 48 53 DDDD EEKEEEEEEEEEAAEEEEEEEEEEEEEEEEEEE ggP PEFFEES
18 18 A V - 0 0 16 40 67 VVIL MAGPVVVVVVVVVVAAVVVAVAAAAAAAAAAAA ss. .......
19 19 A V S S+ 0 0 116 40 50 VVVI AAASAAAVAPAAAAAPAAAAAAAAAAAAAAAAA SS. .......
20 20 A S S S- 0 0 55 40 50 SSSP LSPPPRPPPPPPPPPPPPPPPPPPPPPPPPPPP LL. .......
21 21 A H + 0 0 168 41 52 HHRY PPPPAPPLPLPPPPPPPPPPPPPPPPPPPPPPPPRR. .......
22 22 A Q - 0 0 78 50 26 QQQQ QQQQQRQQQQQQQQQEQQQQQQQQQQQQQQQQEQQQQQQEQQLLR
23 23 A P S S- 0 0 121 50 45 PPPP PPPPPPPPPPPPPPPPPPPAPAAAAAAAAAAAAPPPTTTSTTPPP
24 24 A G S S+ 0 0 17 50 30 GGGG DGGGDDGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDDDNNDNNG
25 25 A a E - B 0 72A 6 51 9 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCAAC
26 26 A L E - B 0 71A 18 51 16 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVIVMVILLV
27 27 A W E + B 0 70A 16 51 82 WWWWWWWQWWWQRQRLLRRLHLRRQLQQQQQQQQQQQQQRRTATESSKKR
28 28 A L E -AB 11 69A 4 51 17 LLLLLLVLLFLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLVVVIII
29 29 A S E +A 10 0A 14 51 90 SSSSHRHHGRRHRHRRRRRRCRRRCRCCCCCCCCCCCCQKKAATKAAEEQ
30 30 A W E -AB 9 66A 9 51 8 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWT
31 31 A K E -A 8 0A 107 51 44 KKKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKKKADAEEEKKK
32 32 A P - 0 0 22 51 69 PPPPPPPPPSPSPSTLPTTPPPTTPPPPPPPPPPPPPPKKKVVMAVGNNK
33 33 A W + 0 0 113 51 82 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWAAAAAAAAAPPS
34 34 A K - 0 0 49 51 65 KKKKKKKKKKKKKKKKKKRKKKKKQKQQQQQQQQQQQQRRRPWPKHPIIE
35 35 A P - 0 0 95 51 52 PPPPPPPPQPPPPSPPPPPPPPPPPPPPPPPPPPPPPPQMMSGRDTTTTP
36 36 A S S S+ 0 0 124 51 48 SSSGSSSSSSSTGTSSSSSSGSSSGsGGGGAGAGAAAGNSSSTNSSSNNG
37 37 A E S S- 0 0 85 49 85 EEEELILHLLFQLQLLLLLLLLLLLlLLLLLLLLLLLLKNNAEAESS..S
38 38 A Y + 0 0 194 51 58 YYFYFYHFYFYYYYHYYYYYFYHHHYHHHHHHHHHHHHHHHHHHLHHAAY
39 39 A M - 0 0 94 51 35 MMMIMMIIVMMIIIIIIIIIIIIIIIIIIIIIIIIIIILMMMMMMIIFFI
40 40 A E + 0 0 177 51 37 EEEEDDSDEENDEDEEEEEENEEENENNNNNNNNNNNNEEEEEEEEENNE
41 41 A Q - 0 0 30 51 29 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLQQQLLQ
42 42 A E E -C 87 0B 72 51 60 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKTMMWQYAQQKKW
43 43 A b E -CD 86 62B 0 51 6 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCYYY
44 44 A E E -C 85 0B 21 51 14 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEEDEENNQ
45 45 A L E -CD 84 59B 1 51 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLIIL
46 46 A R E +CD 83 58B 68 51 11 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRQQRRRRRHRRR
47 47 A Y E -CD 82 57B 38 51 50 YYYYLYYHHYYHHHHHHHHHHHHHHHHHHHHHHHHHHHYYYYYYYYYYYY
48 48 A Q E -C 81 0B 15 51 53 QQQRRRQQQQRQQQQQQQQQKQQQKQKKKKKKKKKKKKQHHRRRQWRRRR
49 49 A P E -C 80 0B 14 51 38 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPHRRPAAVTTPPK
50 50 A Q E +C 79 0B 78 51 56 QQQQQQQQRQQQQQQQQRRRQQQQQQQQQQQQQQQQQQPPPPPAEPHVVA
51 51 A L S S- 0 0 65 51 95 LLLLRLPLLLLRLPLLLLLLRPRRRLRRRRSRSHSSSRGGGEEEGGEKKD
52 52 A K S S- 0 0 197 51 59 KKERGRGEGRGGGGGGGGGGGGGGAGGGGGGGGGGGGGEEEDDDSDDTTQ
53 53 A G S S- 0 0 42 51 69 GGGREEDEEEEGEGEEEEEEEEEEEEEEEEEEEEEEEELLLPPPLPPQQN
54 54 A A - 0 0 48 51 50 AAAAAAADAAAATAAADAAAAASSADAAAAAAAAAAAAnttNASEGGDDQ
55 55 A N - 0 0 153 42 58 NNNNNSSNSSSSSNSSSGGSSSSSNSSSSSSSSSSSSSmvv.........
56 56 A W - 0 0 84 51 12 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWSSSWWWWWWWWW
57 57 A T E -D 47 0B 62 51 68 TTTTTATTVAATATDAVDDDAADDAVAAAAAAAAAAAAPPPVADSTTEEI
58 58 A L E -D 46 0B 75 51 26 LLLLQQLLLLQLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFLLVLLLMME
59 59 A V E -D 45 0B 16 51 14 VVVVVVEVVVVVVVVVVVVVEVVVVVVVVVVVVVVVVVLLLVVVVVVVVV
60 60 A F E + 0 0B 113 51 72 FFSSAAGAEAGGGGGGGSSGSGGGGGGGGGGGGGGGGGSNNTTTHPPppQ
61 61 A H E S+ 0 0B 143 51 70 HHHHTPAPPPPPPPAPPGAPPPAAPPPPPPPPPPPPPPFSSGGGNDDddP
62 62 A L E -D 43 0B 5 51 48 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLSSSIIIIIITTT
63 63 A P - 0 0 57 51 54 PPPPPPPPPPPPPPPLLPPLPLPPPLPPPPPPPPPPPPPTTVVPTVVAAT
64 64 A S S S+ 0 0 24 51 76 SSSSSSTSSSSSSSSPSAASSSSSLSLLLLLLLLLLLLQQQGGGTSSSSK
65 65 A S S S- 0 0 61 51 77 SSSSSSEMKCRKRKREQRRQEQRREQEEEEEEEEEEEENHHQQHSPPHHD
66 66 A K E -B 30 0A 47 51 72 KKTEITTTTTTTTTTTTTTTATSSATAAAAAAAAAAAAEEEAAASNNRRS
67 67 A D E S- 0 0A 148 51 85 DDDDLTLVLLTLLLLLLPSLFLLLLLLLLLLLLLLLLLGGGGGGQLLDDD
68 68 A Q E S- 0 0A 110 51 63 QQRQQRQQEHRQQQRQQQQQQQQQQQQQQQRQRRRRRREDDTTSRKKSSE
69 69 A F E -B 28 0A 11 51 63 FFFFHYYYYHYHHHYYYYYYYYYYYYYYYYYYYYYYYYHNNATVITTFFL
70 70 A E E +B 27 0A 104 51 39 EEEKEEEEEKEEEEEEEEEEEEEEEEEEEEEEEEEEEESDDQQQQQHTTE
71 71 A L E +B 26 0A 21 51 44 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLCRHQHHLLY
72 72 A a E +B 25 0A 56 51 17 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCQQC
73 73 A G + 0 0 51 51 19 GGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGKGGGDDN
74 74 A L + 0 0 8 51 5 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFFFLLL
75 75 A H + 0 0 111 51 45 HHHHPRLLLLRLLLLLLLLLLLLLLLLLLLLLLLLLLLHLLLLLLLLLLS
76 76 A Q S S+ 0 0 152 51 73 QQQQPPPPPPAPPPPPPPPPPPPPPPPPPPPPPPPPPPFSSFFFFFFPPP
77 77 A A S S- 0 0 18 51 46 AAAAAAAAAAASSSSSSSSSASSSASAAAAAAAAAAAAGAAGGGGGGNNA
78 78 A P S S+ 0 0 87 51 30 PPPPTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTMSATTTT
79 79 A V E S-C 50 0B 30 51 76 VVVVATVVVAAAAAAAAAAATAAAAAAAAAAAAAAAAARRRQQLLEDVVR
80 80 A Y E -CE 49 104B 12 51 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
81 81 A T E -CE 48 103B 11 51 74 TTTTAATVTTATASTVVTTATATTTVTTTTTTTTTTTTVVVLHHRHREER
82 82 A L E +CE 47 102B 15 51 13 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLFFFFFFVVF
83 83 A Q E -C 46 0B 50 51 13 QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSQ
84 84 A M E -CE 45 99B 11 51 31 MMLIVMMMMMMMMMMMMMMMILMMIMIIIIIIIIIIIIMLLMMMMMMIIV
85 85 A R E -C 44 0B 77 51 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
86 86 A b E +C 43 0B 39 51 0 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
87 87 A I E -C 42 0B 19 51 64 IIIIIIITTIITTTTVTTTTITTTITIIIIIIIIIIIIIIIRRRRRRIII
88 88 A R - 0 0 125 51 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRHHR
89 89 A S S S- 0 0 73 51 66 SSSSSWWWWWLWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWHnDLIVKKR
90 90 A S S S+ 0 0 94 50 59 SSSPHPPPPPHPPPRPPRRPPPPPPPPPPPPPPPPPPPPPPSsS.TTDDF
91 91 A L - 0 0 86 51 51 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLMMMAAAHGAGGL
92 92 A P + 0 0 105 51 59 PPPPPPPPPPPPPPPPPPPPPPPPPLPPPPPPPPPPPPSGGHLRLMRHHE
93 93 A G S S+ 0 0 54 51 6 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGSGGGGG
94 94 A F - 0 0 149 51 66 FFLLHHHHYHDHLHHQQHHQLHRRHQHHHHHHHHHHHHHHHHYYYYYFFY
95 95 A W - 0 0 80 51 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
96 96 A S - 0 0 58 51 0 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
97 97 A P - 0 0 94 51 51 PPSSRTNANDSDNDDDDEEDNDDDDDDDDDNDNDNNNDDDDEEESEEDDE
98 98 A W - 0 0 62 51 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
99 99 A S E -E 84 0B 47 51 3 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSG
100 100 A P E + 0 0B 111 49 28 PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPRRAPVPQP S
101 101 A G E - 0 0B 23 49 48 GGGGSSGSSSSSSSSSSSSSNSRRSSSSSSSSSSSSSSSSSAGGARE D
102 102 A L E -E 82 0B 41 49 44 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLRRRKWW A
103 103 A Q E +E 81 0B 110 49 55 QQQHEEKEEEEEEEEITEETETEEETEEEEEEEEEEEEEEENNNLNN S
104 104 A L E -E 80 0B 86 49 27 LLLLLLLRLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHYYFFF V
105 105 A R - 0 0 160 49 72 RRRRRQRRRNQTTTITTTTTRTTTRTRRRRRRRRRRRRTTTTTTTTI L
106 106 A P - 0 0 22 49 22 PPPPTTTTTTTSTSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT T
107 107 A T - 0 0 38 47 66 TTTT TATTATTAATTAAAATAAATATTTTTTTTTTTT KKHHHDHH A
108 108 A M 0 0 190 45 47 MMIM EEEQQQQQQQQQQEQQQEQEEEEEEEEEEEE EEEEEEEE E
109 109 A K 0 0 211 43 40 KKKK RQRKQRQRQRRRRRRQQ RRRQQRRRRRRRR RRKKKKKK
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0.000 0 1.00
2 2 A 0 0 0 0 0 0 0 0 67 0 33 0 0 0 0 0 0 0 0 0 3 0 0 0.637 21 0.36
3 3 A 0 0 0 0 0 0 0 0 0 0 33 0 0 0 0 67 0 0 0 0 3 0 0 0.637 21 0.24
4 4 A 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 41 0 0 0.000 0 1.00
5 5 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 88 0 12 42 0 0 0.365 12 0.89
6 6 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 45 0 0 0.000 0 1.00
7 7 A 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 45 0 0 0.000 0 1.00
8 8 A 11 0 9 31 0 0 0 0 2 0 0 40 0 0 2 0 4 0 0 0 45 0 0 1.497 49 0.28
9 9 A 0 96 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 45 0 0 0.182 6 0.94
10 10 A 0 0 0 2 0 36 0 0 0 0 7 0 0 0 24 7 22 2 0 0 45 0 0 1.577 52 0.26
11 11 A 2 0 0 0 0 0 0 0 56 0 9 24 0 0 0 0 0 7 0 2 45 0 0 1.236 41 0.42
12 12 A 4 58 9 20 0 0 0 0 7 0 0 0 0 0 0 0 0 2 0 0 45 0 0 1.257 41 0.59
13 13 A 0 0 4 0 0 0 0 0 11 2 0 0 0 2 2 4 7 13 0 53 45 0 0 1.559 52 0.39
14 14 A 0 0 11 0 0 0 0 0 4 69 16 0 0 0 0 0 0 0 0 0 45 0 0 0.929 31 0.44
15 15 A 11 0 0 0 0 0 0 9 0 0 67 0 0 2 2 2 0 0 7 0 45 0 0 1.164 38 0.40
16 16 A 0 4 2 0 2 0 0 0 0 83 2 2 0 2 0 0 2 0 0 0 48 0 0 0.768 25 0.57
17 17 A 0 0 0 0 4 0 0 4 4 4 2 0 0 0 0 2 0 69 0 10 48 8 2 1.184 39 0.47
18 18 A 43 3 3 3 0 0 0 3 40 3 5 0 0 0 0 0 0 0 0 0 40 0 0 1.341 44 0.33
19 19 A 13 0 3 0 0 0 0 0 73 5 8 0 0 0 0 0 0 0 0 0 40 0 0 0.929 31 0.50
20 20 A 0 8 0 0 0 0 0 0 0 77 13 0 0 0 3 0 0 0 0 0 40 0 0 0.744 24 0.49
21 21 A 0 5 0 0 0 0 2 0 2 76 0 0 0 7 7 0 0 0 0 0 41 0 0 0.923 30 0.47
22 22 A 0 4 0 0 0 0 0 0 0 0 0 0 0 0 4 0 86 6 0 0 50 0 0 0.556 18 0.74
23 23 A 0 0 0 0 0 0 0 0 26 62 2 10 0 0 0 0 0 0 0 0 50 0 0 0.955 31 0.55
24 24 A 0 0 0 0 0 0 0 78 0 0 0 0 0 0 0 0 0 0 8 14 50 0 0 0.671 22 0.69
25 25 A 0 0 0 0 0 0 0 0 4 0 0 0 96 0 0 0 0 0 0 0 51 0 0 0.165 5 0.90
26 26 A 8 86 4 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.531 17 0.84
27 27 A 0 10 0 0 0 22 0 0 2 0 4 4 0 2 18 4 33 2 0 0 51 0 0 1.843 61 0.18
28 28 A 8 84 6 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.587 19 0.83
29 29 A 0 0 0 0 0 0 0 2 8 0 10 2 27 10 27 6 4 4 0 0 51 0 0 1.940 64 0.10
30 30 A 0 0 0 0 0 98 0 0 0 0 0 2 0 0 0 0 0 0 0 0 51 0 0 0.097 3 0.91
31 31 A 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 22 0 73 0 2 51 0 0 0.768 25 0.55
32 32 A 6 2 0 2 0 0 0 2 2 59 6 10 0 0 0 8 0 0 4 0 51 0 0 1.508 50 0.31
33 33 A 0 0 0 0 0 76 0 0 18 4 2 0 0 0 0 0 0 0 0 0 51 0 0 0.715 23 0.18
34 34 A 0 0 4 0 0 2 0 0 0 6 0 0 0 2 8 51 25 2 0 0 51 0 0 1.416 47 0.35
35 35 A 0 0 0 4 0 0 0 2 0 75 4 8 0 0 2 0 4 0 0 2 51 0 0 1.031 34 0.48
36 36 A 0 0 0 0 0 0 0 24 10 0 53 6 0 0 0 0 0 0 8 0 51 2 1 1.271 42 0.51
37 37 A 0 59 2 0 2 0 0 0 4 0 6 0 0 2 0 2 4 14 4 0 49 0 0 1.469 49 0.15
38 38 A 0 2 0 0 10 0 35 0 4 0 0 0 0 49 0 0 0 0 0 0 51 0 0 1.149 38 0.42
39 39 A 2 2 65 27 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.918 30 0.65
40 40 A 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 55 33 10 51 0 0 1.000 33 0.63
41 41 A 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 90 0 0 0 51 0 0 0.321 10 0.70
42 42 A 0 0 0 4 0 4 2 0 2 0 0 2 0 0 0 71 6 10 0 0 51 0 0 1.126 37 0.39
43 43 A 0 0 0 0 0 0 6 0 0 0 0 0 94 0 0 0 0 0 0 0 51 0 0 0.224 7 0.94
44 44 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 90 4 4 51 0 0 0.424 14 0.85
45 45 A 0 96 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.165 5 0.94
46 46 A 0 0 0 0 0 0 0 0 0 0 0 0 0 2 94 0 4 0 0 0 51 0 0 0.261 8 0.89
47 47 A 0 2 0 0 0 0 41 0 0 0 0 0 0 57 0 0 0 0 0 0 51 0 0 0.763 25 0.49
48 48 A 0 0 0 0 0 2 0 0 0 0 0 0 0 4 22 27 45 0 0 0 51 0 0 1.249 41 0.46
49 49 A 2 0 0 0 0 0 0 0 4 82 0 4 0 2 4 2 0 0 0 0 51 0 0 0.772 25 0.61
50 50 A 4 0 0 0 0 0 0 0 4 12 0 0 0 2 8 0 69 2 0 0 51 0 0 1.118 37 0.44
51 51 A 0 35 0 0 0 0 0 10 0 6 10 0 0 2 24 4 0 8 0 2 51 0 0 1.811 60 0.05
52 52 A 0 0 0 0 0 0 0 59 2 0 2 4 0 0 6 6 2 10 0 10 51 0 0 1.459 48 0.40
53 53 A 0 8 0 0 0 0 0 12 0 10 0 0 0 0 2 0 4 61 2 2 51 0 0 1.340 44 0.31
54 54 A 0 0 0 0 0 0 0 4 67 0 6 6 0 0 0 0 2 2 4 10 51 9 3 1.240 41 0.50
55 55 A 5 0 0 2 0 0 0 5 0 0 67 0 0 0 0 0 0 0 21 0 42 0 0 0.979 32 0.41
56 56 A 0 0 0 0 0 94 0 0 0 0 6 0 0 0 0 0 0 0 0 0 51 0 0 0.224 7 0.88
57 57 A 8 0 2 0 0 0 0 0 41 6 2 24 0 0 0 0 0 4 0 14 51 0 0 1.626 54 0.31
58 58 A 2 80 0 4 6 0 0 0 0 0 0 0 0 0 0 0 6 2 0 0 51 0 0 0.790 26 0.74
59 59 A 90 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 51 0 0 0.387 12 0.85
60 60 A 0 0 0 0 6 0 0 51 8 8 12 6 0 2 0 0 2 2 4 0 51 0 2 1.686 56 0.28
61 61 A 0 0 0 0 2 0 0 8 10 55 4 2 0 10 0 0 0 0 2 8 51 0 0 1.542 51 0.30
62 62 A 0 76 12 0 0 0 0 0 0 0 6 6 0 0 0 0 0 0 0 0 51 0 0 0.790 26 0.51
63 63 A 8 10 0 0 0 0 0 0 4 71 0 8 0 0 0 0 0 0 0 0 51 0 0 1.000 33 0.45
64 64 A 0 25 0 0 0 0 0 6 4 2 51 4 0 0 0 2 6 0 0 0 51 0 0 1.433 47 0.23
65 65 A 0 0 0 2 0 0 0 0 0 4 16 0 2 10 14 6 12 31 2 2 51 0 0 2.008 67 0.23
66 66 A 0 0 2 0 0 0 0 0 33 0 8 35 0 0 4 6 0 8 4 0 51 0 0 1.631 54 0.27
67 67 A 2 59 0 0 2 0 0 12 0 2 2 4 0 0 0 0 2 0 0 16 51 0 0 1.367 45 0.15
68 68 A 0 0 0 0 0 0 0 0 0 0 6 4 0 2 24 4 51 6 0 4 51 0 0 1.475 49 0.37
69 69 A 2 2 2 0 14 0 57 0 2 0 0 6 0 12 0 0 0 0 4 0 51 0 0 1.447 48 0.36
70 70 A 0 0 0 0 0 0 0 0 0 0 2 4 0 2 0 4 10 75 0 4 51 0 0 0.982 32 0.60
71 71 A 0 86 0 0 0 0 2 0 0 0 0 0 2 6 2 0 2 0 0 0 51 0 0 0.602 20 0.56
72 72 A 0 0 0 0 0 0 0 0 0 0 0 0 96 0 0 0 4 0 0 0 51 0 0 0.165 5 0.83
73 73 A 0 0 0 0 0 0 0 90 0 0 2 0 0 0 0 2 0 0 2 4 51 0 0 0.451 15 0.81
74 74 A 0 90 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.321 10 0.94
75 75 A 0 80 0 0 0 0 0 0 0 2 2 0 0 12 4 0 0 0 0 0 51 0 0 0.708 23 0.54
76 76 A 0 0 0 0 14 0 0 0 2 71 4 0 0 0 0 0 10 0 0 0 51 0 0 0.950 31 0.26
77 77 A 0 0 0 0 0 0 0 14 57 0 25 0 0 0 0 0 0 0 4 0 51 0 0 1.069 35 0.53
78 78 A 0 0 0 2 0 0 0 0 2 10 2 84 0 0 0 0 0 0 0 0 51 0 0 0.603 20 0.70
79 79 A 20 4 0 0 0 0 0 0 57 0 0 4 0 0 8 0 4 2 0 2 51 0 0 1.375 45 0.24
80 80 A 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
81 81 A 14 2 0 0 0 0 0 0 12 0 2 55 0 6 6 0 0 4 0 0 51 0 0 1.468 49 0.25
82 82 A 4 82 0 0 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.559 18 0.86
83 83 A 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 96 0 0 0 51 0 0 0.165 5 0.87
84 84 A 4 8 33 55 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 1.022 34 0.68
85 85 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 51 0 0 0.000 0 1.00
86 86 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
87 87 A 2 0 59 0 0 0 0 0 0 0 0 27 0 0 12 0 0 0 0 0 51 0 0 0.996 33 0.36
88 88 A 0 0 0 0 0 0 0 0 0 0 0 0 0 4 96 0 0 0 0 0 51 0 0 0.165 5 0.92
89 89 A 2 4 2 0 0 69 0 0 0 0 12 0 0 2 2 4 0 0 2 2 51 1 1 1.227 40 0.33
90 90 A 0 0 0 0 2 0 0 0 0 66 14 4 0 4 6 0 0 0 0 4 50 0 0 1.183 39 0.40
91 91 A 0 78 0 6 0 0 0 6 8 0 0 0 0 2 0 0 0 0 0 0 51 0 0 0.801 26 0.48
92 92 A 0 6 0 2 0 0 0 4 0 75 2 0 0 6 4 0 0 2 0 0 51 0 0 1.038 34 0.41
93 93 A 0 0 0 0 0 0 0 96 2 0 2 0 0 0 0 0 0 0 0 0 51 0 0 0.193 6 0.93
94 94 A 0 8 0 0 10 0 14 0 0 0 0 0 0 55 4 0 8 0 0 2 51 0 0 1.433 47 0.33
95 95 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
96 96 A 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
97 97 A 0 0 0 0 0 0 0 0 2 6 8 2 0 0 2 0 0 16 18 47 51 0 0 1.549 51 0.48
98 98 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 0 0 0.000 0 1.00
99 99 A 0 0 0 0 0 0 0 2 0 0 98 0 0 0 0 0 0 0 0 0 51 0 0 0.097 3 0.96
100 100 A 2 0 0 0 0 0 0 0 2 88 2 0 0 0 4 0 2 0 0 0 49 0 0 0.563 18 0.72
101 101 A 0 0 0 0 0 0 0 16 4 0 67 0 0 0 6 0 0 2 2 2 49 0 0 1.102 36 0.51
102 102 A 0 86 0 0 0 4 0 0 2 0 0 0 0 0 6 2 0 0 0 0 49 0 0 0.593 19 0.56
103 103 A 0 2 2 0 0 0 0 0 0 0 2 8 0 2 0 2 8 63 10 0 49 0 0 1.329 44 0.45
104 104 A 2 84 0 0 6 0 4 0 0 0 0 0 0 2 2 0 0 0 0 0 49 0 0 0.689 22 0.73
105 105 A 0 2 4 0 0 0 0 0 0 0 0 41 0 0 47 0 4 0 2 0 49 0 0 1.141 38 0.28
106 106 A 0 0 0 0 0 0 0 0 0 10 4 86 0 0 0 0 0 0 0 0 49 0 0 0.496 16 0.77
107 107 A 0 0 0 0 0 0 0 0 28 0 0 55 0 11 0 4 0 0 0 2 47 0 0 1.138 37 0.33
108 108 A 0 0 2 9 0 0 0 0 0 0 0 0 0 0 0 0 31 58 0 0 45 0 0 0.980 32 0.53
109 109 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 53 28 19 0 0 0 43 0 0 1.004 33 0.60
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
26 34 260 2 sLCl
39 35 279 3 nWVLm
40 15 240 2 gLDs
40 52 279 3 tWVLv
41 15 248 2 gLDs
41 52 287 3 tWVLv
43 68 300 2 nSAs
48 50 267 2 pEEd
49 50 246 2 pEEd
//