Complet list of 1dqc hssp file
Complete list of 1dqc.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1DQC
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-27
HEADER ANTIMICROBIAL PROTEIN 04-JAN-00 1DQC
COMPND MOL_ID: 1; MOLECULE: TACHYCITIN; CHAIN: A
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: TACHYPLEUS TRIDENTATUS; ORGANISM_TAXID
AUTHOR T.SUETAKE,S.TSUDA,S.KAWABATA,K.MIURA,K.KAWANO
DBREF 1DQC A 1 73 UNP P91818 P91818_TACTR 23 95
SEQLENGTH 73
NCHAIN 1 chain(s) in 1DQC data set
NALIGN 13
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : P91818_TACTR1DQC 1.00 1.00 1 73 23 95 73 0 0 98 P91818 Tachycitin OS=Tachypleus tridentatus PE=1 SV=1
2 : Q7PZX4_ANOGA 0.39 0.55 3 63 1135 1199 66 5 6 1602 Q7PZX4 AGAP012133-PA (Fragment) OS=Anopheles gambiae GN=AGAP012133 PE=4 SV=4
3 : D1MAJ4_TRICA 0.38 0.51 2 70 13 85 73 3 4 538 D1MAJ4 Peritrophic matrix protein 3 OS=Tribolium castaneum GN=PMP3 PE=2 SV=1
4 : V4A2T3_LOTGI 0.38 0.55 6 59 70 127 58 3 4 140 V4A2T3 Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_239076 PE=4 SV=1
5 : T1DPJ0_ANOAQ 0.37 0.48 5 62 28 86 60 2 3 130 T1DPJ0 Putative peritrophin-1 (Fragment) OS=Anopheles aquasalis PE=2 SV=1
6 : Q16VK3_AEDAE 0.35 0.46 1 61 25 85 63 2 4 129 Q16VK3 AAEL009540-PA OS=Aedes aegypti GN=AAEL009540 PE=4 SV=1
7 : T1GZY6_MEGSC 0.35 0.53 6 63 62 126 66 4 9 130 T1GZY6 Uncharacterized protein OS=Megaselia scalaris PE=4 SV=1
8 : E2A9N4_CAMFO 0.34 0.41 4 62 35 104 70 4 11 254 E2A9N4 Putative uncharacterized protein OS=Camponotus floridanus GN=EAG_11454 PE=4 SV=1
9 : Q7Q5H5_ANOGA 0.34 0.52 5 70 286 347 67 2 6 519 Q7Q5H5 AGAP006434-PA OS=Anopheles gambiae GN=AGAP006434 PE=4 SV=3
10 : T1DR48_ANOAQ 0.34 0.49 3 63 2 67 67 6 7 177 T1DR48 Putative peritrophin-1 (Fragment) OS=Anopheles aquasalis PE=2 SV=1
11 : B4NBU1_DROWI 0.32 0.45 1 67 91 150 69 5 11 233 B4NBU1 GK25193 OS=Drosophila willistoni GN=Dwil\GK25193 PE=4 SV=1
12 : E2C958_HARSA 0.32 0.45 2 65 1050 1119 73 4 12 1305 E2C958 Peritrophin-1 OS=Harpegnathos saltator GN=EAI_03557 PE=4 SV=1
13 : Q8ISS2_PLUXY 0.32 0.46 2 67 1037 1100 71 6 12 1192 Q8ISS2 Peritrophic matrix insect intestinal mucin (Fragment) OS=Plutella xylostella PE=2 SV=1
## ALIGNMENTS 1 - 13
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A Y 0 0 262 4 0 Y Y Y
2 2 A L - 0 0 73 7 80 L L P CIL
3 3 A A S S+ 0 0 62 9 61 ATG D PPPP
4 4 A F + 0 0 84 10 89 FFL P F FRTN
5 5 A R + 0 0 182 12 65 RNA RR RRNKKG
6 6 A a S > S- 0 0 35 14 51 CCSCCCCKTCNCC
7 7 A G G > S- 0 0 47 14 69 GTSGPPGCNHGPP
8 8 A R G 3 S+ 0 0 70 14 81 RPPEPRSPGPFAA
9 9 A Y G < S+ 0 0 125 14 74 YHSYFYCYYNFKD
10 10 A S < + 0 0 9 14 84 SPDIDDTPYPADS
11 11 A P + 0 0 64 14 73 PNSDAnSDptHPs
12 12 A b S S- 0 0 23 9 71 CCCC.s.Imp...
13 13 A L S S+ 0 0 150 11 97 LPPSDLQNFG...
14 14 A D S S- 0 0 95 12 80 DAADPIPPRQ.K.
15 15 A D S S+ 0 0 117 13 76 DKRKNIEKNRPD.
16 16 A G E S-A 30 0A 19 13 78 GDDANLGNEIDI.
17 17 A P E +A 29 0A 51 13 73 PpgnpPWtKppt.
18 18 A N E -A 28 0A 68 11 95 Nppyl.Ln.p.lq
19 19 A V - 0 0 66 11 56 VVVPP.VI.V.AL
20 20 A N - 0 0 1 12 83 NQYDHHPA.Q.HL
21 21 A L S S+ 0 0 85 12 72 LLLMELVH.L.EP
22 22 A Y S S+ 0 0 123 12 97 YppSTITE.p.Yh
23 23 A S - 0 0 0 14 64 SsesDDgsDtvDs
24 24 A c S S+ 0 0 9 14 0 CcccCCccCcCCc
25 25 A d S S+ 0 0 9 14 69 CTGQTSTTSSNTG
26 26 A S E S- B 0 39A 1 14 62 SKKTKKQKQKIKK
27 27 A F E - B 0 38A 3 14 1 FFFYFFFFFFFFF
28 28 A Y E -AB 18 37A 7 14 20 YYYYYVCYYYYYY
29 29 A N E -AB 17 36A 51 14 74 NKQTTTVKQKNKQ
30 30 A b E +AB 16 35A 5 14 0 CCCCCCCCCCCCC
31 31 A H E > - B 0 34A 84 14 89 HSSHSVglDSIhV
32 32 A K T 3 S- 0 0 157 10 87 K.NEWSaeH..d.
33 33 A a T 3 S+ 0 0 56 14 65 CGGGGGVCGGDCH
34 34 A L E < -B 31 0A 76 14 82 LGVALLLPTGGPG
35 35 A A E -B 30 0A 2 14 82 ANAFAGNLANDYD
36 36 A R E -B 29 0A 151 14 93 RAYHYFQMYAAML
37 37 A L E -B 28 0A 26 14 90 LCLGLEVWLCLNV
38 38 A E E -B 27 0A 61 14 74 EEQHKMVKIEEKE
39 39 A N E -B 26 0A 65 14 74 NqNnSRqdQqtKm
40 40 A e > - 0 0 5 13 15 CcCcCCcpCcc.c
41 41 A P T 3 S+ 0 0 91 14 34 PPPPPPPVPPTGP
42 42 A K T 3 S- 0 0 165 14 86 KVPQTESKATVDI
43 43 A G S < S+ 0 0 42 14 33 GGGGGGGRGGGRG
44 44 A L - 0 0 22 14 15 LLLLQLLLLLLLL
45 45 A H E -C 54 0B 6 14 39 HHHVHEYHHHHHH
46 46 A Y E -C 53 0B 41 14 5 YYWYFFFYFYFFF
47 47 A N E >>> -C 52 0B 24 14 30 NNNNASNNNNDNN
48 48 A A T 345S+ 0 0 43 14 73 AAVEAPPRTKEAP
49 49 A Y T 345S+ 0 0 209 14 96 YRAEHLTARAYRA
50 50 A L T <45S- 0 0 82 14 75 LEKLLELLLESLT
51 51 A K T <5S+ 0 0 123 14 75 KQLGQKRQSQGQE
52 52 A V E > - 0 0 109 14 8 WWWWFYWWYWWWW
56 56 A P T 34>S+ 0 0 64 14 12 PPPQPPPPPPPPP
57 57 A S T 345S+ 0 0 96 14 94 SNRQFQLWDQAWE
58 58 A K T <45S+ 0 0 128 14 86 KRDNVIAEKRTAS
59 59 A A T <5S- 0 0 11 14 23 AAATAAAAVAAAA
60 60 A G T 5 - 0 0 45 13 54 GCG QQGGDCGGG
61 61 A c < + 0 0 20 13 20 CCC CCCCCCRCC
62 62 A T + 0 0 108 12 62 TDE T AENDEEA
63 63 A S S S- 0 0 1 10 72 SPD S GPGTV
64 64 A V S S+ 0 0 113 7 97 V K P CSD
65 65 A N + 0 0 35 7 69 N N V NET
66 66 A K + 0 0 121 6 80 K E R P N
67 67 A E + 0 0 84 6 24 E E N E E
68 68 A d > + 0 0 19 4 116 C N E
69 69 A H T 3 + 0 0 140 4 67 H S H
70 70 A L T 3 S- 0 0 89 4 28 L L V
71 71 A W S < S- 0 0 110 2 0 W
72 72 A K 0 0 174 2 0 K
73 73 A T 0 0 106 2 0 T
## SEQUENCE PROFILE AND ENTROPY
SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT
1 1 A 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0.000 0 1.00
2 2 A 0 57 14 0 0 0 0 0 0 14 0 0 14 0 0 0 0 0 0 0 7 0 0 1.154 38 0.20
3 3 A 0 0 0 0 0 0 0 11 22 44 0 11 0 0 0 0 0 0 0 11 9 0 0 1.427 47 0.38
4 4 A 0 10 0 0 50 0 0 0 0 10 0 10 0 0 10 0 0 0 10 0 10 0 0 1.498 49 0.10
5 5 A 0 0 0 0 0 0 0 8 8 0 0 0 0 0 50 17 0 0 17 0 12 0 0 1.358 45 0.35
6 6 A 0 0 0 0 0 0 0 0 0 0 7 7 71 0 0 7 0 0 7 0 14 0 0 0.994 33 0.49
7 7 A 0 0 0 0 0 0 0 36 0 29 7 7 7 7 0 0 0 0 7 0 14 0 0 1.668 55 0.30
8 8 A 0 0 0 0 7 0 0 7 14 36 7 0 0 0 21 0 0 7 0 0 14 0 0 1.730 57 0.18
9 9 A 0 0 0 0 14 0 43 0 0 0 7 0 7 7 0 7 0 0 7 7 14 0 0 1.772 59 0.26
10 10 A 0 0 7 0 0 0 7 0 7 21 21 7 0 0 0 0 0 0 0 29 14 0 0 1.772 59 0.16
11 11 A 0 0 0 0 0 0 0 0 7 29 21 7 0 7 0 0 0 0 14 14 14 5 3 1.810 60 0.26
12 12 A 0 0 11 11 0 0 0 0 0 11 11 0 56 0 0 0 0 0 0 0 9 0 0 1.303 43 0.29
13 13 A 0 27 0 0 9 0 0 9 0 18 9 0 0 0 0 0 9 0 9 9 11 0 0 1.972 65 0.03
14 14 A 0 0 8 0 0 0 0 0 17 25 0 0 0 0 8 8 8 0 0 25 12 0 0 1.820 60 0.19
15 15 A 0 0 8 0 0 0 0 0 0 8 0 0 0 0 15 23 0 8 15 23 13 0 0 1.845 61 0.24
16 16 A 0 8 15 0 0 0 0 23 8 0 0 0 0 0 0 0 0 8 15 23 13 0 0 1.845 61 0.22
17 17 A 0 0 0 0 0 8 0 8 0 54 0 15 0 0 0 8 0 0 8 0 13 3 7 1.411 47 0.27
18 18 A 0 27 0 0 0 0 9 0 0 27 0 0 0 0 0 0 9 0 27 0 11 0 0 1.499 50 0.04
19 19 A 55 9 9 0 0 0 0 0 9 18 0 0 0 0 0 0 0 0 0 0 11 0 0 1.295 43 0.44
20 20 A 0 8 0 0 0 0 8 0 8 8 0 0 0 25 0 0 17 0 17 8 12 0 0 1.979 66 0.17
21 21 A 8 50 0 8 0 0 0 0 0 8 0 0 0 8 0 0 0 17 0 0 12 0 0 1.474 49 0.28
22 22 A 0 0 8 0 0 0 25 0 0 25 8 17 0 8 0 0 0 8 0 0 12 0 4 1.820 60 0.02
23 23 A 7 0 0 0 0 0 0 7 0 0 43 7 0 0 0 0 0 7 0 29 14 0 7 1.475 49 0.35
24 24 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 14 0 0 0.000 0 1.00
25 25 A 0 0 0 0 0 0 0 14 0 0 21 36 14 0 0 0 7 0 7 0 14 0 0 1.631 54 0.30
26 26 A 0 0 7 0 0 0 0 0 0 0 14 7 0 0 0 57 14 0 0 0 14 0 0 1.253 41 0.38
27 27 A 0 0 0 0 93 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 14 0 0 0.257 8 0.99
28 28 A 7 0 0 0 0 0 86 0 0 0 0 0 7 0 0 0 0 0 0 0 14 0 0 0.509 16 0.80
29 29 A 7 0 0 0 0 0 0 0 0 0 0 21 0 0 0 29 21 0 21 0 14 0 0 1.537 51 0.26
30 30 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 14 0 0 0.000 0 1.00
31 31 A 14 7 7 0 0 0 0 7 0 0 29 0 0 29 0 0 0 0 0 7 14 4 3 1.748 58 0.10
32 32 A 0 0 0 0 0 10 0 0 10 0 10 0 0 10 0 20 0 20 10 10 10 0 0 2.025 67 0.13
33 33 A 7 0 0 0 0 0 0 50 0 0 0 0 29 7 0 0 0 0 0 7 14 0 0 1.270 42 0.35
34 34 A 7 36 0 0 0 0 0 29 7 14 0 7 0 0 0 0 0 0 0 0 14 0 0 1.569 52 0.17
35 35 A 0 7 0 0 7 0 7 7 36 0 0 0 0 0 0 0 0 0 21 14 14 0 0 1.730 57 0.17
36 36 A 0 7 0 14 7 0 21 0 21 0 0 0 0 7 14 0 7 0 0 0 14 0 0 1.970 65 0.06
37 37 A 14 43 0 0 0 7 0 7 0 0 0 0 14 0 0 0 0 7 7 0 14 0 0 1.673 55 0.09
38 38 A 7 0 7 7 0 0 0 0 0 0 0 0 0 7 0 21 7 43 0 0 14 0 0 1.636 54 0.25
39 39 A 0 0 0 7 0 0 0 0 0 0 7 7 0 0 7 7 29 0 29 7 14 1 7 1.847 61 0.26
40 40 A 0 0 0 0 0 0 0 0 0 8 0 0 92 0 0 0 0 0 0 0 13 0 0 0.271 9 0.85
41 41 A 7 0 0 0 0 0 0 7 0 79 0 7 0 0 0 0 0 0 0 0 14 0 0 0.755 25 0.65
42 42 A 14 0 7 0 0 0 0 0 7 7 7 14 0 0 0 21 7 7 0 7 14 0 0 2.206 73 0.14
43 43 A 0 0 0 0 0 0 0 86 0 0 0 0 0 0 14 0 0 0 0 0 14 0 0 0.410 13 0.67
44 44 A 0 93 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 14 0 0 0.257 8 0.85
45 45 A 7 0 0 0 0 0 7 0 0 0 0 0 0 79 0 0 0 7 0 0 14 0 0 0.755 25 0.61
46 46 A 0 0 0 0 50 7 43 0 0 0 0 0 0 0 0 0 0 0 0 0 14 0 0 0.898 29 0.94
47 47 A 0 0 0 0 0 0 0 0 7 0 7 0 0 0 0 0 0 0 79 7 14 0 0 0.755 25 0.70
48 48 A 7 0 0 0 0 0 0 0 36 21 0 7 0 0 7 7 0 14 0 0 14 0 0 1.730 57 0.26
49 49 A 0 7 0 0 0 0 21 0 29 0 0 7 0 7 21 0 0 7 0 0 14 0 0 1.772 59 0.04
50 50 A 0 57 0 0 0 0 0 0 0 0 7 7 0 0 0 7 0 21 0 0 14 0 0 1.215 40 0.24
51 51 A 0 7 0 0 0 0 0 14 0 0 7 0 0 0 7 21 36 7 0 0 14 0 0 1.730 57 0.24
52 52 A 64 0 0 0 0 0 0 0 7 0 7 7 0 0 14 0 0 0 0 0 14 0 0 1.128 37 0.39
53 53 A 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 14 0 0 0.000 0 1.00
54 54 A 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 86 14 0 0 0.509 16 0.75
55 55 A 0 0 0 0 7 79 14 0 0 0 0 0 0 0 0 0 0 0 0 0 14 0 0 0.656 21 0.91
56 56 A 0 0 0 0 0 0 0 0 0 93 0 0 0 0 0 0 7 0 0 0 14 0 0 0.257 8 0.87
57 57 A 0 7 0 0 7 14 0 0 7 0 14 0 0 0 7 0 21 7 7 7 14 0 0 2.206 73 0.06
58 58 A 7 0 7 0 0 0 0 0 14 0 7 7 0 0 14 21 0 7 7 7 14 0 0 2.206 73 0.13
59 59 A 7 0 0 0 0 0 0 0 86 0 0 7 0 0 0 0 0 0 0 0 14 0 0 0.509 16 0.76
60 60 A 0 0 0 0 0 0 0 62 0 0 0 0 15 0 0 0 15 0 0 8 13 0 0 1.072 35 0.46
61 61 A 0 0 0 0 0 0 0 0 0 0 0 0 92 0 8 0 0 0 0 0 13 0 0 0.271 9 0.80
62 62 A 0 0 0 0 0 0 0 0 17 0 0 25 0 0 0 0 0 33 8 17 12 0 0 1.517 50 0.38
63 63 A 10 0 0 0 0 0 0 20 0 20 30 10 0 0 0 0 0 0 0 10 10 0 0 1.696 56 0.27
64 64 A 29 0 0 0 0 0 0 0 0 14 14 0 14 0 0 14 0 0 0 14 7 0 0 1.748 58 0.02
65 65 A 14 0 0 0 0 0 0 0 0 0 0 14 0 0 0 0 0 14 57 0 7 0 0 1.154 38 0.30
66 66 A 0 0 0 0 0 0 0 0 0 17 0 0 0 0 17 33 0 17 17 0 6 0 0 1.561 52 0.20
67 67 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 83 17 0 6 0 0 0.451 15 0.75
68 68 A 0 0 0 0 0 0 0 0 0 0 0 0 50 0 0 0 0 25 25 0 4 0 0 1.040 34 -0.16
69 69 A 0 0 0 0 0 0 0 0 0 0 25 0 0 75 0 0 0 0 0 0 4 0 0 0.562 18 0.32
70 70 A 25 75 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0.562 18 0.71
71 71 A 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0.000 0 1.00
72 72 A 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 2 0 0 0.000 0 1.00
73 73 A 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 2 0 0 0.000 0 1.00
## INSERTION LIST
AliNo IPOS JPOS Len Sequence
2 16 1150 2 pLHp
2 21 1157 1 pHs
2 22 1159 1 sDc
2 37 1175 1 qLc
3 17 29 2 gAFp
3 22 36 1 pHe
3 23 38 1 eDc
4 13 82 2 nGHy
4 19 90 1 sFc
4 35 107 1 nYc
5 13 40 2 pTQl
6 12 36 2 nPHs
7 18 79 1 gDc
7 26 88 6 gTFQYDGa
7 34 102 1 qTc
8 15 49 2 tTTn
8 21 57 1 sDc
8 29 66 7 lLGRGTEQe
8 37 81 1 dDp
9 8 293 1 pVm
10 10 11 1 tLp
10 16 18 2 pVHp
10 21 25 1 pHt
10 22 27 1 tDc
10 37 43 1 qIc
11 15 105 1 pAv
11 31 122 1 tKc
12 15 1064 2 tIHl
12 29 1080 7 hAGKKFLLd
13 11 1047 2 sIEq
13 16 1054 1 hDs
13 17 1056 1 sEc
13 32 1072 1 mAc
//