Complet list of 1ugl hssp file
Complete list of 1ugl.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1UGL
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-10
HEADER PLANT PROTEIN 16-JUN-03 1UGL
COMPND MOL_ID: 1; MOLECULE: S-LOCUS POLLEN PROTEIN; CHAIN: A; FRAGMENT: RESID
SOURCE MOL_ID: 1; SYNTHETIC: YES; OTHER_DETAILS: THIS SEQUENCE OCCURS NATURAL
AUTHOR M.MISHIMA,S.TAKAYAMA,K.SASAKI,J.G.JEE,C.KOJIMA,A.ISOGAI, M.SHIRAKAWA,R
DBREF 1UGL A 1 50 UNP Q9SE17 Q9SE17_BRACM 25 74
SEQLENGTH 50
NCHAIN 1 chain(s) in 1UGL data set
NALIGN 9
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : Q9SE17_BRACM1UGL 1.00 1.00 1 50 25 74 50 0 0 74 Q9SE17 S locus protein 11 OS=Brassica campestris GN=SCR PE=1 SV=1
2 : Q8W410_BRAOL 0.92 0.92 1 50 7 56 50 0 0 56 Q8W410 S locus protein 11-32 (Fragment) OS=Brassica oleracea GN=SP11-32 PE=2 SV=1
3 : Q1ET33_BRAOL 0.36 0.45 1 50 26 79 56 2 8 79 Q1ET33 S locus protein 11 OS=Brassica oleracea GN=S-12 SP11 PE=4 SV=1
4 : Q1ET34_BRACM 0.36 0.46 1 50 26 79 56 2 8 79 Q1ET34 S locus protein 11 OS=Brassica campestris GN=S-47 SP11 PE=4 SV=1
5 : Q6L8S4_RAPSA 0.36 0.54 1 50 7 61 56 2 7 61 Q6L8S4 S locus protein 11-6 (Fragment) OS=Raphanus sativus GN=SP11-6 PE=2 SV=1
6 : Q8W411_BRAOL 0.36 0.45 1 50 7 60 56 2 8 60 Q8W411 S locus protein 11-12 (Fragment) OS=Brassica oleracea GN=SP11-12 PE=2 SV=1
7 : Q9LRD1_BRACM 0.36 0.46 1 50 14 67 56 2 8 67 Q9LRD1 S locus protein 11-47 (Fragment) OS=Brassica campestris GN=SP11-47 PE=2 SV=1
8 : Q9MB54_BRACM 0.36 0.48 1 50 25 79 56 2 7 79 Q9MB54 S-locus pollen protein OS=Brassica campestris GN=SP11-52 PE=4 SV=1
9 : Q9SE18_BRAOL 0.36 0.55 1 50 25 77 55 3 7 77 Q9SE18 S-locus cysteine-rich protein OS=Brassica oleracea GN=SCR PE=4 SV=1
## ALIGNMENTS 1 - 9
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A N 0 0 201 10 0 NNNNNNNNN
2 2 A L + 0 0 132 10 0 LLLLLLLLL
3 3 A M + 0 0 148 10 0 MMMMMMMMM
4 4 A K - 0 0 31 10 86 KKYNKYNNM
5 5 A R - 0 0 165 10 56 RRPPRPPRP
6 6 A a - 0 0 16 10 0 CCCCCCCCC
7 7 A T S S- 0 0 21 10 63 TTDDTDDTG
8 8 A R S S- 0 0 148 9 77 RRDDHDDR.
9 9 A G S S- 0 0 63 10 90 GGTIQTIES
10 10 A F S S+ 0 0 127 10 9 FFFFLFFLF
11 11 A R + 0 0 140 9 93 RRGGPGGP.
12 12 A K E -A 43 0A 2 10 79 KKMMFMMFM
13 13 A L E +A 42 0A 112 10 104 LPEEREEPF
14 14 A G S S- 0 0 25 10 0 GGGGGGGGG
15 15 A K S S- 0 0 139 10 71 KKQQTQQKN
16 16 A b S S- 0 0 47 10 0 CCCCCCCCC
17 17 A T >> - 0 0 49 7 74 TT..GG.GR
18 18 A T H 3> S+ 0 0 95 6 75 TT..S..SN
19 19 A L H 3> S+ 0 0 112 9 99 LLGGS.GSI
20 20 A E H <> S+ 0 0 3 10 44 EEGGGGGEG
21 21 A E H X S+ 0 0 85 10 75 EEPPDPPDA
22 22 A E H X>S+ 0 0 90 10 73 EEKKEKKGR
23 23 A K H X5S+ 0 0 31 10 86 KKTTVTTGE
24 24 A c H X5S+ 0 0 0 10 0 CCCCCCCCC
25 25 A K H <5S+ 0 0 75 10 72 KKEEKEEIE
26 26 A T H <5S+ 0 0 89 10 63 TTNKKNKKK
27 27 A L H <