Complet list of 1b9p hssp file
Complete list of 1b9p.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 1B9P
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-04-27
HEADER COLLAGEN FACIT XIV 15-FEB-99 1B9P
COMPND MOL_ID: 1; MOLECULE: PROTEIN (COLLAGEN ALPHA 1); CHAIN: A; FRAGMENT: F
SOURCE MOL_ID: 1; SYNTHETIC: YES; OTHER_DETAILS: THE PROTEIN WAS CHEMICALLY S
AUTHOR R.MONTSERRET,G.DELEAGE,F.PENIN
DBREF 1B9P A 1 34 UNP P32018 COEA1_CHICK 1852 1885
SEQLENGTH 34
NCHAIN 1 chain(s) in 1B9P data set
NALIGN 13
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : COEA1_CHICK 1B9P 1.00 1.00 2 34 1853 1885 33 0 0 1888 P32018 Collagen alpha-1(XIV) chain OS=Gallus gallus GN=COL14A1 PE=1 SV=2
2 : F1NKZ3_CHICK 1.00 1.00 2 34 1853 1885 33 0 0 1888 F1NKZ3 Collagen alpha-1(XIV) chain OS=Gallus gallus GN=COL14A1 PE=4 SV=2
3 : F6T4R2_CHICK 1.00 1.00 2 34 1822 1854 33 0 0 1857 F6T4R2 Collagen alpha-1(XIV) chain OS=Gallus gallus GN=COL14A1 PE=4 SV=1
4 : Q9PSS5_CHICK 1.00 1.00 2 34 88 120 33 0 0 123 Q9PSS5 Collagen type XIV alpha 1 chain (Fragments) OS=Gallus gallus PE=4 SV=1
5 : G1NIW3_MELGA 0.97 1.00 2 34 1849 1881 33 0 0 1884 G1NIW3 Uncharacterized protein OS=Meleagris gallopavo GN=COL14A1 PE=4 SV=2
6 : G1NIW5_MELGA 0.97 1.00 2 34 1821 1853 33 0 0 1856 G1NIW5 Uncharacterized protein OS=Meleagris gallopavo GN=COL14A1 PE=4 SV=2
7 : G1NIW6_MELGA 0.97 1.00 2 34 1852 1884 33 0 0 1887 G1NIW6 Uncharacterized protein OS=Meleagris gallopavo GN=COL14A1 PE=4 SV=2
8 : G3USE3_MELGA 0.97 1.00 2 34 88 120 33 0 0 123 G3USE3 Uncharacterized protein (Fragment) OS=Meleagris gallopavo GN=COL14A1 PE=4 SV=1
9 : U3ILT8_ANAPL 0.72 0.84 2 33 1852 1883 32 0 0 1887 U3ILT8 Uncharacterized protein OS=Anas platyrhynchos GN=COL14A1 PE=4 SV=1
10 : U3JIB0_FICAL 0.72 0.84 2 33 1860 1891 32 0 0 1895 U3JIB0 Uncharacterized protein OS=Ficedula albicollis GN=COL14A1 PE=4 SV=1
11 : U3JIB6_FICAL 0.72 0.84 2 33 1851 1882 32 0 0 1886 U3JIB6 Uncharacterized protein OS=Ficedula albicollis GN=COL14A1 PE=4 SV=1
12 : H0ZQ71_TAEGU 0.69 0.81 2 33 1849 1880 32 0 0 1884 H0ZQ71 Uncharacterized protein OS=Taeniopygia guttata GN=COL14A1 PE=4 SV=1
13 : K7FM26_PELSI 0.65 0.84 4 34 1860 1890 31 0 0 1891 K7FM26 Uncharacterized protein OS=Pelodiscus sinensis GN=COL14A1 PE=4 SV=1
## ALIGNMENTS 1 - 13
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 1 A C 0 0 181 1 0
2 2 A A - 0 0 83 13 0 AAAAAAAAAAAA
3 3 A V > - 0 0 92 13 0 VVVVVVVVVVVV
4 4 A E T 3 S- 0 0 173 14 0 EEEEEEEEEEEEE
5 5 A L T 3 S+ 0 0 155 14 13 LLLLLLLLLLLLV
6 6 A R < + 0 0 206 14 0 RRRRRRRRRRRRR
7 7 A S > - 0 0 48 14 0 SSSSSSSSSSSSS
8 8 A P T 4 + 0 0 69 14 0 PPPPPPPPPPPPP
9 9 A G T 4 S+ 0 0 50 14 0 GGGGGGGGGGGGG
10 10 A I T > S+ 0 0 131 14 0 IIIIIIIIIIIII
11 11 A S H X S+ 0 0 59 14 17 SSSSSSSSTSSSS
12 12 A R H 4 S+ 0 0 185 14 0 RRRRRRRRRRRRR
13 13 A F H > S+ 0 0 141 14 0 FFFFFFFFFFFFF
14 14 A R H X S+ 0 0 142 14 99 RRRRRRRRTAAAA
15 15 A R H X S+ 0 0 194 14 0 RRRRRRRRRRRRR
16 16 A K H > S+ 0 0 145 14 27 KKKKKKKKKRRRK
17 17 A I H X S+ 0 0 94 14 0 IIIIIIIIIIIII
18 18 A A H X S+ 0 0 40 14 21 AAAAAAAAAAAAS
19 19 A K H X S+ 0 0 138 14 0 KKKKKKKKKKKKK
20 20 A R H X S+ 0 0 205 14 0 RRRRRRRRRRRRR
21 21 A S H X S+ 0 0 74 14 0 SSSSSSSSSSSSS
22 22 A I H X S+ 0 0 87 14 66 IIIIIIIIVAAAV
23 23 A K H X S+ 0 0 139 14 54 KKKKKKKKTRRKN
24 24 A T H X S+ 0 0 84 14 37 TTTTTTTTATTTS
25 25 A L H >X S+ 0 0 82 14 96 LLLLLLLLPPPPP
26 26 A E H 3X>S+ 0 0 104 14 64 EEEEEEEEVEEVA
27 27 A H H 3<5S+ 0 0 127 14 39 HHHHHHHHHRRRH
28 28 A K H <<5S+ 0 0 141 14 50 KKKKKKKKKTTTK
29 29 A R H ><5S+ 0 0 175 14 10 RRRRRRRRRRRKR
30 30 A E T 3<5S+ 0 0 172 14 56 EEEEEEEEAAAAE
31 31 A N T 3