Complet list of 2bud hssp file
Complete list of 2bud.hssp file
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 2.0 2011
PDBID 2BUD
THRESHOLD according to: t(L)=(290.15 * L ** -0.562) + 5
REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991).
CONTACT Maintained at http://www.cmbi.ru.nl/ by Maarten L. Hekkelman
DATE file generated on 2014-05-17
HEADER TRANSFERASE 09-JUN-05 2BUD
COMPND MOL_ID: 1; MOLECULE: MALES-ABSENT ON THE FIRST PROTEIN; CHAIN: A; FRAG
SOURCE MOL_ID: 1; ORGANISM_SCIENTIFIC: DROSOPHILA MELANOGASTER; ORGANISM_COMM
AUTHOR P.R.NIELSEN,D.NIETLISPACH,A.BUSCAINO,R.J.WARNER,A.AKHTAR, A.G.MURZIN,N
DBREF 2BUD A 363 366 PDB 2BUD 2BUD 363 366
DBREF 2BUD A 367 454 UNP O02193 MOF_DROME 367 454
SEQLENGTH 92
NCHAIN 1 chain(s) in 2BUD data set
NALIGN 39
NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein
NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken
NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry
NOTATION : %IDE: percentage of residue identity of the alignment
NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment
NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence
NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein
NOTATION : LALI: length of the alignment excluding insertions and deletions
NOTATION : NGAP: number of insertions and deletions in the alignment
NOTATION : LGAP: total length of all insertions and deletions
NOTATION : LSEQ2: length of the entire sequence of the aligned protein
NOTATION : ACCNUM: SwissProt accession number
NOTATION : PROTEIN: one-line description of aligned protein
NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary
NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983)
NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments
NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of insertion in this sequence
NOTATION : dots (....) in the alignend sequence indicate points of deletion in this sequence
NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their
NOTATION : acid/amide form in proportion to their database frequencies
NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence)
NOTATION : NDEL: number of sequences with a deletion in the test protein at this position
NOTATION : NINS: number of sequences with an insertion in the test protein at this position
NOTATION : ENTROPY: entropy measure of sequence variability at this position
NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100
NOTATION : WEIGHT: conservation weight
## PROTEINS : identifier and alignment statistics
NR. ID STRID %IDE %WSIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCNUM PROTEIN
1 : I1V4X5_DROME 1.00 1.00 5 90 367 452 86 0 0 827 I1V4X5 Males absent on the first OS=Drosophila melanogaster GN=mof PE=4 SV=1
2 : MOF_DROME 2BUD 1.00 1.00 5 92 367 454 88 0 0 827 O02193 Males-absent on the first protein OS=Drosophila melanogaster GN=mof PE=1 SV=1
3 : B4I0I1_DROSE 0.98 1.00 5 92 359 446 88 0 0 819 B4I0I1 GM12611 OS=Drosophila sechellia GN=Dsec\GM12611 PE=4 SV=1
4 : B4Q1D9_DROYA 0.98 1.00 6 92 367 453 87 0 0 826 B4Q1D9 GE15686 OS=Drosophila yakuba GN=Dyak\GE15686 PE=4 SV=1
5 : A8B8A8_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8A8 MOF OS=Drosophila simulans PE=4 SV=1
6 : A8B8B8_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8B8 MOF OS=Drosophila simulans PE=4 SV=1
7 : A8B8C1_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8C1 MOF OS=Drosophila simulans PE=4 SV=1
8 : A8B8C8_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8C8 MOF OS=Drosophila simulans PE=4 SV=1
9 : A8B8D6_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8D6 MOF OS=Drosophila simulans PE=4 SV=1
10 : A8B8D8_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8D8 MOF OS=Drosophila simulans PE=4 SV=1
11 : A8B8E4_DROSI 0.97 1.00 5 92 367 454 88 0 0 827 A8B8E4 MOF OS=Drosophila simulans PE=4 SV=1
12 : A8IKT6_DROSI 0.97 1.00 5 92 336 423 88 0 0 796 A8IKT6 MOF (Fragment) OS=Drosophila simulans GN=mof PE=4 SV=1
13 : A8IKU1_DROSI 0.97 1.00 5 92 92 179 88 0 0 552 A8IKU1 MOF (Fragment) OS=Drosophila simulans GN=mof PE=4 SV=1
14 : A8IKU9_DROSI 0.97 1.00 5 92 336 423 88 0 0 796 A8IKU9 MOF (Fragment) OS=Drosophila simulans GN=mof PE=4 SV=1
15 : A8IKV3_DROSI 0.97 1.00 5 92 336 423 88 0 0 796 A8IKV3 MOF (Fragment) OS=Drosophila simulans GN=mof PE=4 SV=1
16 : A8IKW7_DROSI 0.97 1.00 5 92 336 423 88 0 0 796 A8IKW7 MOF (Fragment) OS=Drosophila simulans GN=mof PE=4 SV=1
17 : B3NSY4_DROER 0.95 1.00 6 92 365 451 87 0 0 824 B3NSY4 GG18463 OS=Drosophila erecta GN=Dere\GG18463 PE=4 SV=1
18 : B4JKW1_DROGR 0.76 0.92 10 87 408 485 78 0 0 865 B4JKW1 GH11980 OS=Drosophila grimshawi GN=Dgri\GH11980 PE=4 SV=1
19 : B4JRS4_DROGR 0.76 0.93 10 82 330 403 74 1 1 757 B4JRS4 GH19608 OS=Drosophila grimshawi GN=Dgri\GH19608 PE=4 SV=1
20 : B4M8C2_DROVI 0.73 0.89 5 87 508 590 83 0 0 996 B4M8C2 GJ16786 OS=Drosophila virilis GN=Dvir\GJ16786 PE=4 SV=1
21 : B4N1W0_DROWI 0.73 0.92 7 90 526 610 85 1 1 1000 B4N1W0 GK16378 OS=Drosophila willistoni GN=Dwil\GK16378 PE=4 SV=1
22 : B3MR53_DROAN 0.72 0.91 5 92 363 450 88 0 0 826 B3MR53 GF21061 OS=Drosophila ananassae GN=Dana\GF21061 PE=4 SV=1
23 : B4L1W3_DROMO 0.72 0.86 1 87 430 516 87 0 0 900 B4L1W3 GI14721 OS=Drosophila mojavensis GN=Dmoj\GI14721 PE=4 SV=1
24 : B4H3N8_DROPE 0.67 0.82 7 91 671 755 85 0 0 1153 B4H3N8 GL15211 OS=Drosophila persimilis GN=Dper\GL15211 PE=4 SV=1
25 : Q29JC6_DROPS 0.67 0.82 7 91 700 784 85 0 0 1182 Q29JC6 GA15730 OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA15730 PE=4 SV=2
26 : G1S288_NOMLE 0.39 0.69 2 71 42 110 70 1 1 467 G1S288 Uncharacterized protein OS=Nomascus leucogenys GN=KAT8 PE=4 SV=1
27 : G3TCI2_LOXAF 0.39 0.69 2 71 42 110 70 1 1 466 G3TCI2 Uncharacterized protein OS=Loxodonta africana GN=KAT8 PE=4 SV=1
28 : K7CL24_PANTR 0.39 0.69 2 71 42 110 70 1 1 467 K7CL24 MYST histone acetyltransferase 1 OS=Pan troglodytes GN=KAT8 PE=2 SV=1
29 : K9ITZ6_DESRO 0.39 0.69 2 71 64 132 70 1 1 480 K9ITZ6 Putative histone acetyltransferase myst family (Fragment) OS=Desmodus rotundus PE=2 SV=1
30 : K9IY17_DESRO 0.39 0.69 2 71 42 110 70 1 1 476 K9IY17 Putative histone acetyltransferase myst family OS=Desmodus rotundus PE=2 SV=1
31 : L8J3L0_9CETA 0.39 0.69 2 71 42 110 70 1 1 468 L8J3L0 Putative histone acetyltransferase MYST1 OS=Bos mutus GN=M91_19517 PE=4 SV=1
32 : T1L0F9_TETUR 0.39 0.69 1 70 42 111 70 0 0 465 T1L0F9 Uncharacterized protein OS=Tetranychus urticae PE=4 SV=1
33 : W4WY06_ATTCE 0.39 0.68 12 80 47 114 69 1 1 157 W4WY06 Uncharacterized protein OS=Atta cephalotes PE=4 SV=1
34 : G3WE75_SARHA 0.38 0.69 1 71 36 105 71 1 1 465 G3WE75 Uncharacterized protein (Fragment) OS=Sarcophilus harrisii GN=KAT8 PE=4 SV=1
35 : G3RCP2_GORGO 0.37 0.67 2 71 42 110 70 1 1 467 G3RCP2 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153108 PE=4 SV=1
36 : G3RNG8_GORGO 0.37 0.69 2 71 42 110 70 1 1 458 G3RNG8 Uncharacterized protein OS=Gorilla gorilla gorilla GN=101153108 PE=4 SV=1
37 : E2A1K0_CAMFO 0.36 0.64 12 80 45 112 69 1 1 452 E2A1K0 Probable histone acetyltransferase MYST1 OS=Camponotus floridanus GN=EAG_06858 PE=4 SV=1
38 : F4WNJ3_ACREC 0.36 0.65 12 80 47 114 69 1 1 457 F4WNJ3 Putative histone acetyltransferase MYST1 OS=Acromyrmex echinatior GN=G5I_07346 PE=4 SV=1
39 : E2C6Q8_HARSA 0.35 0.59 12 80 47 114 69 1 1 454 E2C6Q8 Probable histone acetyltransferase MYST1 OS=Harpegnathos saltator GN=EAI_02843 PE=4 SV=1
## ALIGNMENTS 1 - 39
SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7
1 363 A G 0 0 135 4 52 G D A
2 364 A S + 0 0 109 12 45 G SSSSSSN SSS
3 365 A H S S- 0 0 161 12 83 G PPPPPPN PPP
4 366 A M - 0 0 104 12 92 D PPPQQPK PPP
5 367 A D - 0 0 94 29 76 DDD DDDDDDDDDDDD E DQ TTTTTTG ATT
6 368 A P S S+ 0 0 117 31 19 PPPPPPPPPPPPPPPPP P PL PPPPPPE PPP
7 369 A L S S+ 0 0 137 34 86 LLLLLLLLLLLLLLLLM SLMLLLAAAAAAY AAA
8 370 A M + 0 0 111 34 86 MMMMMMMMMMMMMMMMM ILLLGGRRRRRRK RRR
9 371 A Q S S- 0 0 118 34 72 QQQQQQQQQQQQQQQQQ DKQNDDGGGGGGE GGG
10 372 A K S S- 0 0 85 36 70 KKKKKKKKKKKKKKKKKKKKTKKGGEEEEEEN EEE
11 373 A I - 0 0 38 36 87 IIIIIIIIIIIIIIIIIIILIIIIIPPPPPPE PPP
12 374 A D - 0 0 84 40 33 DDDDDDDDDDDDDDDDDEEDDDDVVEEEEEEDEEEEEEE
13 375 A I S > S+ 0 0 31 40 28 IIIIIIIIIIIIIIIIIIIIIIIVVVVVVVVILVVVLLM
14 376 A S T 3 S+ 0 0 111 40 75 SSSSSSSSSSSSSSSSSNSSNSANNTTTTTTVPTTTPPP
15 377 A E T 3 S+ 0 0 123 40 92 EEEEEEEEEEEEEEEEEEADEEDEEVVVVVVLLVVVLLL
16 378 A N X + 0 0 38 40 54 NNNNNNNNNNNNNNNNNHHNNKNDDEEEEEEDDEEEDDD
17 379 A P T 3 S+ 0 0 65 40 90 PPPPPPPPPPPPPPPPPSPPPSPSSIIIIIIIIIIIIII
18 380 A D T 3 S+ 0 0 142 40 45 DDDDDDDDDDDDDDDDDDEEDEEEEGGGGGGGGGGGGGG
19 381 A K S < S- 0 0 78 40 60 KKKKKKKKKKKKKKKKKKKKKKKKKEEEEEEDEEEEEEE
20 382 A I - 0 0 52 40 83 IIIIIIIIIIIIIIIIIIVIIIIVVTTTTTTHHTTTHHH
21 383 A Y E -A 33 0A 2 40 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYFYYYYYYY
22 384 A F E -AB 32 71A 59 40 54 FFYYYYYYYYYYYYYYYYYYYYYYYLLLLLLLLLLLLLL
23 385 A I E -AB 31 70A 0 40 55 IIIIIIIIIIIIIIIIIIIIIIIIICCCCCCVVCCCVVV
24 386 A R E -A 30 0A 63 40 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
25 387 A R > - 0 0 83 40 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
26 388 A E T 3 S+ 0 0 159 40 79 EEEEEEEEEEEEEEEEEAHADDAEEPPPPPPSSPPPSSS
27 389 A D T 3 S- 0 0 93 40 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
28 390 A G S < S+ 0 0 1 40 50 GGGGGGGGGGGGGGGGGGGGGGGGGSSSSSSNDSSSDDD
29 391 A T - 0 0 35 40 23 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTTSSS
30 392 A V E +A 24 0A 7 39 108 VVVVVVVVVVVVVVVVVVVVVVVVVWWWWWWWWWXLWWW
31 393 A H E -A 23 0A 59 39 5 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHXYHHH
32 394 A R E +A 22 0A 141 40 83 RRRRRRRRRRRRRRRRRAAAAAAAASSSSSSSPSSSPPP
33 395 A G E -A 21 0A 0 39 39 GGGGGGGGGGGGGGGGGGGGGGGGGAAAAAAAAAAAAA.
34 396 A Q E -C 53 0B 64 40 54 QQQQQQQQQQQQQQQQQHHQHQQRREEEEEEEEEEEEEA
35 397 A V E +C 52 0B 1 40 14 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVVVIIE
36 398 A L E + 0 0B 97 40 34 LLLLLLLLLLLLLLLLLLLLLLLLLIIIIIIIIIIIIII
37 399 A Q E - 0 0B 119 40 15 QQQQQQQQQQQQQQQQQQEQQQQQQQQQQQQEQQQQQQI
38 400 A S E -C 50 0B 44 40 42 SSSSSSSSSSSSSSSSSASSAVSSSSSSSSSVTSSSTTQ
39 401 A R E -C 49 0B 139 39 8 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR.RT
40 402 A T > - 0 0 37 40 85 TTTTTTTTTTTTTTTTTIVVSSVIIVVVVVVSYVVVRYR
41 403 A T T 4 - 0 0 76 40 74 TTTTTTTTTTTTTTTTTTTTTSTTTNNNNNNNNNNNYNY
42 404 A E T 4 S+ 0 0 192 40 39 EEEEEEEEEEEEEEEEEDDDEDDAADDDDDDPEDDDNEN
43 405 A N T 4 S- 0 0 116 40 50 NNNNNNNNNNNNNNNNNNnNqNNNNQQQQQQQNQQQENE
44 406 A A < + 0 0 57 31 87 AAAAPPPPPPPPPPPPAGsCsSCVV......EE...NEN
45 407 A A S S- 0 0 62 39 81 AAAAAAAAAAAAAAAAAIRVTAVLLEEEEEEG.EEEESE
46 408 A A S S+ 0 0 67 40 70 AAATAAAAAAAAAAAATSTSVSSMMGGGGGGGSGGGNHS
47 409 A P + 0 0 31 40 70 PPPPPPPPPPPPPPPPPPPPPPPPPRRRRRRRHRRRHYH
48 410 A D - 0 0 34 40 58 DDEDEEEEEEEEEEEENNNNNDNNNEEEEEEYYEEEYEY
49 411 A E E -CD 39 66B 50 40 9 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEYE
50 412 A Y E -CD 38 65B 1 40 3 YYYYYYYYYYYYYYYYYYYYYYYYYFFFFFFYYFFFYYY
51 413 A Y E - D 0 64B 71 40 6 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYIY
52 414 A V E -CD 35 63B 0 39 2 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVIVVVV.V
53 415 A H E -C 34 0B 62 40 0 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
54 416 A Y - 0 0 21 40 0 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY
55 417 A V S S- 0 0 45 40 41 VVVVVVVVVVVVVVVVVVLVVVVVVVVVVVVEDVVVEDE
56 418 A G S S+ 0 0 83 40 0 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
57 419 A L S S- 0 0 47 40 42 LLLLLLLLLLLLLLLLLLLLLLLLLFFFFFFLHFFFHHH
58 420 A N > - 0 0 105 40 0 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
59 421 A R T >4 S+ 0 0 211 40 2 RRRRRRRRRRRRRRRRRRRRRKRRRRRRRRRRRRRRRRR
60 422 A R T 34 S+ 0 0 203 40 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
61 423 A L T 34 S+ 0 0 78 40 0 LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
62 424 A D << + 0 0 22 40 0 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
63 425 A G E -D 52 0B 21 40 48 GGGGGGGGGGGGGGGGGGGEGGEEEEEEEEEEEEEEEEE
64 426 A W E -D 51 0B 99 40 0 WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW
65 427 A V E -D 50 0B 5 40 0 VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
66 428 A G E >> -D 49 0B 9 40 50 GGGGGGGGGGGGGGGGGGGGGGGGGDDDDDDGPDDDPPP
67 429 A R T 34 S+ 0 0 55 40 25 RRRRRRRRRRRRRRRRRRRRRRRRRKKKKKKRRKKKRRR
68 430 A H T 34 S+ 0 0 142 40 55 HHHHHHHHHHHHHHHHHHHHHHHHHNNNNNNDDNNNDDD
69 431 A R T <4 S+ 0 0 74 40 0 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
70 432 A I E < +B 23 0A 0 40 27 IIIIIIIIIIIIIIIIIIIIIIIIILLLLLLVILLLIII
71 433 A S E -B 22 0A 18 39 72 SSSSSSSSSSSSSSSSSSSSSSSSSAAAAAA MAAAMMM
72 434 A D S S+ 0 0 95 30 49 DDDDDDDDDDDDDDDDDDDDDDDDD S SSS
73 435 A N > - 0 0 59 30 45 NNNNNNNNNNNNNNNNNNNNNNNNN S SSS
74 436 A A H >>S+ 0 0 8 30 71 AAAAAAAAAAAAAAAAAAAAAAPAA R RRR
75 437 A D H >5S+ 0 0 112 30 98 DDDDDDDDDDDDDDDDDDDEDEDDD F FFF
76 438 A D H 45S+ 0 0 127 30 0 DDDDDDDDDDDDDDDDDDDDDDDDD D DDD
77 439 A L H <5S- 0 0 16 30 7 LLLLLLLLLLLLLLLLLLLLLLLLL M MMM
78 440 A G H X5S- 0 0 3 30 36 GGGGGGGGGGGGGGGGGGGGGGGGG S NSS
79 441 A G T <