CASP2 Target T0012

Received Sun Jun 2 0:51:18 BST 1996

1. Protein Name
Procaricain
2. Organism Name
Carica papaya
3. Number of amino acids (approx)
107
4. Accession number
EM_PL:CPPRO
5. Sequence Database
EMBL
6. Amino acid sequence
MDFSIVGYSQ DDLTSTERLI QLFNSWMLNH NKFYENVDEK LYRFEIFKDN
LNYIDETNKK NNSYWLGLNE FADLSNDEFN EKYVGSLIDA TIEQSYDEEF
INEDTVN
7. Homologous Sequence of known structure
yes
8. Current state of the experimental work
Refinement and manuscript preparation but only 3.2A resolution

9. Interpretable map?
yes
10. Estimated date of chain tracing completion
Done
11. Estimated date of public release of structure
1.9.96 - 1.10.96
13. Name
John Jenkins
14. Mailing address
Food Macromolecular Science
Institute of Food Research
Earley Gate
Whiteknights Road
Reading RG6 6BZ, UK
15. Telephone
+44 (0)1189 357143
16. Fax
+44 (0)1189 267917
17. Email
john.jenkins@bbsrc.ac.uk
18. Source of information about experiment
Email
12. Additional Information
The structure of the 216 residue mature enzyme is already known at 1.8A resolution (PDB: 1PPO). The "new" information is the structure of the proregion (107 residues). The structure of a homologous proenzyme, cathepsin B, with a shorter proregion (62 residues) has been recently published by the group of Cygler. Note: EMBL sequence contains a signal sequence:
embl sequence:
MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLI
QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEF
NEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV
ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYKA
KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF
EGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK
SSYYPTKN
Note that the signal peptide is included in the above sequence so that the sequence expressed in E. coli was:

MDFSIVGYSQDDLTSTERLI
QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEF
NEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV
ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYKA
KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF
EGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK
SSYYPTKN
and the initial (new) methionine was probably removed

Related Files

Template Sequence file

Template PDB file