The structure of the 216-residue mature enzyme is already known at
1.8 Å resolution (PDB: 1PPO). The "new" information is the structure of the
proregion (107 residues). The structure of a homologous proenzyme,
cathepsin B, which has a shorter proregion (62 residues), was recently
published by Cygler's group.
Note: the EMBL sequence below includes a signal sequence.
EMBL sequence:
MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLI
QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEF
NEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV
ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYKA
KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF
EGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK
SSYYPTKN
The sequence above includes the signal peptide. In the construct expressed
in E. coli, the signal peptide was replaced by a new initial methionine:
MDFSIVGYSQDDLTSTERLI
QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEF
NEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV
ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYKA
KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF
EGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK
SSYYPTKN
The initial (new) methionine was probably removed.
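
The relationship between the EMBL sequence and the expressed construct can be
checked with a short Python sketch. This is only an illustration: the signal
peptide boundary used here (the first 26 residues, MAMIPSISKLLFVAICLFVHMSVSFG)
is an assumption inferred by aligning the two sequences given above, and the
function names are hypothetical.

```python
# EMBL precursor sequence as given in the text (signal peptide included).
EMBL_PRECURSOR = (
    "MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLI"
    "QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEF"
    "NEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV"
    "ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYKA"
    "KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF"
    "EGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK"
    "SSYYPTKN"
)

# Assumption: boundary inferred by comparing the two sequences in the text,
# not an experimentally stated cleavage site.
SIGNAL_PEPTIDE = "MAMIPSISKLLFVAICLFVHMSVSFG"


def expressed_construct(precursor, signal=SIGNAL_PEPTIDE):
    """Replace the signal peptide with a single new initiator methionine."""
    if not precursor.startswith(signal):
        raise ValueError("precursor does not start with the expected signal peptide")
    return "M" + precursor[len(signal):]


def processed_product(construct):
    """Drop the new N-terminal Met, which the text says was probably removed."""
    return construct[1:] if construct.startswith("M") else construct


construct = expressed_construct(EMBL_PRECURSOR)
product = processed_product(construct)

# Report the lengths of the precursor, the construct, and the processed product.
print("EMBL precursor:     ", len(EMBL_PRECURSOR), "residues")
print("Expressed construct:", len(construct), "residues")
print("After Met removal:  ", len(product), "residues")
```

The printed lengths can be compared with the residue counts quoted above for
the proregion and the mature enzyme.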