PFRMAT AL 
TARGET T0075 
AUTHOR 3873-9906-1225 
REMARK Submission 1 
REMARK Work by Gidon Moont (1) , Lawrence Kelley (1), 
REMARK Bob MacCallum (1), Marcel Turcotte (1) Mansoor Saqi (2) 
REMARK and Michael Sternberg (1) (m.sternberg@icrf.icnet.uk) 
REMARK (1) Biomolecular Modelling Laboratory, 
REMARK Imperial Cancer Research Fund 
REMARK (1) Lincoln's Inn Fields, London WC2A 3PX, UK 
REMARK (2) Bioinformatics Group, GlaxoWellcome, Stevenage, UK 
METHOD 
METHOD Method outline 
METHOD --------------- 
METHOD *************************** 
METHOD - SEE NEW METHOD 3D-PSSM --- 
METHOD ***************************** 
METHOD unknown = target, library of known folds = template 
METHOD (0) Initial check for remote homology of target 
METHOD to templates of known structures using PSI-BLAST 
METHOD (1) Secondary structure & sequence target against fold 
METHOD template library using FOLDFIT 
METHOD (2) Multiple structure / multiple sequence matching 
METHOD  against fold template library (3D-PSSM) *** NEW METHOD*** 
METHOD (3) Search against Hidden Markov Models for fold template 
METHOD library using SAM 
METHOD (4) Local hydrophobicity and predicted secondary structure 
METHOD matched for target and template using SIVA (MacCallum & 
METHOD Thornton) 
METHOD (5) Filter top hits from above against topological rules 
METHOD for folds derived by an artificial intelligent type machine 
METHOD learning approach  (PROGOL) , Turcotte, Muggleton & 
METHOD Sternberg) 
METHOD (6) Evaluation of above results in terms of literature and 
METHOD function of target. 
METHOD 
METHOD General features of approach 
METHOD ----------------------------- 
METHOD 
METHOD (i) The fold (template) library consists of non-redundant 
METHOD SCOP domains with <40% sequence identity per family (called 
METHOD SCOP40). 
METHOD 
METHOD (ii) Secondary structure prediction from multiple alignment 
METHOD (homologues gathered with PSI-BLAST) DSC (King & 
METHOD Sternberg); PHD (Rost & Sander); JPRED (Barton) 
METHOD 
METHOD Method details 
METHOD -------------- 
METHOD 
METHOD (1) FOLDFIT (Russell,R.B., Saqi, M.A.S., Bates,P.A., 
METHOD Sayle,R.A.  & Sternberg, M.J.E. (1998). Prot Eng 11, 1-9.) 
METHOD The target is represented by sequence and predicted 
METHOD secondary structure and scanned against known secondary 
METHOD structure and sequence for template in fold library. 
METHOD Different weights for secondary structure and sequence are 
METHOD used to obtain different possible top hits. 
METHOD 
METHOD (2) 3D-PSSM - Structures within the same SCOP fold family 
METHOD are aligned in 3D and if structures can be superposed well 
METHOD then each is used together with all homologous 
METHOD sequences in sequence database found by PSI-BLAST. 
METHOD These 3D-PSSMs were generated for each template. 
METHOD The target is matched against each template, 
METHOD (3D-PSSM, Kelley, MacCallum, Saqi & Sternberg, unpublished). 
METHOD NOW INCLUDING PREDICTED SECONDARY STRUCTURE 
METHOD as in FOLDFIT. 
METHOD 
METHOD (3) HMM from SAM (Hughley & Krogh ) against a 
METHOD library generated from each 
METHOD template in SCOP40 (Moont, MacCallum & Sternberg). 
METHOD 
METHOD (4) Vector-based alignment of per-residue hydrophobicity 
METHOD and DSC predicted secondary structure probabilities for 
METHOD both target and template. This approach could also 
METHOD be used in the absence of known structures for library 
METHOD sequences.  Algorithm is SIVA (MacCallum & Thornton, 
METHOD unpublished) 
METHOD 
METHOD (5) Using an artificial intelligence based machine learning 
METHOD algorithm (PROGOL, Muggleton et al), we have obtained 
METHOD expert system type rules governing protein folds (Turcotte, 
METHOD Muggleton & Sternberg).  These rules include data on 
METHOD patterns and types of secondary structures including 
METHOD length, loop length and hydrophobicity.  Top hits from all 
METHOD the above methods were screened against rules for the folds 
METHOD to assess their likelihood. 
METHOD 
METHOD (6) Visual inspection of results. 
METHOD 
METHOD Detils for T0075 
METHOD ---------------- 
METHOD examination of top hits failed to find a match 
METHOD that was sensible structurally and functionally. 
METHOD We therefore suggest it is a new fold 
METHOD 
MODEL     1 
PARENT NONE 
TER 
END 
