2007 Fiscal Year Final Research Report Summary
Construction of a support vector machine (SVM) for accurately detecting domain boundaries
Project/Area Number |
18500225
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Bioinformatics/Life informatics
|
Research Institution | Tokyo University of Agriculture and Technology |
Principal Investigator |
KURODA Yutaka Tokyo University of Agriculture and Technology, Institute of Symbiotic Science and Technology, Associate professor (10312240)
|
Co-Investigator(Kenkyū-buntansha) |
TOH Hiroyuki Kyushu University, Medical Institute of Bioregulation, Professor (70192656)
|
Project Period (FY) |
2006 – 2007
|
Keywords | Domain boundary prediction / Support Vector Machine(SVM) / Strutural domain / Domain linkers / Machine learning / Large scale / computer aided |
Research Abstract |
The prediction of structural domains in novel protein sequences is becoming of practical importance. One important area of application is the development of computer-aided techniques for identifying, at a low cost, novel protein domain targets for large-scale functional and structural proteomics. Traditional computer-aided methods rely on sequence similarity to domain databases such as Pfam, Prosite or SMART. However, methods that work independently from domain databases are required for detecting novel domains. Such methods need to predict domain regions solely from the information contained in the amino acid sequence of the protein of interest. In this project, we report a Support Vector Machine (SVM) that identifies domain linkers, which are loops separating two structural domains. We constructed a multi-domain protein database based on SCOP and CATH domain boundary definition. First, we selected domains that do not form inter-domain interactions, as defined by the presence of inter-domain VdW, Hbonds and SS-bonds, and are independently foldable. From this set, we further selected domain boundaries that form loops as defined by DSSP. Domain boundaries that fulfilled both conditions were called linkers and used for training and testing the SVM. We developed a domain linker prediction (DLP-SVM) based on SVMlight. The sensitivity and the specificity were, respectively, 46.8% and 57.1%. These values are over 5.1 and 6.8%, respectively, higher than previously reported methods. DLP-SVM is freely available at : http://www.tuat.ac.jp/~ domserv/cgi-bin/DLP-SVM.cgi A travel grant from the Protein Science Society of Japan was awarded to PhD student Teippei Ebina for presenting this research at the PRICIPS 2008 conference held in Cairns, Australia, June 2008.
|
Research Products
(41 results)
-
-
-
-
-
-
-
[Journal Article] Mutational analysis of protein solubility enhancement using short peptide tags2007
Author(s)
Kato, A., Maki K., Ebina, T., Kuwajima, K., Soda, K., and Kuroda, Y.
-
Journal Title
Biopolymers 84(2)
Pages: 12-18
Description
「研究成果報告書概要(和文)」より
Peer Reviewed
-
-
-
-
-
-
[Journal Article] Mutational analysis of protein solubility enhancement using short peptide tags2007
Author(s)
A., Kato, K., Maki, T., Ebina, K., Kuwajima, K., Soda, Y., Kuroda
-
Journal Title
Biopolymers 84(2)
Pages: 12-18
Description
「研究成果報告書概要(欧文)」より
-
-
-
-
-
-
-
-
-
[Presentation] Prediction of protein structural flexibility from an amino acid sequence2007
Author(s)
S., Hirose, K., Yokota, Y., Kuroda, H., Wako, S., Kanai, T., Noguchi
Organizer
The 45Th Annual Meeting of the Biophysical Society of Japan
Place of Presentation
Yokohama
Year and Date
20071221-23
Description
「研究成果報告書概要(欧文)」より
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-