Construction of a support vector machine (SVM) for accurately detecting domain boundaries
Project/Area Number |
18500225
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Bioinformatics/Life informatics
|
Research Institution | Tokyo University of Agriculture and Technology |
Principal Investigator |
KURODA Yutaka Tokyo University of Agriculture and Technology, Institute of Symbiotic Science and Technology, Associate professor (10312240)
|
Co-Investigator(Kenkyū-buntansha) |
TOH Hiroyuki Kyushu University, Medical Institute of Bioregulation, Professor (70192656)
|
Project Period (FY) |
2006 – 2007
|
Project Status |
Completed (Fiscal Year 2007)
|
Budget Amount *help |
¥3,880,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥480,000)
Fiscal Year 2007: ¥2,080,000 (Direct Cost: ¥1,600,000、Indirect Cost: ¥480,000)
Fiscal Year 2006: ¥1,800,000 (Direct Cost: ¥1,800,000)
|
Keywords | Domain boundary prediction / Support Vector Machine(SVM) / Strutural domain / Domain linkers / Machine learning / Large scale / computer aided / 機会学習 |
Research Abstract |
The prediction of structural domains in novel protein sequences is becoming of practical importance. One important area of application is the development of computer-aided techniques for identifying, at a low cost, novel protein domain targets for large-scale functional and structural proteomics. Traditional computer-aided methods rely on sequence similarity to domain databases such as Pfam, Prosite or SMART. However, methods that work independently from domain databases are required for detecting novel domains. Such methods need to predict domain regions solely from the information contained in the amino acid sequence of the protein of interest. In this project, we report a Support Vector Machine (SVM) that identifies domain linkers, which are loops separating two structural domains. We constructed a multi-domain protein database based on SCOP and CATH domain boundary definition. First, we selected domains that do not form inter-domain interactions, as defined by the presence of inter-domain VdW, Hbonds and SS-bonds, and are independently foldable. From this set, we further selected domain boundaries that form loops as defined by DSSP. Domain boundaries that fulfilled both conditions were called linkers and used for training and testing the SVM. We developed a domain linker prediction (DLP-SVM) based on SVMlight. The sensitivity and the specificity were, respectively, 46.8% and 57.1%. These values are over 5.1 and 6.8%, respectively, higher than previously reported methods. DLP-SVM is freely available at : http://www.tuat.ac.jp/~ domserv/cgi-bin/DLP-SVM.cgi A travel grant from the Protein Science Society of Japan was awarded to PhD student Teippei Ebina for presenting this research at the PRICIPS 2008 conference held in Cairns, Australia, June 2008.
|
Report
(3 results)
Research Products
(56 results)
-
-
-
-
-
-
-
[Journal Article] Mutational analysis of protein solubility enhancement using short peptide tags2007
Author(s)
Kato, A., Maki K., Ebina, T., Kuwajima, K., Soda, K., and Kuroda, Y.
-
Journal Title
Biopolymers 84(2)
Pages: 12-18
Description
「研究成果報告書概要(和文)」より
Related Report
Peer Reviewed
-
-
-
-
-
-
[Journal Article] Mutational analysis of protein solubility enhancement using short peptide tags2007
Author(s)
A., Kato, K., Maki, T., Ebina, K., Kuwajima, K., Soda, Y., Kuroda
-
Journal Title
Biopolymers 84(2)
Pages: 12-18
Description
「研究成果報告書概要(欧文)」より
Related Report
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
[Presentation] Prediction of protein structural flexibility from an amino acid sequence2007
Author(s)
S., Hirose, K., Yokota, Y., Kuroda, H., Wako, S., Kanai, T., Noguchi
Organizer
The 45Th Annual Meeting of the Biophysical Society of Japan
Place of Presentation
Yokohama
Description
「研究成果報告書概要(欧文)」より
Related Report
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-