• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Fully automated protein NMR assignments and structures from raw time-domain data by deep learning

Research Project

Project/Area Number 23K05660
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Review Section Basic Section 43020:Structural biochemistry-related
Research InstitutionTokyo Metropolitan University

Principal Investigator

PETER GUENTERT  東京都立大学, 理学研究科, 客員教授 (20392110)

Co-Investigator(Kenkyū-buntansha) 池谷 鉄兵  東京都立大学, 理学研究科, 准教授 (30457840)
伊藤 隆  東京都立大学, 理学研究科, 教授 (80261147)
Project Period (FY) 2023-04-01 – 2026-03-31
Project Status Granted (Fiscal Year 2023)
Budget Amount *help
¥4,810,000 (Direct Cost: ¥3,700,000、Indirect Cost: ¥1,110,000)
Fiscal Year 2025: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2024: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Fiscal Year 2023: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
KeywordsNMR / machine learning / automated assignment / protein structure / protein / spectra analysis / ARTINA
Outline of Research at the Start

Using generated training data, we will apply deep learning for (i) a better treatment of time-domain data acquired with non-uniform sampling (NUS) than conventional approaches, (ii) resolution enhancement by virtual decoupling, and (iii) deconvolution of highly overlapped signals. In this way, time-consuming or costly experimental methods for obtaining better NMR spectra can be replaced by computational alternatives, thereby increasing the efficiency, completeness, and quality of NMR spectra analysis.

Outline of Annual Research Achievements

The ARTINA workflow for machine learning-based automated NMR spectra analysis was combined with AlphaFold structure prediction and UCBShift chemical shift prediction in order to drastically reduce the amount of NMR spectra that are required for obtaining the chemical shift assignment of a protein. Extensive studies have been performed to identify the optimal sets of NMR spectra for the assignment of the backbone or all chemical shifts in a protein. This was published in Klukowski et al., Science Advances 9, eadi9323 (2023).

In addition, ARTINA was generalized to additional biomacromolecular systems. Originally, ARTINA was designed exclusively for monomeric proteins composed of standard amino acid residues. This restriction has been lifted to enable automated NMR spectra analysis also for protein-protein, protein-small molecule ligand, RNA and DNA systems. This significantly extends the applicability of the method to biologically important systems.

A large scale data set comprising more than 1300 multidimensional NMR spectra, from which the chemical shift assignments and three-dimensional structures of 100 proteins can be obtained, has been published as open research data for general use by the NMR research community. Published in Klukowski et al., Nature Scientific Data 11, 30 (2024).

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

Research progressed as planned. In particular, we started to develop new machine learning models specifically for analyzing two-dimensional homonuclear NMR spectra of proteins (see below).

Strategy for Future Research Activity

As a new direction of research, we have started to implement new machine learning models specifically for the purpose of analyzing two-dimensional 1H-1H NMR spectra of proteins up to ~20 kDa. If successful, this will enable efficient NMR studies of proteins without isotope labeling and requiring much less NMR measurement time.

Training and testing data is crucial for machine learning applications, which constitutes a limiting factor for its use in biological NMR spectroscopy. In order to collect and make available a larger and more diverse set of multi-dimensional NMR spectra, we are developing a new public website and data repository for the upload, storage, and access of primary biomolecular NMR data, i.e. spectra or time-domain data. This should become available for general use by the NMR research community in the near future.

Report

(1 results)
  • 2023 Research-status Report
  • Research Products

    (12 results)

All 2024 2023 Other

All Journal Article (6 results) (of which Int'l Joint Research: 6 results,  Peer Reviewed: 6 results,  Open Access: 6 results) Presentation (3 results) (of which Int'l Joint Research: 2 results,  Invited: 3 results) Remarks (3 results)

  • [Journal Article] The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis2024

    • Author(s)
      Klukowski Piotr、Damberger Fred F.、Allain Frederic H.-T.、Iwai Hideo、Kadavath Harindranath、Ramelot Theresa A.、Montelione Gaetano T.、Riek Roland、Guentert Peter
    • Journal Title

      Scientific Data

      Volume: 11 Issue: 1 Pages: 30-30

    • DOI

      10.1038/s41597-023-02879-5

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] 1H, 13C, and 15N resonance assignments and solution structure of the N-terminal divergent calponin homology (NN-CH) domain of human intraflagellar transport protein 542024

    • Author(s)
      Kuwasako Kanako、Dang Weirong、He Fahu、Takahashi Mari、Tsuda Kengo、Nagata Takashi、Tanaka Akiko、Kobayashi Naohiro、Kigawa Takanori、Guentert Peter、Shirouzu Mikako、Yokoyama Shigeyuki、Muto Yutaka
    • Journal Title

      Biomolecular NMR Assignments

      Volume: 18 Issue: 1 Pages: 71-78

    • DOI

      10.1007/s12104-024-10170-w

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] NMRtist: an online platform for automated biomolecular NMR spectra analysis2023

    • Author(s)
      Klukowski Piotr, Riek Roland, Guentert Peter
    • Journal Title

      Bioinformatics

      Volume: 39 Issue: 2

    • DOI

      10.1093/bioinformatics/btad066

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Facilitating the structural characterisation of non-canonical amino acids in biomolecular NMR2023

    • Author(s)
      Kuschert Sarah、Stroet Martin、Chin Yanni Ka-Yan、Conibear Anne Claire、Jia Xinying、Lee Thomas、Bartling Christian Reinhard Otto、Stromgaard Kristian、Guentert Peter、Rosengren Karl Johan、Mark Alan Edward、Mobli Mehdi
    • Journal Title

      Magnetic Resonance

      Volume: 4 Issue: 1 Pages: 57-72

    • DOI

      10.5194/mr-4-57-2023

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Chemical shift transfer: an effective strategy for protein NMR assignment with ARTINA2023

    • Author(s)
      Wetton Henry、Klukowski Piotr、Riek Roland、Guentert Peter
    • Journal Title

      Frontiers in Molecular Biosciences

      Volume: 10 Pages: 1244029-1244029

    • DOI

      10.3389/fmolb.2023.1244029

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Time-optimized protein NMR assignment with an integrative deep learning approach using AlphaFold and chemical shift prediction2023

    • Author(s)
      Klukowski Piotr、Riek Roland、Guentert Peter
    • Journal Title

      Science Advances

      Volume: 9 Issue: 47

    • DOI

      10.1126/sciadv.adi9323

    • Related Report
      2023 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] Automated biomolecular NMR spectra analysis, assignments, and structures with the deep learning technique ARTINA2024

    • Author(s)
      Guentert Peter
    • Organizer
      The ESPERANCE Project Practical Course
    • Related Report
      2023 Research-status Report
    • Invited
  • [Presentation] Accelerating protein chemical shift assignment by deep learning for visual spectra analysis, structure and shift prediction2023

    • Author(s)
      Guentert Peter
    • Organizer
      iNEXT-Discovery 2023 Conference
    • Related Report
      2023 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Structure-based protein chemical shift assignment from minimal NMR data with the hybrid deep learning approach ARTINA2023

    • Author(s)
      Guentert Peter
    • Organizer
      EUROMAR 2023 Conference
    • Related Report
      2023 Research-status Report
    • Int'l Joint Research / Invited
  • [Remarks] NMRtist cloud computing service

    • URL

      https://nmrtist.org/

    • Related Report
      2023 Research-status Report
  • [Remarks] The 100-protein NMR spectra dataset

    • URL

      https://doi.org/10.3929/ethz-b-000630211

    • Related Report
      2023 Research-status Report
  • [Remarks] The 100-protein NMR spectra dataset

    • URL

      https://nmrdb.ethz.ch/

    • Related Report
      2023 Research-status Report

URL: 

Published: 2023-04-13   Modified: 2024-12-25  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi