
Acoustic-to-articulatory conversion based on integration of EMA-based measurement and statistical media conversion techniques

Research Project

Project/Area Number 15K12059
Research Category

Grant-in-Aid for Challenging Exploratory Research

Allocation Type Multi-year Fund
Research Field Perceptual information processing
Research Institution The University of Tokyo

Principal Investigator

Minematsu Nobuaki  The University of Tokyo, Graduate School of Engineering (Faculty of Engineering), Professor (90273333)

Co-Investigator (Kenkyū-buntansha) Saito Daisuke  The University of Tokyo, Graduate School of Engineering (Faculty of Engineering), Lecturer (40615150)
Research Collaborator UCHIDA Hidetsugu  
Project Period (FY) 2015-04-01 – 2017-03-31
Project Status Completed (Fiscal Year 2016)
Budget Amount
¥3,640,000 (Direct Cost: ¥2,800,000, Indirect Cost: ¥840,000)
Fiscal Year 2016: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
Fiscal Year 2015: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
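Keywords speech and articulation estimation / EMA / parallel corpus / speaker normalization / structural representation of speech / foreign language learning / articulatory movement measurement / articulatory-acoustic mapping / statistical voice conversion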
Outline of Final Research Achievements

This study integrated three techniques (articulatory measurement, acoustic measurement, and statistical media conversion) to build novel methods that convert a speech signal into the articulatory movement pattern assumed to have produced it. A corpus required to realize these methods was also developed. The main results are as follows. 1) Using electromagnetic articulography (EMA), a parallel corpus of acoustic and articulatory measurements was built from a single Japanese-Chinese bilingual speaker. 2) The performance of acoustic-to-articulatory mapping was improved by introducing speaker normalization techniques. 3) Using the structural representation of speech, a novel technique was built to predict the articulatory movements of a phoneme that the speaker cannot produce correctly. These results were presented at leading conferences on speech communication, and a journal paper was published.

Report

(3 results)
  • 2016 Annual Research Report   Final Research Report (PDF)
  • 2015 Research-status Report
  • Research Products

    (6 results)


Journal Article (2 results) (of which Peer Reviewed: 2 results, Acknowledgement Compliant: 1 result, Open Access: 1 result) Presentation (4 results) (of which Int'l Joint Research: 1 result)

  • [Journal Article] Estimation of articulatory movements of unobserved phonemes using the structural representation of speech (2017)

    • Author(s)
      Uchida Hidetsugu, Saito Daisuke, Minematsu Nobuaki
    • Journal Title

      Journal of the Acoustical Society of Japan

      Volume: in press

    • NAID

      130007382358

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion (2015)

    • Author(s)
      H. Uchida, D. Saito, N. Minematsu, K. Hirose
    • Journal Title

      Proc. INTERSPEECH 2015

      Volume: 1 Pages: 588-592

    • Related Report
      2015 Research-status Report
    • Peer Reviewed / Open Access
  • [Presentation] Measurement of parallel speech and articulation data from a Japanese-Chinese bilingual speaker using a magnetic sensor system (2017)

    • Author(s)
      Uchida Hidetsugu, 橋本哲称, Saito Daisuke, Minematsu Nobuaki
    • Organizer
      Acoustical Society of Japan Spring Meeting
    • Place of Presentation
      Toin University of Yokohama (Kanagawa)
    • Year and Date
      2017-03-15
    • Related Report
      2016 Annual Research Report
  • [Presentation] Prediction of the articulatory movements of unseen phonemes of a speaker using the speech structure of another speaker (2016)

    • Author(s)
      Hidetsugu UCHIDA, Daisuke SAITO, Nobuaki MINEMATSU
    • Organizer
      INTERSPEECH 2016
    • Place of Presentation
      San Francisco (USA)
    • Year and Date
      2016-09-08
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A study on estimating unobserved articulatory movements using the structural representation of speech (2016)

    • Author(s)
      Uchida Hidetsugu, Saito Daisuke, Minematsu Nobuaki
    • Organizer
      IEICE Speech Research Meeting
    • Place of Presentation
      Sunpian Kawasaki (Kawasaki, Kanagawa)
    • Year and Date
      2016-01-14
    • Related Report
      2015 Research-status Report
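  • [Presentation] An experimental study on estimating unobserved articulatory movements using the structural representation of speech (2015)

    • Author(s)
      Uchida Hidetsugu, Saito Daisuke, Minematsu Nobuaki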
    • Organizer
      Acoustical Society of Japan Autumn Meeting
    • Place of Presentation
      University of Aizu (Aizuwakamatsu, Fukushima)
    • Year and Date
      2015-09-16
    • Related Report
      2015 Research-status Report

Published: 2015-04-16   Modified: 2018-03-22  