Speech synthesis based on articulatory movement HMM and LSP digital filter

Research Project

Project/Area Number	16K00234
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Perceptual information processing
Research Institution	Tokyo University of Science
Principal Investigator	Katsurada Kouichi Tokyo University of Science, Faculty of Science and Technology, Department of Information Sciences, Associate Professor (80324490)
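Co-Investigator(Kenkyū-buntansha)	Nitta Tsuneo Waseda University, Green Computing Systems Research Organization, Other (Invited Researcher) (70314101); Makino Takehiko Chuo University, Faculty of Economics, Professor (00269482); Kanazawa Yasushi Toyohashi University of Technology, Graduate School of Engineering, Associate Professor (50214432)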
Research Collaborator	Kaburagi Tokihiko; Wakamiya Kohei
Project Period (FY)	2016-04-01 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount	¥4,680,000 (Direct Cost: ¥3,600,000, Indirect Cost: ¥1,080,000); Fiscal Year 2018: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000); Fiscal Year 2017: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000); Fiscal Year 2016: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
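Keywords	articulatory movement / speech synthesis / database construction / corpus creation / IPA labeling / voice quality conversion / LSP parameters / AutoEncoder / speaker conversion / articulatory movement HMM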
Outline of Final Research Achievements	We investigated how to synthesize speech from articulatory features that represent the movement of the lips and tongue during human speech. During the first half of the project period, we built a speech synthesizer driven by features that parameterize actual lip and tongue movement. We then collected lip and tongue movement data using EMA (electromagnetic articulography). The movement data were recorded from a male announcer last year, and we are now annotating them with IPA (International Phonetic Alphabet) labels.
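Academic Significance and Societal Importance of the Research Achievements	In recent years, advances in deep learning and related techniques have dramatically improved the quality of speech synthesis. However, because typical speech synthesis does not use detailed features of human pronunciation, it has difficulty handling distinctly human phenomena such as pronunciation errors and changes in voice quality. The articulatory-movement-based speech synthesis pursued in this research follows an approach close to the mechanism of human speech production, so it may be able to handle such distinctly human changes in voice. Applying such a synthesis model to tasks such as recognizing other people's speech could also provide auxiliary information for recognizing not only the linguistic content but also underlying changes in the manner of speech production (for example, that the speaker has caught a cold or has pain in the mouth).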

Report

(4 results)

2018 Annual Research Report, Final Research Report (PDF)
2017 Research-status Report
2016 Research-status Report

Research Products
(15 results)


Journal Article (1 result) (of which Int'l Joint Research: 1, Peer Reviewed: 1, Open Access: 1, Acknowledgement Compliant: 1) / Presentation (14 results) (of which Int'l Joint Research: 3)

[Journal Article] Using Reversed Sequences and Grapheme Generation Rules to Extend the Feasibility of a Phoneme Transition Network-Based Grapheme-to-Phoneme Conversion (2016)
- Author(s)
  Seng Kheang, Kouichi Katsurada, Yurie Iribe and Tsuneo Nitta
- Journal Title
  IEICE Transactions on Information and Systems
  Volume: E99.D, Issue: 4, Pages: 1182-1192
- DOI
  10.1587/transinf.2015EDP7349
- NAID
  130005141390
- ISSN
  0916-8532, 1745-1361
- Related Report
  2016 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
[Presentation] Identification of Spoken-Language Information Using EEG During Speech (2018)
- Author(s)
  深井健大郎，大村英史，桂田浩一，平田里佳，入部百合絵，新田恒雄
- Organizer
  5th Silent Speech Recognition Workshop
- Related Report
  2018 Annual Research Report
[Presentation] Lip Reading Using Active Appearance Models (2018)
- Author(s)
  小口優人，大村英史，桂田浩一
- Organizer
  5th Silent Speech Recognition Workshop
- Related Report
  2018 Annual Research Report
[Presentation] Conference Report (INTERSPEECH 2018) (2018)
- Author(s)
  Kouichi Katsurada
- Organizer
  5th Silent Speech Recognition Workshop
- Related Report
  2018 Annual Research Report
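[Presentation] Performance Evaluation of Polyphonic Sound Analysis Using a Variational Autoencoder (2018)
- Author(s)
  森口寛生，大村英史，桂田浩一
- Organizer
  80th National Convention of the Information Processing Society of Japan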
- Related Report
  2017 Research-status Report
[Presentation] Recognition of Japanese Short Syllables from EEG (2018)
- Author(s)
  Tsuneo Nitta, Kouichi Katsurada, Takumaru Kanzaki
- Organizer
  4th Silent Speech Recognition Workshop
- Related Report
  2017 Research-status Report
[Presentation] A Study on Optimizing Keyword Segmentation in Fast STD Using a Suffix Array (2017)
- Author(s)
  Kouichi Katsurada
- Organizer
  Acoustical Society of Japan 2017 Spring Meeting
- Place of Presentation
  Meiji University (Ikuta Campus)
- Year and Date
  2017-03-15
- Related Report
  2016 Research-status Report
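[Presentation] EEG Analysis During Speech and Recall of Japanese Syllables (2017)
- Author(s)
  Kohei Asahara, Jozi Nakane, Takumaru Kanzaki, Kouichi Katsurada, Shunji Sugimoto, Tsuneo Nitta, Junsei Horikawa
- Organizer
  Acoustical Society of Japan 2017 Spring Meeting
- Place of Presentation
  Meiji University (Ikuta Campus)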
- Year and Date
  2017-03-15
- Related Report
  2016 Research-status Report
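[Presentation] Comparison of Japanese Short-Syllable Recognition from EEG During Speech and During Recall (2017)
- Author(s)
  Takumaru Kanzaki, Kohei Asahara, Jozi Nakane, Kouichi Katsurada, Shunji Sugimoto, Junsei Horikawa, Tsuneo Nitta
- Organizer
  Acoustical Society of Japan 2017 Spring Meeting
- Place of Presentation
  Meiji University (Ikuta Campus)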
- Year and Date
  2017-03-15
- Related Report
  2016 Research-status Report
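[Presentation] Face-Orientation-Independent Speech Recognition Using a Symmetric 3D-AAM of Facial Images (2017)
- Author(s)
  Takuya Watanabe, Kouichi Katsurada, Yasushi Kanazawa
- Organizer
  IEICE Technical Report, PRMU2016-127
- Place of Presentation
  Kyoto University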
- Year and Date
  2017-01-19
- Related Report
  2016 Research-status Report
[Presentation] EEG during Japanese syllable recall and speech tasks (2016)
- Author(s)
  Kohei Asahara, Jozi Nakane, Takumaru Kanzaki, Shunji Sugimoto, Kouichi Katsurada, Tsuneo Nitta, and Junsei Horikawa
- Organizer
  The 3rd Annual Meeting of the Society for Bioacoustics
- Place of Presentation
  Tahara, Japan
- Year and Date
  2016-12-10
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Presentation] Japanese monosyllable recognition from EEG (2016)
- Author(s)
  Takumaru Kanzaki, Shunji Sugimoto, Kouichi Katsurada, Junsei Horikawa, and Tsuneo Nitta
- Organizer
  The 3rd Annual Meeting of the Society for Bioacoustics
- Place of Presentation
  Tahara, Japan
- Year and Date
  2016-12-10
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Presentation] Lip Reading from Multi View Facial Images Using 3D-AAM (2016)
- Author(s)
  Takuya Watanabe, Kouichi Katsurada, and Yasushi Kanazawa
- Organizer
  ACCV2016 Workshops
- Place of Presentation
  Taipei, Taiwan
- Year and Date
  2016-11-20
- Related Report
  2016 Research-status Report
- Int'l Joint Research
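[Presentation] EEG Analysis During Speech and Recall of Japanese Monosyllables (2016)
- Author(s)
  浅原康平，中根丈司，神崎卓丸，中澤香太，桂田浩一，杉本俊二，新田恒雄，堀川順生
- Organizer
  Acoustical Society of Japan 2016 Autumn Meeting
- Place of Presentation
  University of Toyama, Gofuku Campus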
- Year and Date
  2016-09-14
- Related Report
  2016 Research-status Report
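[Presentation] A Study of Methods for Recognizing Japanese Monosyllables from EEG (2016)
- Author(s)
  神崎卓丸，浅原康平，中根丈司，中澤香太，桂田浩一，杉本俊二，堀川順生，新田恒雄
- Organizer
  Acoustical Society of Japan 2016 Autumn Meeting
- Place of Presentation
  University of Toyama, Gofuku Campus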
- Year and Date
  2016-09-14
- Related Report
  2016 Research-status Report
