Construction of a substitute speech generation technique based on the input of articulatory motion

Research Project

Project/Area Number	21K11965
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Review Section	Basic Section 61010:Perceptual information processing-related
Research Institution	Kyushu University
Principal Investigator	Kaburagi Tokihiko 九州大学, 芸術工学研究院, 教授 (30325568)
Project Period (FY)	2021-04-01 – 2024-03-31
Project Status	Completed (Fiscal Year 2023)
Budget Amount *help	¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000) Fiscal Year 2023: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000) Fiscal Year 2022: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000) Fiscal Year 2021: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords	音声合成 / 代用発声 / 調音運動 / 口唇動画 / 機械学習 / ニューラルネットワーク / 発声障害 / 転移学習 / トランスフォーマー / リアルタイムMRI / 敵対的生成ネットワーク / MRI / 系列変換モデル / 調音観測
Outline of Research at the Start	本研究では、喉頭疾患による発声障害者が音声コミュニケーションを維持できるようにするために、発話時の口の動きだけから音声を合成し、意図の伝達を可能にする、代用発声技術を検討する。システムへ入力される発話動作から機械学習によって音声を生成するため、口唇動画とリアルタイムMRIを用いて、複数話者の調音・音声パラレルコーパス作成を行う。さらに、畳み込み層を基としてEnd-to-endで了解性の高い音声を生成する検討を行う。
Outline of Final Research Achievements	In this project, a model for synthesizing speech from motion of the lips was constructed as a tool of substitute speech, that can help laryngectomees maintain voice communication, and a set of Japanese speech corpus was gathered for training the model. The model comprises an encoder, by which low-dimensional speech features are extracted from the motion input, and a decoder, by which mel-spectrogram is estimated as the output. As a result of experiments, the model is capable of estimating, not only the acoustic characteristic of the vocal tract, but also the pitch conture for expressing the accent and intonation. The synthesized speech was intelligible. In addition, a model was studied for estimating the motion of the vocal tract, which was measured using a real time MRI, from speech.
Academic Significance and Societal Importance of the Research Achievements	喉頭癌などの重度の疾患で喉頭を摘出した場合、その後の一生において日常のコミュニケーションに大きな支障をきたす。喉頭摘出者の代用発声法としては、電気式人工喉頭や食道の粘膜を声帯の代わりに振動させる食道発声などがあるが、それぞれ、抑揚のない機械的な発声になる、胃に空気を取り込むため高齢者では習得が難しいなどの問題がある。超高齢化した社会状況に鑑みても、喉頭疾患によるコミュニケーションの喪失に対処し得る情報技術の創出は不可欠であり、本研究で検討した新しい代用発声技術が意味を持つと考えられる。

Report

(4 results)

2023 Annual Research Report Final Research Report ( PDF )
2022 Research-status Report
2021 Research-status Report

Research Products
(18 results)

All 2024 2023 2022 2021 Other

All Journal Article (4 results) (of which Peer Reviewed: 3 results, Open Access: 2 results) Presentation (10 results) (of which Int'l Joint Research: 1 results) Book (1 results) Remarks (3 results)

[Journal Article] Numerical method for analyzing steady-state oscillation in trumpets2023
- Author(s)
  Kaburagi Tokihiko、Kuroki Chiho、Hidaka Shunsuke、Ishikawa Satoshi
- Journal Title
  
  Acoustical Science and Technology
  
  Volume: 44 Issue: 3 Pages: 269-280
- DOI
  10.1250/ast.44.269
- ISSN
  0369-4232, 1346-3969, 1347-5177
- Year and Date
  2023-05-01
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Vocal fold vibration of the whistle register observed by high-speed digital imaging2023
- Author(s)
  Kato Hikari、Lee Yogaku、Wakamiya Kohei、Nakagawa Takashi、Kaburagi Tokihiko
- Journal Title
  
  Journal of Voice
  
  Volume: -
- DOI
  10.1016/j.jvoice.2023.08.026
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data2022
- Author(s)
  Shunsuke Hidaka, Yogaku Lee, Moe Nakanishi, Kohei Wakamiya, Takashi Nakagawa, Tokihiko Kaburagi
- Journal Title
  
  Journal of Voice
  
  Volume: - Issue: 3 Pages: 846.e1-846.e23
- DOI
  10.1016/j.jvoice.2022.10.020
- Related Report
  2022 Research-status Report
- Peer Reviewed
[Journal Article] 磁気共鳴画像(MRI)を用いた管楽器吹奏時の声道計測2021
- Author(s)
  鏑木時彦
- Journal Title
  
  日本音響学会誌
  
  Volume: 77 Pages: 572-579
- NAID
  130008095429
- Related Report
  2021 Research-status Report
[Presentation] 転移学習を用いた少量データからの口唇動画音声合成2024
- Author(s)
  藤田直明，南汰翼，鏑木時彦
- Organizer
  日本音響学会春季研究発表会
- Related Report
  2023 Annual Research Report
[Presentation] 自己回帰及び非自己回帰モデルによる口唇動画を用いた音声合成2023
- Author(s)
  南汰翼，藤田直明，鏑木時彦
- Organizer
  日本音響学会秋季研究発表会
- Related Report
  2023 Annual Research Report
[Presentation] 高速度ディジタル撮像を用いたボーカルフライ声区における声帯振動の分析2023
- Author(s)
  加藤日花里，李庸學，鏑木時彦，若宮幸平
- Organizer
  日本音響学会秋季研究発表会
- Related Report
  2023 Annual Research Report
[Presentation] 発声における仮声帯振動の影響に関する数値流体解析2023
- Author(s)
  鏑木時彦，加藤日花里，李庸學
- Organizer
  日本音響学会秋季研究発表会
- Related Report
  2023 Annual Research Report
[Presentation] TransformerとGANを用いた口唇動画音声合成2023
- Author(s)
  藤田直明，南汰翼，鏑木時彦
- Organizer
  日本音響学会春季研究発表会
- Related Report
  2022 Research-status Report
[Presentation] An investigation of the effectiveness of phase for audio classification2022
- Author(s)
  Shunsuke Hidaka, Kohei Wakamiya, and Tokihiko Kaburagi
- Organizer
  IEEE ICASSP 2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] 自己回帰及び非自己回帰モデルによる口唇動画を用いた音声合成2022
- Author(s)
  南汰翼，藤田直明，鏑木時彦
- Organizer
  日本音響学会九州支部　学生のための研究発表会
- Related Report
  2022 Research-status Report
[Presentation] 系列変換モデルを用いた口唇動画からの複数話者音声合成2022
- Author(s)
  江崎蓮, 鏑木時彦
- Organizer
  日本音響学会春季研究発表会
- Related Report
  2021 Research-status Report
[Presentation] 系列変換モデルを用いた口唇動画・音声変換システムに関する研究2021
- Author(s)
  江崎蓮, 鏑木時彦
- Organizer
  日本音響学会九州支部学生のための研究発表会
- Related Report
  2021 Research-status Report
[Presentation] 音分類課題において有効な位相情報の表現に関する検討2021
- Author(s)
  日髙駿介, 若宮幸平, 鏑木時彦
- Organizer
  日本音響学会秋季研究発表会
- Related Report
  2021 Research-status Report
[Book] 音響学講座　音声（上）2021
- Author(s)
  滝口哲也（編著）鏑木時彦他（著）
- Total Pages
  309
- Publisher
  コロナ社
- ISBN
  9784339013665
- Related Report
  2021 Research-status Report
[Remarks] 九州大学研究者情報
- URL
  https://hyoka.ofc.kyushu-u.ac.jp/search/details/K002357/index.html
- Related Report
  2023 Annual Research Report
[Remarks] 九州大学　研究者情報
- URL
  https://hyoka.ofc.kyushu-u.ac.jp/search/details/K002357/research.html
- Related Report
  2022 Research-status Report
[Remarks] 九州大学研究者情報　鏑木時彦
- URL
  https://hyoka.ofc.kyushu-u.ac.jp/search/details/K002357/index.html
- Related Report
  2021 Research-status Report

Construction of a substitute speech generation technique based on the input of articulatory motion

Principal Investigator

Kaburagi Tokihiko 九州大学, 芸術工学研究院, 教授 (30325568)

¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000)

Report

Research Products

[Journal Article] Numerical method for analyzing steady-state oscillation in trumpets2023

Author(s)

Journal Title

DOI

ISSN

Year and Date

Related Report

[Journal Article] Vocal fold vibration of the whistle register observed by high-speed digital imaging2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 磁気共鳴画像(MRI)を用いた管楽器吹奏時の声道計測2021

Author(s)

Journal Title

NAID

Related Report

[Presentation] 転移学習を用いた少量データからの口唇動画音声合成2024

Author(s)

Organizer

Related Report

[Presentation] 自己回帰及び非自己回帰モデルによる口唇動画を用いた音声合成2023

Author(s)

Organizer

Related Report

[Presentation] 高速度ディジタル撮像を用いたボーカルフライ声区における声帯振動の分析2023

Author(s)

Organizer

Related Report

[Presentation] 発声における仮声帯振動の影響に関する数値流体解析2023

Author(s)

Organizer

Related Report

[Presentation] TransformerとGANを用いた口唇動画音声合成2023

Author(s)

Organizer

Related Report

[Presentation] An investigation of the effectiveness of phase for audio classification2022

Author(s)

Organizer

Related Report

[Presentation] 自己回帰及び非自己回帰モデルによる口唇動画を用いた音声合成2022

Author(s)

Organizer

Related Report

[Presentation] 系列変換モデルを用いた口唇動画からの複数話者音声合成2022

Author(s)

Organizer

Related Report

[Presentation] 系列変換モデルを用いた口唇動画・音声変換システムに関する研究2021

Author(s)

Organizer

Related Report

[Presentation] 音分類課題において有効な位相情報の表現に関する検討2021

Author(s)

Organizer

Related Report

[Book] 音響学講座 音声（上）2021

Author(s)

Total Pages

Publisher

ISBN

Related Report

[Remarks] 九州大学研究者情報

URL

Related Report

[Remarks] 九州大学 研究者情報

URL

Related Report

[Book] 音響学講座　音声（上）2021

[Remarks] 九州大学　研究者情報

[Remarks] 九州大学研究者情報　鏑木時彦