2022 Fiscal Year Annual Research Report

脳波と眼球運動を用いた音声生成と知覚の神経メカニズムに関する研究

Research Project

Project/Area Number	20K11883
Research Institution	Japan Advanced Institute of Science and Technology
Principal Investigator	党建武北陸先端科学技術大学院大学, 先端科学技術研究科, 名誉教授 (80334796)
Co-Investigator(Kenkyū-buntansha)	赤木正人北陸先端科学技術大学院大学, 先端科学技術研究科, 名誉教授 (20242571)
Project Period (FY)	2020-04-01 – 2023-03-31
Keywords	音声生成 / 音声理解 / 脳ネットワーク / 脳活動の動的特性 / 音声生成の神経学的モデル
Outline of Annual Research Achievements	本研究では、文を朗読するには視覚、調音及び聴覚プロセスの高度に柔軟な調整が必要であることを着眼して、脳ネットワークでどのようにエンコードまたはデコードを行うかを明らかにすることを目標とした。そのため、リアルタイムのEEG、眼球運動、および音声記録を、脳画像の結果からの空間的に正確なネットワークトポロジと組み合わせることにより、マルチモーダルソリューションを探った。階層的な皮質レベルでの根底にある神経的関連性を明らかにするために、独立成分（IC）に事象関連のスペクトル摂動分析、ICクラスターに効果的な接続分析、機能特定サブネットワークの類似性分析を順次適用した。その結果、前頭前野、前頭葉、および下前頭葉を含むいくつかの高次認知および言語野におけるトップダウンのソースを特定した。これらの高度な認知および言語ネットワークは、早期の活性化と下位の視覚運動システムとの頻繁な相互作用で検出され、文構造の知識によって調整された並行および反復的な相互作用プロセスを示唆した。それに従って、我々は音声生成と音声理解の神経モデルを構築した。人間の感情認識のため高密度EEG 信号に基づいた時空間特徴融合畳み込みグラフ注意ネットワーク (STFCGAT) モデルを構築した。まず、単一チャネルの差分エントロピー (DE) 機能とクロスチャネル機能接続 (FC) 機能を組み合わせて、EEG の時間的変動と空間トポロジー情報の両方を着眼して、DE と FC の機能を融合し、感情表現力の高いグラフ構造情報をさらに抽出した。さらに、グラフニューラルネットワークに多頭注意メカニズムを導入して、モデルの一般化能力を向上させた。その結果、提案したSTFCGAT アーキテクチャの感情認識への有効性を実証した。また、EEG信号を用いて異なる韻律をもつ同じ文章の意図を識別する方法を研究した

Research Products
(12 results)

All 2023 2022 Other

All Int'l Joint Research (2 results) Journal Article (5 results) (of which Int'l Joint Research: 5 results, Peer Reviewed: 5 results, Open Access: 5 results) Presentation (5 results) (of which Int'l Joint Research: 4 results)

[Int'l Joint Research] 天津大学(中国)
- Country Name
  CHINA
- Counterpart Institution
  天津大学
[Int'l Joint Research] Nanyang Tech(シンガポール)
- Country Name
  SINGAPORE
- Counterpart Institution
  Nanyang Tech
[Journal Article] Emotion recognition using spatial-temporal EEG features through convolutional graph attention network.2023
- Author(s)
  Li, Z., Zhang, G., Wang, L., Wei, J., & Dang, J.
- Journal Title
  
  Journal of Neural Engineering
  
  Volume: 20 Pages: 016046
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Detection of brain network communities during natural speech comprehension from functionally aligned EEG sources2022
- Author(s)
  Zhou, D., Zhang, G., Dang, J., Unoki, M., & Liu, X.
- Journal Title
  
  Frontiers in Computational Neuroscience
  
  Volume: 1 Pages: 1-10
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Intrinsic Representation Mining for Zero-Shot Slot Filling2022
- Author(s)
  Li, S., Okada, S., & Dang, J.
- Journal Title
  
  IEICE TRANSACTIONS on Information and Systems
  
  Volume: 105 Pages: 1947-1956.
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling2022
- Author(s)
  S Qin, L Wang, S Li, J Dang, L Pan
- Journal Title
  
  EURASIP Journal on Audio, Speech, and Music Processing
  
  Volume: 1 Pages: 1-10
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] One-shot emotional voice conversion based on feature separation2022
- Author(s)
  W Lu, X Zhao, N Guo, Y Li, J Wei, J Tao, J Dang
- Journal Title
  
  Speech Communication
  
  Volume: 143 Pages: 1-9
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] Investigating the neural responses to continuous speech based on reconstructed source signal from EEG.2022
- Author(s)
  Zhou, D., Zhang, G., Dang, J.
- Organizer
  Acoustical Science and Technology (ROMBUNNO.1-8-5).
[Presentation] Reconstruction of speech spectrogram based on non-invasive EEG signal.2022
- Author(s)
  Zhou, D., Unoki, M., Zhang, G., & Dang, J.
- Organizer
  13th International Symposium on Chinese Spoken Language Processing (ISCSLP)
- Int'l Joint Research
[Presentation] Dialogue scenario classification based on social factors2022
- Author(s)
  Liu, Y., Zhou, D., Unoki, M., Dang, J., & Li, A.
- Organizer
  13th International Symposium on Chinese Spoken Language Processing (ISCSLP)
- Int'l Joint Research
[Presentation] Dual-stream Speech Dereverberation Network Using Long-term and Short-term Cues2022
- Author(s)
  N Li, M Ge, L Wang, J Dang
- Organizer
  2022 International Joint Conference on Neural Networks (IJCNN)
- Int'l Joint Research
[Presentation] Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training2022
- Author(s)
  J He, C Gong, L Wang, D Jin, X Wang, J Xu, J Dang
- Organizer
  Proc. Interspeech 2022
- Int'l Joint Research

2022 Fiscal Year Annual Research Report

脳波と眼球運動を用いた音声生成と知覚の神経メカニズムに関する研究

Principal Investigator

党 建武 北陸先端科学技術大学院大学, 先端科学技術研究科, 名誉教授 (80334796)

Research Products

[Int'l Joint Research] 天津大学(中国)

Country Name

Counterpart Institution

[Int'l Joint Research] Nanyang Tech(シンガポール)

Country Name

Counterpart Institution

[Journal Article] Emotion recognition using spatial-temporal EEG features through convolutional graph attention network.2023

Author(s)

Journal Title

[Journal Article] Detection of brain network communities during natural speech comprehension from functionally aligned EEG sources2022

Author(s)

Journal Title

[Journal Article] Intrinsic Representation Mining for Zero-Shot Slot Filling2022

Author(s)

Journal Title

[Journal Article] Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling2022

Author(s)

Journal Title

[Journal Article] One-shot emotional voice conversion based on feature separation2022

Author(s)

Journal Title

[Presentation] Investigating the neural responses to continuous speech based on reconstructed source signal from EEG.2022

Author(s)

Organizer

[Presentation] Reconstruction of speech spectrogram based on non-invasive EEG signal.2022

Author(s)

Organizer

[Presentation] Dialogue scenario classification based on social factors2022

Author(s)

Organizer

[Presentation] Dual-stream Speech Dereverberation Network Using Long-term and Short-term Cues2022

Author(s)

Organizer

[Presentation] Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training2022

Author(s)

Organizer

党建武北陸先端科学技術大学院大学, 先端科学技術研究科, 名誉教授 (80334796)