Analysis and synthesis method of phonetic/emotional information in audio-visual speech information

Research Project

Project/Area Number	24650100
Research Category	Grant-in-Aid for Challenging Exploratory Research
Allocation Type	Multi-year Fund
Research Field	Sensitivity informatics/Soft computing
Research Institution	Tohoku University
Principal Investigator	SUZUKI Yo-iti 東北大学, 電気通信研究所, 教授 (20143034)
Co-Investigator(Kenkyū-buntansha)	KAWASE Tetsuaki 東北大学, 大学院医工学研究科, 教授 (50169728) SAKAMOTO Shuichi 東北大学, 電気通信研究所, 准教授 (60332524)
Project Period (FY)	2012-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount *help	¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000) Fiscal Year 2013: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000) Fiscal Year 2012: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords	視聴覚音声知覚 / マルチモーダルインタフェース / 感性情報処理
Outline of Final Research Achievements	Moving images of a talker's face carry much information for speech understanding. Interpretation of that information is known as lip-reading. For the development of advanced multi-modal communications systems, such information should be well considered. To aim at developing such systems, we have focused on the relationship between speech sound information and moving image of talker's face. In this study, we have been particularly examining which parts of moving image of talker's face contribute most to speech understanding. We performed audio-visual speech intelligibility tests and investigated the relationship between speech intelligibility and effects of the parts of moving image of talker's face. Results of the experiments indicated that the mouth area alone provides sufficient information for speech intelligibility. The results suggested that the cue of lip-reading around the mouth might be able to generate from speech sound information.

Report

(4 results)

2014 Annual Research Report Final Research Report ( PDF )
2013 Research-status Report
2012 Research-status Report

Research Products
(11 results)

All 2014 2013

All Journal Article (5 results) (of which Acknowledgement Compliant: 1 results) Presentation (6 results) (of which Invited: 2 results)

[Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014
- Author(s)
  S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
- Journal Title
  
  Proc. the 21st International Congress on Sound and Vibration (ICSV21)
  
  Volume: -
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014
- Author(s)
  長谷川玄，坂本修一，阿部亨，大谷智子，鈴木陽一，川瀬哲明
- Journal Title
  
  日本音響学会講演論文集
  
  Volume: 2-P5-21 Pages: 641-642
- Related Report
  2013 Research-status Report
[Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014
- Author(s)
  Shuichi Sakamoto, Gen Hasegawa, Toru Abe, Tomoko Ohtani, Yo-iti Suzuki and Tetsuaki Kawase
- Journal Title
  
  Proc. the 21st International Congress on Sound and Vibration (ICSV21)
  
  Volume: -
- Related Report
  2013 Research-status Report
[Journal Article] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013
- Author(s)
  長谷川玄，坂本修一，阿部亨，大谷智子，鈴木陽一，川瀬哲明
- Journal Title
  
  日本音響学会聴覚研究会資料
  
  Volume: H-2013-102 Pages: 595-600
- NAID
  40019839445
- Related Report
  2013 Research-status Report
[Journal Article] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013
- Author(s)
  長谷川玄，坂本修一，阿部亨，大谷智子，鈴木陽一，川瀬哲明
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: HIP2013-60 Pages: 1-6
- Related Report
  2013 Research-status Report
[Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014
- Author(s)
  S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
- Organizer
  The 21st International Congress on Sound and Vibration (ICSV21)
- Place of Presentation
  Beijing, China
- Year and Date
  2014-07-13 – 2014-07-17
- Related Report
  2014 Annual Research Report
- Invited
[Presentation] Contribution of detailed parts around talker's mouth for audio-visual speech perceptio2014
- Author(s)
  S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
- Organizer
  167th Meeting of the Acoustical Society of America
- Place of Presentation
  Providence, USA
- Year and Date
  2014-05-05 – 2014-05-09
- Related Report
  2014 Annual Research Report
[Presentation] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014
- Author(s)
  長谷川玄
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学
- Related Report
  2013 Research-status Report
[Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014
- Author(s)
  Shuichi Sakamoto
- Organizer
  the 21st International Congress on Sound and Vibration (ICSV21)
- Place of Presentation
  Beijing, China
- Related Report
  2013 Research-status Report
- Invited
[Presentation] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013
- Author(s)
  長谷川玄
- Organizer
  日本音響学会聴覚研究会
- Place of Presentation
  神戸セミナーハウス
- Related Report
  2013 Research-status Report
[Presentation] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013
- Author(s)
  長谷川玄
- Organizer
  電子情報通信学会ヒューマン情報処理（HIP）研究会
- Place of Presentation
  東北大学電気通信研究所
- Related Report
  2013 Research-status Report

Analysis and synthesis method of phonetic/emotional information in audio-visual speech information

Principal Investigator

SUZUKI Yo-iti 東北大学, 電気通信研究所, 教授 (20143034)

¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000)

Report

Research Products

[Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

Author(s)

Journal Title

Related Report

[Journal Article] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014

Author(s)

Journal Title

Related Report

[Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

Author(s)

Journal Title

Related Report

[Journal Article] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013

Author(s)

Journal Title

Related Report

[Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Contribution of detailed parts around talker's mouth for audio-visual speech perceptio2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013

Author(s)

Organizer

Place of Presentation

Related Report