• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Analysis and synthesis method of phonetic/emotional information in audio-visual speech information

Research Project

Project/Area Number 24650100
Research Category

Grant-in-Aid for Challenging Exploratory Research

Allocation TypeMulti-year Fund
Research Field Sensitivity informatics/Soft computing
Research InstitutionTohoku University

Principal Investigator

SUZUKI Yo-iti  東北大学, 電気通信研究所, 教授 (20143034)

Co-Investigator(Kenkyū-buntansha) KAWASE Tetsuaki  東北大学, 大学院医工学研究科, 教授 (50169728)
SAKAMOTO Shuichi  東北大学, 電気通信研究所, 准教授 (60332524)
Project Period (FY) 2012-04-01 – 2015-03-31
Project Status Completed (Fiscal Year 2014)
Budget Amount *help
¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000)
Fiscal Year 2013: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Fiscal Year 2012: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords視聴覚音声知覚 / マルチモーダルインタフェース / 感性情報処理
Outline of Final Research Achievements

Moving images of a talker's face carry much information for speech understanding. Interpretation of that information is known as lip-reading. For the development of advanced multi-modal communications systems, such information should be well considered. To aim at developing such systems, we have focused on the relationship between speech sound information and moving image of talker's face. In this study, we have been particularly examining which parts of moving image of talker's face contribute most to speech understanding. We performed audio-visual speech intelligibility tests and investigated the relationship between speech intelligibility and effects of the parts of moving image of talker's face. Results of the experiments indicated that the mouth area alone provides sufficient information for speech intelligibility. The results suggested that the cue of lip-reading around the mouth might be able to generate from speech sound information.

Report

(4 results)
  • 2014 Annual Research Report   Final Research Report ( PDF )
  • 2013 Research-status Report
  • 2012 Research-status Report
  • Research Products

    (11 results)

All 2014 2013

All Journal Article (5 results) (of which Acknowledgement Compliant: 1 results) Presentation (6 results) (of which Invited: 2 results)

  • [Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

    • Author(s)
      S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
    • Journal Title

      Proc. the 21st International Congress on Sound and Vibration (ICSV21)

      Volume: -

    • Related Report
      2014 Annual Research Report
    • Acknowledgement Compliant
  • [Journal Article] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014

    • Author(s)
      長谷川玄,坂本修一,阿部亨,大谷智子,鈴木陽一,川瀬哲明
    • Journal Title

      日本音響学会講演論文集

      Volume: 2-P5-21 Pages: 641-642

    • Related Report
      2013 Research-status Report
  • [Journal Article] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

    • Author(s)
      Shuichi Sakamoto, Gen Hasegawa, Toru Abe, Tomoko Ohtani, Yo-iti Suzuki and Tetsuaki Kawase
    • Journal Title

      Proc. the 21st International Congress on Sound and Vibration (ICSV21)

      Volume: -

    • Related Report
      2013 Research-status Report
  • [Journal Article] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013

    • Author(s)
      長谷川玄,坂本修一,阿部亨,大谷智子,鈴木陽一,川瀬哲明
    • Journal Title

      日本音響学会聴覚研究会資料

      Volume: H-2013-102 Pages: 595-600

    • NAID

      40019839445

    • Related Report
      2013 Research-status Report
  • [Journal Article] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013

    • Author(s)
      長谷川玄,坂本修一,阿部亨,大谷智子,鈴木陽一,川瀬哲明
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: HIP2013-60 Pages: 1-6

    • Related Report
      2013 Research-status Report
  • [Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

    • Author(s)
      S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
    • Organizer
      The 21st International Congress on Sound and Vibration (ICSV21)
    • Place of Presentation
      Beijing, China
    • Year and Date
      2014-07-13 – 2014-07-17
    • Related Report
      2014 Annual Research Report
    • Invited
  • [Presentation] Contribution of detailed parts around talker's mouth for audio-visual speech perceptio2014

    • Author(s)
      S. Sakamoto, G. Hasegawa, T. Abe, T. Ohtani, Y. Suzuki and T. Kawase
    • Organizer
      167th Meeting of the Acoustical Society of America
    • Place of Presentation
      Providence, USA
    • Year and Date
      2014-05-05 – 2014-05-09
    • Related Report
      2014 Annual Research Report
  • [Presentation] 口唇以外の話者映像情報が無意味3連音節を用いた音声明瞭度に与える影響2014

    • Author(s)
      長谷川玄
    • Organizer
      日本音響学会2014年春季研究発表会
    • Place of Presentation
      日本大学
    • Related Report
      2013 Research-status Report
  • [Presentation] The contribution of the detailed parts around talker's mouth for speech intelligibility2014

    • Author(s)
      Shuichi Sakamoto
    • Organizer
      the 21st International Congress on Sound and Vibration (ICSV21)
    • Place of Presentation
      Beijing, China
    • Related Report
      2013 Research-status Report
    • Invited
  • [Presentation] 無意味3連音節を用いた音素別明瞭度における視覚情報の寄与の分析2013

    • Author(s)
      長谷川玄
    • Organizer
      日本音響学会聴覚研究会
    • Place of Presentation
      神戸セミナーハウス
    • Related Report
      2013 Research-status Report
  • [Presentation] 無意味3連音節を用いた音素別明瞭度における話者映像の寄与の分析2013

    • Author(s)
      長谷川玄
    • Organizer
      電子情報通信学会ヒューマン情報処理(HIP)研究会
    • Place of Presentation
      東北大学電気通信研究所
    • Related Report
      2013 Research-status Report

URL: 

Published: 2013-05-31   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi