A Method for Speech Analysis Based on a Visual-to-Auditory Feedback Mechanism

Research Project

Project/Area Number	21680016
Research Category	Grant-in-Aid for Young Scientists (A)
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	Kyoto University
Principal Investigator	KAWASHIMA Hiroaki Kyoto University, 情報学研究科, 講師 (40346101)
Project Period (FY)	2009 – 2010
Project Status	Completed (Fiscal Year 2010)
Budget Amount *help	¥8,710,000 (Direct Cost: ¥6,700,000、Indirect Cost: ¥2,010,000) Fiscal Year 2010: ¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000) Fiscal Year 2009: ¥5,980,000 (Direct Cost: ¥4,600,000、Indirect Cost: ¥1,380,000)
Keywords	音声推定・分離 / 口唇運動 / 線形システム / ハイブリッドシステム / タイミング構造 / 視聴覚統合 / マルチモダリティ / 時系列の分節化
Research Abstract	We have developed a novel speech-analysis method based on the detail modeling of temporal relationship between mouth movements and speech signals. First, we use a hybrid system, which is an integrated model of dynamical systems and discrete-event systems, as a mathematical tool to segment and model multimedia signals such as captured mouth motion and speech data. Then, we build a statistical cross-media timing model that can be learned from those segmented data. The proposed method realizes the mechanism of signal generation "from mouth motion to speech", which enables highly accurate speech estimation in non-stationary noise environment.

Report

(3 results)

2010 Annual Research Report Final Research Report ( PDF )
2009 Annual Research Report

Research Products
(7 results)

All 2010 Other

All Presentation (5 results) Remarks (2 results)

[Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems2010
- Author(s)
  H.Kawashima
- Organizer
  Workshop on Human Communication Dynamics (NIPS WS)
- Place of Presentation
  Whistler Canada
- Year and Date
  2010-12-10
- Related Report
  2010 Final Research Report
[Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems2010
- Author(s)
  Hiroaki Kawashima
- Organizer
  Workshop on Human Communication Dynamics (NIPS WS)
- Place of Presentation
  カナダ(ウィスラー)
- Year and Date
  2010-12-10
- Related Report
  2010 Annual Research Report
[Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structures Between Mouth Movements and Sound Signals2010
- Author(s)
  H.Kawashima
- Organizer
  Interspeech2010
- Place of Presentation
  Makuhari Japan
- Year and Date
  2010-09-27
- Related Report
  2010 Final Research Report
[Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structure between Mouth Movements and Sound Signals2010
- Author(s)
  Hiroaki Kawashima
- Organizer
  Interspeech
- Place of Presentation
  千葉(幕張)
- Year and Date
  2010-09-27
- Related Report
  2010 Annual Research Report
[Presentation] 口唇運動-音声間のタイミング構造を利用した非定常雑音環境での発話音声推定2010
- Author(s)
  川嶋宏彰
- Organizer
  第13回画像の認識・理解シンポジウム(MIRU)
- Place of Presentation
  北海道(釧路)
- Year and Date
  2010-07-29
- Related Report
  2010 Annual Research Report 2010 Final Research Report
[Remarks] ホームページ
- URL
  http://vision.kuee.kyoto-u.ac.jp/~hiroaki/research/
- Related Report
  2010 Final Research Report
[Remarks]
- URL
  http://vision.kuee.kyoto-u.ac.jp/~hiroaki/research/
- Related Report
  2010 Annual Research Report

A Method for Speech Analysis Based on a Visual-to-Auditory Feedback Mechanism

Principal Investigator

KAWASHIMA Hiroaki Kyoto University, 情報学研究科, 講師 (40346101)

¥8,710,000 (Direct Cost: ¥6,700,000、Indirect Cost: ¥2,010,000)

Report

Research Products

[Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structures Between Mouth Movements and Sound Signals2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structure between Mouth Movements and Sound Signals2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 口唇運動-音声間のタイミング構造を利用した非定常雑音環境での発話音声推定2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Remarks] ホームページ

URL

Related Report

[Remarks]

URL

Related Report