A Method for Speech Analysis Based on a Visual-to-Auditory Feedback Mechanism

Research Project

Project/Area Number 21680016
Research Category

Grant-in-Aid for Young Scientists (A)

Allocation Type Single-year Grants
Research Field Perception information processing/Intelligent robotics
Research Institution Kyoto University

Principal Investigator

KAWASHIMA Hiroaki  Kyoto University, Graduate School of Informatics, Lecturer (40346101)

Project Period (FY) 2009 – 2010
Project Status Completed (Fiscal Year 2010)
Budget Amount
¥8,710,000 (Direct Cost: ¥6,700,000, Indirect Cost: ¥2,010,000)
Fiscal Year 2010: ¥2,730,000 (Direct Cost: ¥2,100,000, Indirect Cost: ¥630,000)
Fiscal Year 2009: ¥5,980,000 (Direct Cost: ¥4,600,000, Indirect Cost: ¥1,380,000)
Keywords speech estimation and separation / lip motion / linear systems / hybrid systems / timing structure / audio-visual integration / multimodality / time-series segmentation
Research Abstract

We have developed a novel speech-analysis method based on detailed modeling of the temporal relationship between mouth movements and speech signals. First, we use a hybrid system, an integrated model of dynamical systems and discrete-event systems, as a mathematical tool to segment and model multimedia signals such as captured mouth motion and speech data. Then, we build a statistical cross-media timing model that can be learned from the segmented data. The proposed method realizes a signal-generation mechanism "from mouth motion to speech", which enables highly accurate speech estimation in non-stationary noise environments.
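
As an illustration only, the cross-media timing idea can be sketched in Python. This is not the project's implementation: it assumes both signals have already been segmented into labeled intervals (the hybrid-system segmentation step is not shown), and it summarizes the timing between overlapping mouth-motion and speech intervals with a simple mean and standard deviation. The function names, mode labels, and toy data are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev

# An interval is (mode_label, start_time, end_time), with times in seconds.


def overlapping_pairs(visual, audio):
    """Yield every (visual_interval, audio_interval) pair that overlaps in time."""
    for v in visual:
        for a in audio:
            if v[1] < a[2] and a[1] < v[2]:
                yield v, a


def learn_timing_model(visual, audio):
    """Collect start/end time differences (audio minus visual) for each pair
    of mode labels and summarize them with a mean and a population std."""
    diffs = defaultdict(list)
    for v, a in overlapping_pairs(visual, audio):
        diffs[(v[0], a[0])].append((a[1] - v[1], a[2] - v[2]))

    model = {}
    for key, pairs in diffs.items():
        starts = [d[0] for d in pairs]
        ends = [d[1] for d in pairs]
        model[key] = {
            "start_mean": mean(starts),
            "start_std": pstdev(starts),
            "end_mean": mean(ends),
            "end_std": pstdev(ends),
            "count": len(pairs),
        }
    return model


# Hypothetical, hand-made segmentations of a short utterance.
visual = [("open", 0.00, 0.40), ("close", 0.40, 0.80), ("open", 0.80, 1.20)]
audio = [("voiced", 0.10, 0.50), ("silence", 0.50, 0.90), ("voiced", 0.90, 1.30)]

model = learn_timing_model(visual, audio)
for key in sorted(model):
    print(key, model[key])
```

In this toy data the "voiced" intervals start about 0.1 s after the overlapping "open" mouth intervals; a learned offset of this kind is what would let mouth motion constrain when speech segments are expected to begin and end.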

Report

(3 results)
  • 2010 Annual Research Report, Final Research Report (PDF)
  • 2009 Annual Research Report

Research Products

(7 results)

  • [Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems (2010)

    • Author(s)
      H.Kawashima
    • Organizer
      Workshop on Human Communication Dynamics (NIPS WS)
    • Place of Presentation
      Whistler, Canada
    • Year and Date
      2010-12-10
    • Related Report
      2010 Final Research Report
  • [Presentation] Interval-based Modeling of Human Communication Dynamics via Hybrid Dynamical Systems (2010)

    • Author(s)
      Hiroaki Kawashima
    • Organizer
      Workshop on Human Communication Dynamics (NIPS WS)
    • Place of Presentation
      Whistler, Canada
    • Year and Date
      2010-12-10
    • Related Report
      2010 Annual Research Report
  • [Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structures Between Mouth Movements and Sound Signals (2010)

    • Author(s)
      H.Kawashima
    • Organizer
      Interspeech 2010
    • Place of Presentation
      Makuhari, Japan
    • Year and Date
      2010-09-27
    • Related Report
      2010 Final Research Report
  • [Presentation] Speech Estimation in Non-Stationary Noise Environments Using Timing Structure between Mouth Movements and Sound Signals (2010)

    • Author(s)
      Hiroaki Kawashima
    • Organizer
      Interspeech
    • Place of Presentation
      Makuhari, Chiba
    • Year and Date
      2010-09-27
    • Related Report
      2010 Annual Research Report
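  • [Presentation] Speech Estimation in Non-Stationary Noise Environments Using the Timing Structure between Lip Motion and Speech (2010)

    • Author(s)
      Hiroaki Kawashima
    • Organizer
      13th Meeting on Image Recognition and Understanding (MIRU)
    • Place of Presentation
      Kushiro, Hokkaido
    • Year and Date
      2010-07-29
    • Related Report
      2010 Annual Research Report, 2010 Final Research Report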
  • [Remarks] Homepage

    • URL

      http://vision.kuee.kyoto-u.ac.jp/~hiroaki/research/

    • Related Report
      2010 Final Research Report
  • [Remarks]

    • URL

      http://vision.kuee.kyoto-u.ac.jp/~hiroaki/research/

    • Related Report
      2010 Annual Research Report

Published: 2009-04-01   Modified: 2016-04-21  
