2009 Fiscal Year Annual Research Report

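Research on a Speech Understanding Framework Based on Tight Inter-Layer Integration of Multi-Layer Models (多層モデルの階層間密統合に基づく音声理解フレームワークの研究)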

Research Project

Project/Area Number	21300066
Research Institution	Nagoya Institute of Technology
Principal Investigator	李晃伸 Nagoya Institute of Technology, Graduate School of Engineering, Associate Professor (80332766)
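Co-Investigator(Kenkyū-buntansha)	秋田祐哉 Kyoto University, Academic Center for Computing and Media Studies, Assistant Professor (90402742); 駒谷和範 Kyoto University, Graduate School of Informatics, Assistant Professor (40362579); 西村竜一 Wakayama University, Faculty of Systems Engineering, Assistant Professor (00379611); 篠崎隆宏 Tokyo Institute of Technology, Graduate School of Information Science and Engineering, Assistant Professor (80447903); 西田昌史 Doshisha University, Faculty of Science and Engineering, Associate Professor (80361442); 南條浩輝 Ryukoku University, Faculty of Science and Technology, Assistant Professor (50388162)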
Keywords	Spoken language understanding / Spoken dialogue / Speech recognition
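Research Abstract	The aim of this project is to clarify the practical relationships and statistical properties of each layer in spoken language understanding, from signal processing through semantic understanding to the user model, and on that basis to integrate the processing tightly and probabilistically, thereby realizing more advanced and flexible spoken language processing and more sophisticated spoken language interfaces. In FY2009 (H21), the first year of the research plan, we mainly studied each layer individually, methods for expressing constraints probabilistically, and the feasibility of connecting each layer to the others. (1) For the survey of constraint conditions, all members held intensive research meetings in August and December and discussed the internal and external specifications of each layer, future directions, and targets. (2) For language models, we examined methods for dynamically incorporating into the search process the knowledge obtained from research on automatic transcription of spontaneous speech, such as National Diet speech and lectures, including the readability of recognition results and the situation and topic. We also investigated the topic dependence of language models across a wide range of tasks. (3) For Bayes-risk-minimizing search, we proposed and verified word importance estimation and simulation of recognition errors in order to improve the accuracy of the external constraints incorporated into the minimization search. (4) For acoustic models, we examined the robustness of acoustic models built from the Corpus of Spontaneous Japanese (CSJ) on different tasks and investigated the relationship between task and acoustic model. (5) For the spoken dialogue system for data collection, we developed and improved a Web-based speech input application for large-scale, broad, interactive data collection. (6) For statistical user models and statistical dialogue models, we studied a method that reflects the user's utterance history and the degree of match with expected sentence patterns (grammar) in the selection of response candidates, as well as spoken dialogue systems based on the POMDP (partially observable Markov decision process), which has recently attracted attention as a statistical dialogue management method.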
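As an illustration of item 3 of the abstract, the following is a minimal sketch of minimum-Bayes-risk selection over an N-best list, with a loss that weights each word error by a word importance value. It is not the project's implementation: the function names, the N-best format (word list plus posterior score), the default importance of 1.0, and the choice of a weighted edit distance as the loss are all assumptions made for this example.

```python
from math import inf
from typing import Dict, List, Sequence, Tuple


def weighted_edit_distance(ref: Sequence[str], hyp: Sequence[str],
                           weight: Dict[str, float]) -> float:
    """Edit distance in which each error costs the importance of the word
    involved (hypothetical weighting; unknown words default to 1.0)."""
    def w(word: str) -> float:
        return weight.get(word, 1.0)

    # Row for the empty reference: every hypothesis word is an insertion.
    prev = [0.0]
    for h in hyp:
        prev.append(prev[-1] + w(h))

    for r in ref:
        cur = [prev[0] + w(r)]  # deleting r against an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            substitute = prev[j - 1] + (0.0 if r == h else max(w(r), w(h)))
            delete = prev[j] + w(r)
            insert = cur[j - 1] + w(h)
            cur.append(min(substitute, delete, insert))
        prev = cur
    return prev[-1]


def mbr_select(nbest: List[Tuple[List[str], float]],
               weight: Dict[str, float]) -> List[str]:
    """Return the hypothesis with the lowest expected loss, treating the
    normalized N-best scores as the posterior over candidate references."""
    total = sum(score for _, score in nbest)
    best, best_risk = nbest[0][0], inf
    for hyp, _ in nbest:
        risk = sum(score / total * weighted_edit_distance(ref, hyp, weight)
                   for ref, score in nbest)
        if risk < best_risk:
            best, best_risk = hyp, risk
    return best


# Toy N-best list (invented data): the top-scoring hypothesis is not the
# one with the lowest expected loss.
nbest = [
    (["play", "jazz"], 0.4),
    (["play", "jazz", "music"], 0.3),
    (["pay", "jazz", "music"], 0.3),
]
chosen = mbr_select(nbest, weight={})
```

With uniform weights the second hypothesis is selected, because it is closest on average to the other candidates even though it does not have the highest score. Raising the importance of a word makes errors on that word dominate the expected loss.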

Research Products
(14 results)

[Presentation] ユーザの文法知識を状態に加えたPOMDPに基づく音声対話システム2010
- Author(s)
  穐山空道, 駒谷和範, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第72回全国大会, 5U-9
- Place of Presentation
  東京大学
- Year and Date
  2010-03-10
[Presentation] 国会音声における認識文と整形過程の分析2010
- Author(s)
  秋田祐哉
- Organizer
  日本音響学会春季研究発表会
- Place of Presentation
  電気通信大学 (東京都調布市)
- Year and Date
  2010-03-09
[Presentation] 音声入力型情報検索における単語重要度推定のための統計的機械翻訳を用いた音声認識シミュレート2010
- Author(s)
  七里崇, 南條浩輝, 吉見毅彦
- Organizer
  日本音響学会春季研究発表会
- Place of Presentation
  電気通信大学 (東京都調布市)
- Year and Date
  2010-03-09
[Presentation] 日本語話し言葉コーパスを用いた異なるタスクに対する音声認識2010
- Author(s)
  西井俊介, 篠崎隆宏, et al.
- Organizer
  日本音響学会
- Place of Presentation
  電気通信大学 (調布市)
- Year and Date
  2010-03-08
[Presentation] Google DBを用いたトピック特化型N-gramモデル補完の検討2010
- Author(s)
  島田敏明, 西村竜一, et al.
- Organizer
  日本音響学会2010年春季研究発表会講演論文集, pp. 177-178
- Place of Presentation
  電気通信大学
- Year and Date
  2010-03-08
[Presentation] 統計的機械翻訳による音声データを用いない音声認識のシミュレートの検討2009
- Author(s)
  七里崇, 南條浩輝
- Organizer
  日本音響学会第12回関西支部若手研究者交流研究発表会
- Place of Presentation
  関西大学100周年記念会館
- Year and Date
  2009-12-05
[Presentation] Topic-Dependent Language Modeling for VoiceWeb Systems2009
- Author(s)
  鈴田健太郎, 西村竜一, et al.
- Organizer
  WESPAC X 2009, paper-id : 0223
- Place of Presentation
  Beijing, China
- Year and Date
  2009-09-23
[Presentation] 音声Webインタフェースを用いて収集した実環境発話の分析2009
- Author(s)
  鈴田健太郎, 西村竜一
- Organizer
  日本音響学会2009年秋季研究発表会講演論文集, pp. 125-126
- Place of Presentation
  日本大学
- Year and Date
  2009-09-17
[Presentation] 音声Webインタフェースを用いて収集した実環境発話の分析2009
- Author(s)
  鈴田健太郎, 西村竜一
- Organizer
  日本音響学会2009年秋季研究発表会講演論文集, pp. 125-126
- Place of Presentation
  日本大学
- Year and Date
  2009-09-17
[Presentation] 講演の書き起こしに対する読点の自動挿入2009
- Author(s)
  秋田祐哉
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  日本大学 (福島県郡山市)
- Year and Date
  2009-09-16
[Presentation] Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems2009
- Author(s)
  Kazunori Komatani, and 4 others
- Organizer
  10th Annual SIGDIAL Meeting on Discourse and Dialogue
- Place of Presentation
  London, UK
- Year and Date
  2009-09-12
[Presentation] Automatic Transcription System for Meetings of the Japanese National Congress2009
- Author(s)
  Yuya Akita
- Organizer
  ISCA Interspeech 2009
- Place of Presentation
  Brighton Centre, Brighton, UK
- Year and Date
  2009-09-07
[Presentation] Development of Speech Input Method for Interactive Voice Web Systems2009
- Author(s)
  西村竜一
- Organizer
  HCI International 2009, vol. 5611, pp. 710-719
- Place of Presentation
  San Diego, CA, USA
- Year and Date
  2009-07-22
[Presentation] 音声対話システムにおける文法検証結果と発話履歴に基づくヘルプメッセージ候補のランキング2009
- Author(s)
  駒谷和範, 池田智志, 福林雄一朗, 尾形哲也, 奥乃博
- Organizer
  情報処理学会音声言語情報処理研究会 (SIG-SLP), 2009-SLP-77-12
- Place of Presentation
  福島県飯坂温泉・飯坂ホテル聚楽
- Year and Date
  2009-07-18
