Understanding Intentions and Situations in Multi-Speaker Spoken Communication

Research Project

Project/Area Number	13224057
Research Category	Grant-in-Aid for Scientific Research on Priority Areas (C)
Allocation Type	Single-year Grants
Review Section	Science and Engineering
Research Institution	Kyoto University
Principal Investigator	河原達也 (T. Kawahara), Kyoto University, Graduate School of Informatics, Associate Professor (00234104)
Co-Investigator (Kenkyū-buntansha)	岡田美智男, ATR, Media Information Science Laboratories, Senior Researcher
Project Period (FY)	2001
Project Status	Completed (Fiscal Year 2001)
Keywords	Speech information processing / Speech recognition / Intention understanding / Spoken language / Automatic indexing / Archive
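Research Abstract	We collected speech data from human-to-human interaction such as meetings and conversations, and modeled it from acoustic, linguistic, and communicative perspectives. First, for meeting speech, we designed a system that builds a hierarchical archive and supports the preparation of minutes. Speaker identification is performed with GMMs; the speech is segmented according to the result, and indexes such as speaker IDs and time information are generated. In addition, utterances that state the conclusions of a discussion are identified by detecting key phrases that include discourse markers. These utterances are transcribed automatically, using topic-dependent vocabulary taken from the agenda and the meeting handouts, to produce a draft of the minutes. The result is an archive with three layers: speech, index, and text. Next, we applied this discourse-marker-based automatic indexing to a large-scale corpus of lecture speech and evaluated it. From the transcripts of the training lectures, section boundary candidates are detected using pause information, periods are inserted using a statistical language model, and the first sentence of each section is extracted. Discourse markers are then selected from the nouns in those sentences on the basis of word frequency and sentence frequency. These steps are carried out by unsupervised learning and require no manual tagging. For each sentence in the evaluation data, a score is computed from the word-frequency and sentence-frequency statistics of the discourse markers it contains, and an index is assigned if the total is at or above a threshold. In evaluations on both manual transcripts and speech recognition results of real lecture speech, topic section boundaries were detected automatically with a recall of about 85% (at a precision of about 20%).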
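The indexing procedure described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the scoring formula, so the tf-idf-like weight used here (frequency in section-initial sentences times the log of the inverse sentence frequency) is a stand-in, not the project's actual measure. The function names, the toy English tokens, and the threshold value are all hypothetical, and every token is treated as a candidate rather than nouns only.

```python
import math
from collections import Counter

def marker_weights(first_sentences, all_sentences):
    """Weight words that are frequent in section-initial sentences
    but appear in few sentences overall (assumed tf-idf-like formula)."""
    # Word frequency counted over section-initial sentences only.
    tf = Counter(w for s in first_sentences for w in s)
    # Sentence frequency: number of sentences that contain the word.
    sf = Counter(w for s in all_sentences for w in set(s))
    n = len(all_sentences)
    return {w: tf[w] * math.log(n / max(sf[w], 1)) for w in tf}

def sentence_score(sentence, weights):
    """Sum the weights of the distinct marker words in one sentence."""
    return sum(weights.get(w, 0.0) for w in set(sentence))

def index_boundaries(sentences, weights, threshold):
    """Return positions of sentences whose score reaches the threshold."""
    return [i for i, s in enumerate(sentences)
            if sentence_score(s, weights) >= threshold]

def recall_precision(predicted, reference):
    """Recall and precision of predicted boundaries against a reference."""
    hits = len(set(predicted) & set(reference))
    recall = hits / len(reference) if reference else 0.0
    precision = hits / len(predicted) if predicted else 0.0
    return recall, precision

# Toy "training" transcript, already split into tokenized sentences.
train = [
    ["first", "i", "describe", "the", "background"],
    ["speech", "recognition", "is", "hard"],
    ["errors", "are", "frequent"],
    ["next", "i", "describe", "the", "method"],
    ["the", "model", "is", "trained"],
    ["finally", "i", "describe", "the", "results"],
    ["the", "recall", "is", "high"],
]
# Assumed to come from pause-based section boundary candidates.
section_starts = [0, 3, 5]
weights = marker_weights([train[i] for i in section_starts], train)

# Toy "evaluation" transcript; sentence 0 is the only true section start.
test_sentences = [
    ["now", "i", "describe", "the", "evaluation"],
    ["the", "model", "is", "small"],
    ["the", "results", "are", "good"],
]
predicted = index_boundaries(test_sentences, weights, threshold=2.0)
recall, precision = recall_precision(predicted, reference=[0])
print(predicted, recall, precision)
```

With this toy data a low threshold finds the true boundary but also flags one extra sentence, which mirrors in miniature the high-recall, low-precision operating point reported in the abstract.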

Report

(1 results)

2001 Annual Research Report

Research Products
(7 results)

[Publications] T.Kawahara: "Automatic transcription of spontaneous lecture speech" IEEE Workshop on Automatic Speech Recognition and Understanding. (2001)
- Related Report
  2001 Annual Research Report
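[Publications] 河原達也: "Overview and evaluation of the FY2000 software of the Continuous Speech Recognition Consortium (連続音声認識コンソーシアム2000年度版ソフトウェアの概要と評価)" IPSJ SIG Technical Report. SLP-38-6. (2001)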
- Related Report
  2001 Annual Research Report
[Publications] 長谷川将宏: "Automatic indexing of lecture speech based on extraction of discourse markers (談話標識の抽出に基づいた講演音声の自動インデキシング)" IPSJ SIG Technical Report. SLP-36-6. (2001)
- Related Report
  2001 Annual Research Report
[Publications] M.Mimura: "Difference of acoustic modeling for read speech and dialogue speech" Acoustical Science & Technology. 22. 373-374 (2001)
- Related Report
  2001 Annual Research Report
[Publications] K.Komatani: "Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model" Proc. EUROSPEECH. 1319-1322 (2001)
- Related Report
  2001 Annual Research Report
[Publications] A.Lee: "Gaussian mixture selection using context-independent HMM" Proc. IEEE-ICASSP. 1. 69-72 (2001)
- Related Report
  2001 Annual Research Report
[Publications] 鹿野清宏: "Speech Recognition Systems (音声認識システム)" Ohmsha. 200 (2001)
- Related Report
  2001 Annual Research Report
