2010 Fiscal Year Annual Research Report

話し言葉の統計的モデル化に基づく自動整形

Research Project

Project/Area Number	21700193
Research Institution	Kyoto University
Principal Investigator	秋田祐哉京都大学, 学術情報メディアセンター, 助教 (90402742)
Keywords	話し言葉 / 音声認識 / 自動整形 / モデル化
Research Abstract	本研究課題では,話し言葉音声認識の応用性を高めるための基本・基盤技術として,話し言葉を可読性の高い整形文(書き言葉)に自動変換する手法の確立を目指している.平成22年度では,この一環である話し言葉に対する句読点の自動挿入の検討を行った.本研究の前提となる話し言葉音声認識システムの改善もあわせて図っている. 句読点の挿入手法については,昨年に引き続き講演音声の書き起こしを対象として,読点について重点的に取り組みを進めた.同一のテキストに対して異なる作業者により付与された句読点を比較し,句点については作業者間でおおむね一致する一方,読点では個人差があり,付与位置も付与数も作業者により異なることを確認している.読点は文章の多くの位置に挿入可能であるが,読みやすさのためには一定の基準に従って挿入されている必要があると考えられ,個人ごとのモデル化はその一方策といえる.このため読点に関する個人差について言語的・音響的な観点から分析を行った.具体的には読点の間隔や前後の言語表現,ポーズとの共起やその長さなどである.本研究では機械学習(CRF)を導入し,この分析に基づいて利用する素性を定めてモデルを学習し,句読点の挿入器を構成した.複数の作業者に共通する読点や,特定の作業者による読点などについてモデルを構成し評価したところ,個人差の小さい読点については比較的高い精度で挿入でき,個人的な読点もモデルの組み合わせにより精度を改善できることがわかった.この分析評価については2010年日本音響学会秋季研究発表会(2010年9月)にて発表を行っている. このほか,話し言葉音声認識システムについてはISCA Interspeech 2010(国際会議,2010年9月)や情報処理学会研究会(2010年12月)などで報告している.

Research Products
(6 results)

All 2011 2010

All Journal Article (2 results) (of which Peer Reviewed: 2 results) Presentation (4 results)

[Journal Article] Statistical transformation of language and pronunciation models for spontaneous speech recognition2010
- Author(s)
  Yuya Akita
- Journal Title
  
  IEEE Trans.Audio, Speech & Language Process.
  
  Volume: 18(6) Pages: 1539-1549
- Peer Reviewed
[Journal Article] 会議録作成支援のための国会審議の音声認識システム2010
- Author(s)
  秋田祐哉
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: J93-D(9) Pages: 1736-1744
- Peer Reviewed
[Presentation] 衆議院会議録作成における音声認識システム-言語モデル-2011
- Author(s)
  秋田祐哉
- Organizer
  日本音響学会春季研究発表会
- Place of Presentation
  早稲田大学(東京都新宿区)
- Year and Date
  2011-03-11
[Presentation] 国会音声認識システムの音響・言語モデルの半自動更新2010
- Author(s)
  秋田祐哉
- Organizer
  情報処理学会音声言語情報処理研究会
- Place of Presentation
  国立オリンピック記念青少年総合センター(東京都渋谷区)
- Year and Date
  2010-12-20
[Presentation] Semi-automated update of automatic transcription system for the Japanese national congress2010
- Author(s)
  Yuya Akita
- Organizer
  ISCA Interspeech 2010
- Place of Presentation
  幕張メッセ(千葉市)
- Year and Date
  2010-09-27
[Presentation] 講演における読点の個人的傾向のモデル化と自動挿入2010
- Author(s)
  秋田祐哉
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  関西大学(大阪府吹田市)
- Year and Date
  2010-09-14

2010 Fiscal Year Annual Research Report

話し言葉の統計的モデル化に基づく自動整形

Principal Investigator

秋田 祐哉 京都大学, 学術情報メディアセンター, 助教 (90402742)

Research Products

[Journal Article] Statistical transformation of language and pronunciation models for spontaneous speech recognition2010

Author(s)

Journal Title

[Journal Article] 会議録作成支援のための国会審議の音声認識システム2010

Author(s)

Journal Title

[Presentation] 衆議院会議録作成における音声認識システム-言語モデル-2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 国会音声認識システムの音響・言語モデルの半自動更新2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Semi-automated update of automatic transcription system for the Japanese national congress2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 講演における読点の個人的傾向のモデル化と自動挿入2010

Author(s)

Organizer

Place of Presentation

Year and Date

秋田祐哉京都大学, 学術情報メディアセンター, 助教 (90402742)