音源分離を規範とした音声認識手法に関する研究

Research Project

Project/Area Number	17650047
Research Category	Grant-in-Aid for Exploratory Research
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	Japan Advanced Institute of Science and Technology
Principal Investigator	赤木正人 Japan Advanced Institute of Science and Technology, 情報科学研究科, 教授 (20242571)
Co-Investigator(Kenkyū-buntansha)	鵜木祐史北陸先端科学技術大学院大学, 情報科学研究科, 准教授 (00343187)
Project Period (FY)	2005 – 2007
Project Status	Completed (Fiscal Year 2007)
Budget Amount *help	¥3,200,000 (Direct Cost: ¥3,200,000) Fiscal Year 2007: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 2006: ¥1,100,000 (Direct Cost: ¥1,100,000) Fiscal Year 2005: ¥1,200,000 (Direct Cost: ¥1,200,000)
Keywords	音声認識 / 音源分離 / 雑音 / パターン認識法 / 計算論的音情景解析 / 仮説検証
Research Abstract	本研究では,雑音をモデル化する必要がなくどのような雑音にも対処可能な音声認識手法の新しい枠組みを提案する。具体的には,申請者らが提案した音源分離手法を認識規範として,従来の枠組みにとらわれない認識法を提案する。このため,1年目は,提案する手法によって高精度の音声認識が可能かどうかの詳細な議論を行った。提案手法では,認識対象に関する情報をtop-down的に音源分離部へ与え,この情報を用いて分離が完了する(認識が行える)かどうかを観測する。比較的定常な楽器音の分離処理ではこの手法は成功しているが,母音系列に対しても適用可能かどうかの検討を行った。その結果,音声認識に使用可能であるという結論を得た。 2年目は,変化の激しい音声,特に単語に対して認識が行えるかどうかについて検討した。その結果,従来法であるスペクトルサブトラクション法とか音響モデル適応法に比較して誤り率が数十%減少した。これらを受けて,最終年度である本年度は,提案手法の有効性を検証するために,定常、非定常雑音環境で日本語数字認識を実施した。現有の手法である雑音抑圧前処理,および,参照パターン適応による認識結果との比較を行った結果,提案手法を用いたASRシステムは,どの雑音環境においても既存の手法の認識率を上回った。これは,提案手法が選択的音源分離を評価して得られた目的音の存在確度を用いて認識するため,また,提案手法が雑音モデルを一切用いていないためと考えられる。このことは,提案手法が様々な雑音環境で頑健に認識できる可能性があり,ASRシステムの頑健性向上の手法として有益である事を示すものである。

Report

(3 results)

Research Products
(7 results)

All 2008 2007 2006 2005

All Journal Article (5 results) (of which Peer Reviewed: 3 results) Presentation (2 results)

[Journal Article] A speech recognition method based on the selective sound segregation in vanous noisy environments2008
- Author(s)
  Atsushi Haniu, Masashi Unoki, and Masato Akagi
- Journal Title
  
  Proc.NCSP08
  
  Pages: 168-171
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] A study on a speech recognition method based on the selective sound segregation in various noisy environments2007
- Author(s)
  Atsushi Haniu, Masashi Unoki, and Masato Akagi
- Journal Title
  
  Proc.NOLTA2007
  
  Pages: 445-448
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] A study on a speech recognition method based on the selective sound segregation in noisy environment2007
- Author(s)
  Atsushi Haniu, Masashi Unoki, and Masato Akagi
- Journal Title
  
  Proc.JCA2007 (CD-ROM)
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] A Model-Concept of the Selective Sound Segregation : -A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed bound of Various Instruments -2006
- Author(s)
  Unoki M., Kubo M., Haniu, A., Akagi, M.
- Journal Title
  
  Journal of Signal Processing 10,6
  
  Pages: 419-431
- Related Report
  2006 Annual Research Report
[Journal Article] A model for selective segregation of a target instrument sound from the mixed sound of various instruments2005
- Author(s)
  Unoki, M., Kubo, M., Haniu, A., Akagi, M.
- Journal Title
  
  Proc.EuroSpeech2005
  
  Pages: 2097-2100
- Related Report
  2005 Annual Research Report
[Presentation] Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments2008
- Author(s)
  Atsushi Haniu, Masashi Unoki, and Masato Akagi
- Organizer
  Asian Student workshop
- Place of Presentation
  Tokyo
- Year and Date
  2008-03-20
- Related Report
  2007 Annual Research Report
[Presentation] 雑音環境における選択的音源分離を規範とした音声認識2008
- Author(s)
  羽二生篤, 鵜木祐史, 赤木正人
- Organizer
  日本音響学会平成20年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-17
- Related Report
  2007 Annual Research Report

音源分離を規範とした音声認識手法に関する研究

Principal Investigator

赤木 正人 Japan Advanced Institute of Science and Technology, 情報科学研究科, 教授 (20242571)

¥3,200,000 (Direct Cost: ¥3,200,000)

Report

Research Products

[Journal Article] A speech recognition method based on the selective sound segregation in vanous noisy environments2008

Author(s)

Journal Title

Related Report

[Journal Article] A study on a speech recognition method based on the selective sound segregation in various noisy environments2007

Author(s)

Journal Title

Related Report

[Journal Article] A study on a speech recognition method based on the selective sound segregation in noisy environment2007

Author(s)

Journal Title

Related Report

[Journal Article] A Model-Concept of the Selective Sound Segregation : -A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed bound of Various Instruments -2006

Author(s)

Journal Title

Related Report

[Journal Article] A model for selective segregation of a target instrument sound from the mixed sound of various instruments2005

Author(s)

Journal Title

Related Report

[Presentation] Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 雑音環境における選択的音源分離を規範とした音声認識2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

赤木正人 Japan Advanced Institute of Science and Technology, 情報科学研究科, 教授 (20242571)