Analysis of factors of misrecognition and study of training methods for use of speech recognition system

Research Project

Project/Area Number	25730117
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Perceptual information processing
Research Institution	Daido University
Principal Investigator	Tsuge Satoru 大同大学, 情報学部, 准教授 (00325250)
Project Period (FY)	2013-04-01 – 2017-03-31
Project Status	Completed (Fiscal Year 2016)
Budget Amount *help	¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000) Fiscal Year 2016: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000) Fiscal Year 2015: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2014: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000) Fiscal Year 2013: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords	音声認識 / 音声インターフェース / 音声認識システム訓練方法 / 誤認識要因の解明 / 音声インタフェース / 音声認識訓練システム
Outline of Final Research Achievements	The main theme of my study is the wide spread of speech recognition system. In this study, we focused on "analysis of factors of mis-recognition" and "training method for speech recognition system users". For analysis of factors of mis-recognition, I investigated the factors of mis-recognition under the noisy environment using CENSREC1 speech database. From this investigation, one of mis-recognition caused by the error of voice activation detection. In addition, I collected the speech data when the speaker uttered the speech recognition system for long term. Using these speech data, I investigated the factors of mis-recognition. On the other hand, I unfortunately have not constructed the training system for speech recognition system user. But, I have constructed some speech recognition system, which are a navigation system on potable system, a introduction system of university on personal computer, and so on.

Report

(5 results)

2016 Annual Research Report Final Research Report ( PDF )
2015 Research-status Report
2014 Research-status Report
2013 Research-status Report

Research Products
(11 results)

All 2016 2015 2014 2013

All Journal Article (4 results) (of which Peer Reviewed: 3 results) Presentation (7 results)

[Journal Article] 種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化2015
- Author(s)
  市川賢, 北岡教英, 柘植覚, 武田一哉, 北研二
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 56 Pages: 1003-1012
- NAID
  110009884094
- Related Report
  2014 Research-status Report
- Peer Reviewed
[Journal Article] 音声ドキュメント検索における種々の検討および線形補間係数を自動決定する検索質問拡張2014
- Author(s)
  柘植覚, 大橋宏正, 市川賢, 北岡教英, 武田一哉, 北研二
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 55 Pages: 1625-1636
- NAID
  110009795217
- Related Report
  2014 Research-status Report
- Peer Reviewed
[Journal Article] 話者認識におけるロバストネス2013
- Author(s)
  王龍標, 西田昌史, 柘植覚, 網野加苗
- Journal Title
  
  日本音響学会誌
  
  Volume: 69 Pages: 357-364
- Related Report
  2013 Research-status Report
[Journal Article] エネルギー変化の線形予測符号化に基づくリズム特徴量を用いた音楽印象識別2013
- Author(s)
  三好真人, 柘植覚, 福見稔
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 54 Pages: 1275-1287
- NAID
  110009579536
- Related Report
  2013 Research-status Report
- Peer Reviewed
[Presentation] Combination method air and bone conducted speech for speaker recognition in i-vector space2016
- Author(s)
  Satoru Tsuge
- Organizer
  5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan
- Place of Presentation
  Hawaii, USA
- Year and Date
  2016-11-28
- Related Report
  2016 Annual Research Report
[Presentation] AWA長期間収録音声コーパスの公開について2016
- Author(s)
  大須賀智子
- Organizer
  日本音響学会2016年秋季研究発表会講演論文集
- Place of Presentation
  富山大学
- Year and Date
  2016-09-13
- Related Report
  2016 Annual Research Report
[Presentation] STD Method Based on Hash Function for NTCIR11 SpokenQuery&Doc Task2014
- Author(s)
  Satoru Tsuge, Norihide Kitaoka, Kazuya Takeda and Kenji Kita
- Organizer
  10th NTCIR Workshop Meeting
- Place of Presentation
  Tokyo
- Year and Date
  2014-12-09 – 2014-12-12
- Related Report
  2014 Research-status Report
[Presentation] ビット演算に基づく高速な音声ドキュメント検索語検出2014
- Author(s)
  北研二, 松本和幸, 吉田稔, 柘植覚, 北岡教英, 武田一哉
- Organizer
  音声ドキュメント処理ワークショップ
- Place of Presentation
  豊橋, 愛知
- Related Report
  2013 Research-status Report
[Presentation] Missing feature theory for speaker verification with short utterances2014
- Author(s)
  Yoko Takahashi, Shingo Kuroiwa, Yasuo Horiuchi, Satoru Tsuge
- Organizer
  2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing
- Place of Presentation
  Hawaii, USA
- Related Report
  2013 Research-status Report
[Presentation] Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing2013
- Author(s)
  Ken Ichikawa, Satoru Tsuge, Norihide Kitaoka, Kazuya Takeda, and Kenji Kita
- Organizer
  2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
- Place of Presentation
  Kaoshiung, Taiwan
- Related Report
  2013 Research-status Report
[Presentation] 音声ドキュメント検索手法における拡張クエリの超平面によるモデル化と潜在意味解析の適用2013
- Author(s)
  市川賢, 柘植覚, 北岡教英, 武田一哉，北研二
- Organizer
  日本音響学会講論集
- Place of Presentation
  豊橋, 愛知
- Related Report
  2013 Research-status Report

Analysis of factors of misrecognition and study of training methods for use of speech recognition system

Principal Investigator

Tsuge Satoru 大同大学, 情報学部, 准教授 (00325250)

¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)

Report

Research Products

[Journal Article] 種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化2015

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 音声ドキュメント検索における種々の検討および線形補間係数を自動決定する検索質問拡張2014

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 話者認識におけるロバストネス2013

Author(s)

Journal Title

Related Report

[Journal Article] エネルギー変化の線形予測符号化に基づくリズム特徴量を用いた音楽印象識別2013

Author(s)

Journal Title

NAID

Related Report

[Presentation] Combination method air and bone conducted speech for speaker recognition in i-vector space2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] AWA長期間収録音声コーパスの公開について2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] STD Method Based on Hash Function for NTCIR11 SpokenQuery&Doc Task2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] ビット演算に基づく高速な音声ドキュメント検索語検出2014

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Missing feature theory for speaker verification with short utterances2014

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 音声ドキュメント検索手法における拡張クエリの超平面によるモデル化と潜在意味解析の適用2013

Author(s)

Organizer

Place of Presentation

Related Report