• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Analysis of factors of misrecognition and study of training methods for use of speech recognition system

Research Project

Project/Area Number 25730117
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation TypeMulti-year Fund
Research Field Perceptual information processing
Research InstitutionDaido University

Principal Investigator

Tsuge Satoru  大同大学, 情報学部, 准教授 (00325250)

Project Period (FY) 2013-04-01 – 2017-03-31
Project Status Completed (Fiscal Year 2016)
Budget Amount *help
¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)
Fiscal Year 2016: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2015: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2014: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
Fiscal Year 2013: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords音声認識 / 音声インターフェース / 音声認識システム訓練方法 / 誤認識要因の解明 / 音声インタフェース / 音声認識訓練システム
Outline of Final Research Achievements

The main theme of my study is the wide spread of speech recognition system. In this study, we focused on "analysis of factors of mis-recognition" and "training method for speech recognition system users". For analysis of factors of mis-recognition, I investigated the factors of mis-recognition under the noisy environment using CENSREC1 speech database. From this investigation, one of mis-recognition caused by the error of voice activation detection. In addition, I collected the speech data when the speaker uttered the speech recognition system for long term. Using these speech data, I investigated the factors of mis-recognition. On the other hand, I unfortunately have not constructed the training system for speech recognition system user. But, I have constructed some speech recognition system, which are a navigation system on potable system, a introduction system of university on personal computer, and so on.

Report

(5 results)
  • 2016 Annual Research Report   Final Research Report ( PDF )
  • 2015 Research-status Report
  • 2014 Research-status Report
  • 2013 Research-status Report
  • Research Products

    (11 results)

All 2016 2015 2014 2013

All Journal Article (4 results) (of which Peer Reviewed: 3 results) Presentation (7 results)

  • [Journal Article] 種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化2015

    • Author(s)
      市川 賢, 北岡 教英, 柘植 覚, 武田 一哉, 北 研二
    • Journal Title

      情報処理学会論文誌

      Volume: 56 Pages: 1003-1012

    • NAID

      110009884094

    • Related Report
      2014 Research-status Report
    • Peer Reviewed
  • [Journal Article] 音声ドキュメント検索における種々の検討および線形補間係数を自動決定する検索質問拡張2014

    • Author(s)
      柘植 覚, 大橋 宏正, 市川 賢, 北岡 教英, 武田 一哉, 北 研二
    • Journal Title

      情報処理学会論文誌

      Volume: 55 Pages: 1625-1636

    • NAID

      110009795217

    • Related Report
      2014 Research-status Report
    • Peer Reviewed
  • [Journal Article] 話者認識におけるロバストネス2013

    • Author(s)
      王龍標, 西田昌史, 柘植覚, 網野加苗
    • Journal Title

      日本音響学会誌

      Volume: 69 Pages: 357-364

    • Related Report
      2013 Research-status Report
  • [Journal Article] エネルギー変化の線形予測符号化に基づくリズム特徴量を用いた音楽印象識別2013

    • Author(s)
      三好 真人, 柘植 覚, 福見 稔
    • Journal Title

      情報処理学会論文誌

      Volume: 54 Pages: 1275-1287

    • NAID

      110009579536

    • Related Report
      2013 Research-status Report
    • Peer Reviewed
  • [Presentation] Combination method air and bone conducted speech for speaker recognition in i-vector space2016

    • Author(s)
      Satoru Tsuge
    • Organizer
      5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan
    • Place of Presentation
      Hawaii, USA
    • Year and Date
      2016-11-28
    • Related Report
      2016 Annual Research Report
  • [Presentation] AWA長期間収録音声コーパスの公開について2016

    • Author(s)
      大須賀智子
    • Organizer
      日本音響学会2016年秋季研究発表会講演論文集
    • Place of Presentation
      富山大学
    • Year and Date
      2016-09-13
    • Related Report
      2016 Annual Research Report
  • [Presentation] STD Method Based on Hash Function for NTCIR11 SpokenQuery&Doc Task2014

    • Author(s)
      Satoru Tsuge, Norihide Kitaoka, Kazuya Takeda and Kenji Kita
    • Organizer
      10th NTCIR Workshop Meeting
    • Place of Presentation
      Tokyo
    • Year and Date
      2014-12-09 – 2014-12-12
    • Related Report
      2014 Research-status Report
  • [Presentation] ビット演算に基づく高速な音声ドキュメント検索語検出2014

    • Author(s)
      北研二, 松本和幸, 吉田稔, 柘植覚, 北岡教英, 武田一哉
    • Organizer
      音声ドキュメント処理ワークショップ
    • Place of Presentation
      豊橋, 愛知
    • Related Report
      2013 Research-status Report
  • [Presentation] Missing feature theory for speaker verification with short utterances2014

    • Author(s)
      Yoko Takahashi, Shingo Kuroiwa, Yasuo Horiuchi, Satoru Tsuge
    • Organizer
      2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing
    • Place of Presentation
      Hawaii, USA
    • Related Report
      2013 Research-status Report
  • [Presentation] Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing2013

    • Author(s)
      Ken Ichikawa, Satoru Tsuge, Norihide Kitaoka, Kazuya Takeda, and Kenji Kita
    • Organizer
      2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
    • Place of Presentation
      Kaoshiung, Taiwan
    • Related Report
      2013 Research-status Report
  • [Presentation] 音声ドキュメント検索手法における拡張クエリの超平面によるモデル化と潜在意味解析の適用2013

    • Author(s)
      市川賢, 柘植覚, 北岡教英, 武田一哉,北研二
    • Organizer
      日本音響学会講論集
    • Place of Presentation
      豊橋, 愛知
    • Related Report
      2013 Research-status Report

URL: 

Published: 2014-07-25   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi