Speech Recognition for Computer-Supported Conference Systems under Ubiquitous/Wearable Computing Environment

Research Project

Project/Area Number 12480083
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution Tokyo Institute of Technology

Principal Investigator

FURUI Sadaoki  Tokyo Institute of Technology, Graduate School of Information Science and Engineering, Department of Computer Science, Professor (90293076)

Co-Investigator (Kenkyū-buntansha) IWANO Koji  Tokyo Institute of Technology, Graduate School of Information Science and Engineering, Assistant Professor (90323823)
Project Period (FY) 2000 – 2002
Project Status Completed (Fiscal Year 2002)
Budget Amount
¥2,900,000 (Direct Cost: ¥2,900,000)
Fiscal Year 2002: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 2001: ¥2,100,000 (Direct Cost: ¥2,100,000)
Keywords Ubiquitous / wearable computing environment / Computer-supported meeting system / Parallel computer / Spoken dialogue / Speech recognition system / Speech contents / Speaker adaptation / Conference CSCW system / Speech recognition / Ubiquitous / Wearable Computing / Spontaneous speech / Acoustic backing-off / Interactive system / Model training
Research Abstract

Research was conducted to build speech recognition technology for computer-supported conference systems in a ubiquitous/wearable computing environment. First, methods were investigated for building language models appropriate to spontaneous speech and acoustic models automatically adapted to individual voices. Since cross-talk cannot be avoided even when a microphone is attached to each participant in a meeting or discussion, an acoustic backing-off method was tried. In this method, the acoustic score during a cross-talk period is replaced by the mean value over a previous speech period. The proposed method was confirmed to be effective in improving recognition performance.
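The backing-off idea in the paragraph above can be illustrated with a minimal sketch. It is not the project's implementation: the function name `back_off_scores`, the per-frame boolean cross-talk mask, and the choice of a running mean over all earlier clean frames as the "previous speech period" are assumptions made only for illustration.

```python
from typing import List, Sequence


def back_off_scores(frame_scores: Sequence[float],
                    crosstalk: Sequence[bool]) -> List[float]:
    """Replace acoustic scores in cross-talk frames with a backed-off value.

    Assumption: the backed-off value is the mean score of all clean
    (non-cross-talk) frames seen so far. The abstract only says
    "a mean value for a previous speech period".
    """
    if len(frame_scores) != len(crosstalk):
        raise ValueError("scores and cross-talk mask must have equal length")

    result: List[float] = []
    clean_sum = 0.0   # sum of scores over clean frames seen so far
    clean_count = 0   # number of clean frames seen so far

    for score, is_crosstalk in zip(frame_scores, crosstalk):
        if is_crosstalk and clean_count > 0:
            # Back off to the mean of the preceding clean speech.
            result.append(clean_sum / clean_count)
        else:
            # Clean frame, or cross-talk with no history yet: keep raw score.
            result.append(score)
            if not is_crosstalk:
                clean_sum += score
                clean_count += 1
    return result


if __name__ == "__main__":
    scores = [-2.0, -4.0, -30.0, -28.0, -3.0]   # hypothetical log scores
    mask = [False, False, True, True, False]    # hypothetical cross-talk flags
    print(back_off_scores(scores, mask))
```

How cross-talk frames are detected is outside this sketch; the mask is simply taken as given.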
Second, a parallel computer-based speech recognition system, consisting of multiple recognizers with acoustic models adapted to each speaker, was built to recognize meeting utterances. Speaker changes are detected automatically during the meeting, and the acoustic models are adapted using an unsupervised method. For a new speaker, a speaker-adapted model is created incrementally. The recognition result with the maximum likelihood is chosen from the results of the multiple recognizers using the speaker-adapted acoustic models. This method was confirmed to be effective for building a real-time recognition system with good performance.
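The selection step described above, choosing the maximum-likelihood result among several speaker-adapted recognizers, might look roughly like the following sketch. The `Hypothesis` type, its field names, and `select_best` are hypothetical; the recognizers themselves, speaker-change detection, and model adaptation are not modeled here.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Hypothesis:
    """One recognizer's output for an utterance (hypothetical structure)."""
    speaker_model: str      # label of the speaker-adapted acoustic model used
    text: str               # recognized word sequence
    log_likelihood: float   # score assigned by that recognizer


def select_best(hypotheses: Iterable[Hypothesis]) -> Hypothesis:
    """Return the hypothesis with the maximum likelihood."""
    candidates = list(hypotheses)
    if not candidates:
        raise ValueError("no recognizer produced a hypothesis")
    return max(candidates, key=lambda h: h.log_likelihood)


if __name__ == "__main__":
    # Hypothetical outputs from three recognizers run in parallel.
    results = [
        Hypothesis("speaker_A", "let us start the meeting", -152.7),
        Hypothesis("speaker_B", "let us start the meeting", -118.2),
        Hypothesis("speaker_C", "lettuce tart the meeting", -201.4),
    ]
    best = select_best(results)
    print(best.speaker_model, best.text)
```

Comparing log-likelihoods directly assumes the recognizers score the same utterance on a comparable scale, which the abstract does not spell out.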
Third, the parallel computer-based speech recognition system was applied to a mixed-initiative spoken dialogue system that accepts multiple topics in parallel. The effectiveness of the system was also confirmed. Various other issues related to spontaneous speech recognition were also investigated in this research.

Report

(4 results)
  • 2002 Annual Research Report   Final Research Report Summary
  • 2001 Annual Research Report
  • 2000 Annual Research Report
  • Research Products

    (25 results)

All Publications (25 results)

  • [Publications] Sadaoki Furui, Daisuke Itoh, Zhipeng Zhang: "Neural-Network-Based HMM Adaptation for Noisy Speech Recognition" Acoust. Sci. & Tech. Vol.24, No.2. 69-75 (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu, Alex Waibel: "A Statistical Approach to Automatic Speech Summarization" EURASIP Journal on Applied Signal Processing. (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      2002 Final Research Report Summary
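  • [Publications] Takahiro Shinozaki, Sadaoki Furui: "Presentation Transcription Using a Japanese Spontaneous Speech Corpus" IPSJ Journal. Vol.43, No.7. 2098-2107 (2002)

    • Description
      From "Summary of Research Results Report (Japanese)"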
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Katsutoshi Ohtsuki, Tatsuo Matsuoka, Shoichi Matsunaga, Sadaoki Furui: "Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech" IEICE Trans. Inf. & Syst. Vol.E85-D, No.7. 1138-1144 (2002)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Zhipeng Zhang, Sadaoki Furui, Katsutoshi Ohtsuki: "On-line Incremental Speaker Adaptation for Broadcast News Transcription" Speech Communication. 37. 271-281 (2002)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      2002 Final Research Report Summary
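  • [Publications] Chiori Hori, Sadaoki Furui: "Summarized Speech Sentence Generation Based on Word Extraction and Its Evaluation" Trans. IEICE. J85-D-II, No.2. 200-209 (2002)

    • Description
      From "Summary of Research Results Report (Japanese)"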
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Sadaoki Furui: "Digital Speech Processing, Synthesis, and Recognition (Second Edition, Revised and Expanded)" Marcel Dekker, Inc. 452 (2000)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Sadaoki Furui, Daisuke Itoh and Zhipeng Zhang: "Neural-Network-Based HMM Adaptation for Noisy Speech Recognition" Acoust. Sci. & Tech. 24, No.2. 69-75 (2003)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu and Alex Waibel: "A Statistical Approach to Automatic Speech Summarization" EURASIP Journal on Applied Signal Processing. (2003)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takahiro Shinozaki and Sadaoki Furui: "Presentation Transcription Using a Japanese Spontaneous Speech Corpus" IPSJ Journal. 43, No.7. 2098-2107 (2002)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Katsutoshi Ohtsuki, Tatsuo Matsuoka, Shoichi Matsunaga and Sadaoki Furui: "Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech" IEICE Trans. Inf. & Syst. E85-D, No.7. 1138-1144 (2002)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Zhipeng Zhang, Sadaoki Furui and Katsutoshi Ohtsuki: "On-line Incremental Speaker Adaptation for Broadcast News Transcription" Speech Communication. 37. 271-281 (2002)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Chiori Hori and Sadaoki Furui: "Summarized Speech Sentence Generation Based on Word Extraction and Its Evaluation" Trans. IEICE. J85-D-II, No.2. 200-209 (2002)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Sadaoki Furui: "Digital Speech Processing, Synthesis, and Recognition (Second Edition, Revised and Expanded)" Marcel Dekker, Inc. 452 (2000)

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu, Alex Waibel: "A Statistical Approach to Automatic Speech Summarization" EURASIP Journal on Applied Signal Processing. 29. 361-367 (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Takahiro Shinozaki, Sadaoki Furui: "Presentation Transcription Using a Japanese Spontaneous Speech Corpus" IPSJ Journal. Vol.43, No.7. 2098-2107 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Katsutoshi Ohtsuki, Tatsuo Matsuoka, Shoichi Matsunaga, Sadaoki Furui: "Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech" IEICE Trans. Inf. & Syst. Vol.E85-D, No.7. 1138-1144 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Zhipeng Zhang, Sadaoki Furui, Katsutoshi Ohtsuki: "On-line Incremental Speaker Adaptation for Broadcast News Transcription" Speech Communication. 37. 271-281 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Shinozaki, Saito, Hori, Furui: "Toward the Recognition of Spontaneous Speech" (in Japanese) IEICE Technical Report. SP2000-96. 7-12 (2000)

    • Related Report
      2000 Annual Research Report
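  • [Publications] Shinozaki, Hosokawa, Furui: "A Study of Speech Recognition Using a Spontaneous Speech Corpus" (in Japanese) Proc. Acoustical Society of Japan Spring Meeting. (scheduled). (2001)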

    • Related Report
      2000 Annual Research Report
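  • [Publications] Saito, Furui: "A Study of Speech Recognition for Conversational Dialogue Speech" (in Japanese) Proc. Acoustical Society of Japan Spring Meeting. (scheduled). (2001)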

    • Related Report
      2000 Annual Research Report
  • [Publications] Sadaoki Furui: "Speech recognition technology in the ubiquitous/wearable computing environment" Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. 3735-3738 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Furui, Maekawa, Isahara, Shinozaki, Ohdaira: "Toward the realization of spontaneous speech recognition" Proc. Int. Conf. Spoken Lang. Process. III, 518-521 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Sadaoki Furui: "Steps toward flexible speech recognition - Recent progress at Tokyo Institute of Technology" Proc. 8th Australian Int. Conf. Speech Sci. & Tech. 19-29 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Sadaoki Furui: "Digital Speech Processing, Synthesis and Recognition (2nd Edition)" Marcel Dekker. 452 (2000)

    • Related Report
      2000 Annual Research Report

Published: 2001-04-01   Modified: 2016-04-21  
