• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2002 Fiscal Year Final Research Report Summary

Speech Recognition for Computer-Supported Conference Systems under Ubiquitous/Wearable Computing Environment

Research Project

Project/Area Number 12480083
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionTokyo Institute of Technology

Principal Investigator

FURUI Sadaoki  Tokyo Institute of Technology, Graduate School of Information Sci. & Eng., Department of Computer Science, Professor, 大学院・情報理工学研究科, 教授 (90293076)

Co-Investigator(Kenkyū-buntansha) IWANO Koji  Tokyo Institute of Technology, Assistant Professor, Graduate School of Information Sci. & Eng., 大学院・情報理工学研究科, 助手 (90323823)
Project Period (FY) 2000 – 2002
KeywordsUbiquitous / wearable computing environment / Computer-supported meeting system / Parallel computer / Spoken dialogue / Speech recognition system / Speech contents
Research Abstract

Research to build speech recognition technology far computer-supported conference systems under ubiquitous/wearable computing environment has been conducted. First, methods for building both language models appropriate to spontaneous speech and acoustic models automatically adapted to voice individuality have been investigated. Since the cross-talk problem cannot be avoided even if a microphone is attached to each participant of meetings or discussions, an acoustic backing-off method has been tried. In this method, the acoustic score during the cross-talk period is replaced by a mean value for a previous speech Period. The proposed method was confirmed to be effective to improve the recognition Performance.
Second, a parallel committer-based speech recognition system consisting of multiple recognizers having acoustic models adapted to each speaker has been built to recognize meeting utterances. Speaker change is automatically detected during the meeting, and acoustic models are adapted using an unsupervised method. For a new speaker, a speaker-adapted model is incrementally created. A speech recognition result having the maximum likelihood is chosen from the results of multiple recognizers using the speaker-adapted acoustic models. It was confirmed that this method is effective to build a real-time reformation system with a good performance.
Third, the parallel computer-based speech recognition system has been applied to a mixed- initiative spoken dialogue system accepting multiple topics in parallel. Effectiveness of the system was also confirmed. Various other issues related to spontaneous speech recognition have also been an investigated in this research.

  • Research Products

    (14 results)

All Other

All Publications (14 results)

  • [Publications] Sadaoki Furui, Daisuke Itoh, Zhipeng Zhang: "Neural-Network-Based HMM Adaptation for Noisy Speech Recognition"Acoust.Sci.& Tech.. Vol.24,No.2. 69-75 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu, Alex Waibel: "A Statistical Approach to Automatic Speech Summarization"EURASIP Journal on Applied Signal Processing. (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 篠崎 隆宏, 古井 貞煕: "日本語話し言葉コーパスを用いた講演音声認識"情報処理学会論文誌. Vol.43,No.7. 2098-2107 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Katsutoshi Ohtsuki, Tatsuo Matsuoka, Shoichi Matsunaga, Sadaoki Furui: "Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech"IEICE Trans.Inf.& Syst.. Vol.E85-D, No.7. 1138-1144 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Zhipeng Zhang, Sadaoki Furui, Katsutoshi Ohtsuki: "On-line Incremental Speaker Adaptation for Broadcast News Transcription"Speech Communication. 37. 271-281 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 堀 智織, 古井 貞煕: "単語抽出による音声要約文生成法とその評価"電子情報通信学会論文誌. J85-D-II No.2. 200-209 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Sadaoki Furui: "Digital Speech Processing, Synthesis, and Recognition(Second Edition, Revised and Expanded)"Marcel Dekker, Inc.. 452 (2000)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Sadaoki Furui, Daisuke Itoh and Zhipeng Zhang: "Neural-Network-Based HMM Adaptation for Noisy Speech Recognition"Acoust. Sic. & Tech.. 24, No.2. 69-75 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu and Alex Waibel: "A Statistical Approach to Automatic Speech Summarization"EURASIP Journal on Applied Signal Processing. (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Takahiro Shinozaki and Sadaoki Furui: "Presentation Transcription Using a Japanese Spontaneous Speech Corpus"IPSJ Journal. 43, No.7. 2098-2107 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Katsutoshi Ohtsuki, Tatsuo Matsuoka, Shoichi Mistunaga and Sadaoki Furui: "Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech"IEICE Trans.Inf. & Syst.. E85-D, No.7. 1138-1144 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Zhipeng Zhang, Sadaoki Furui and Katsutoshi Ohtsuki: "On-line Incremental Speaker Adaptation for Broadcast News Transcription"Speech Communication. 37. 271-281

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Chiori Hori and Sadaoki Furui: "Summarized Speech Sentence Generation Based on Word Extraction and Its Evaluation"Trans. IEICE. J85-DII-No.2. 200-209 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Sadaoki Furui: "Digital Speech Processing, Synthesis, and Recognition(Second Edition, Revised and Expanded)"Marcel Dekker, Inc.. 452 (2000)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2004-04-14  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi