2004 Fiscal Year Final Research Report Summary

A study on content summarization for large spoken documents and content retrieval through spoken dialogue

Research Project

Project/Area Number	13480095
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	Toyohashi University of Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University of Technology, Department of Information and Computer Sciences, Faculty of Engineering, Professor (20115893)
Co-Investigator(Kenkyū-buntansha)	NITTA Tsuneo Toyohashi University of Technology, Department of Knowledge-based Information Engineering, Graduate School of Engineering, Professor (70314101); MASUYAMA Shigeru Toyohashi University of Technology, Department of Knowledge-based Information Engineering, Faculty of Engineering, Professor (60173762); KITAOKA Norihide Toyohashi University of Technology, Department of Information and Computer Sciences, Faculty of Engineering, Lecturer (10333501); KOBAYASHI Satoshi Shimane University, General Information Processing Center, Associate Professor (90314096); UTURO Takehito Kyoto University, Graduate School of Informatics, Lecturer (90263433)
Project Period (FY)	2001 – 2004
Keywords	Speech Database / Speech Recognition / Spoken Language / Speech Summarization / Speech Retrieval / Question-Answering / Dictation / Spoken Dialogue
Research Abstract	To develop an accurate large-vocabulary continuous speech recognition (LVCSR) system for open-domain spoken document retrieval, we proposed a decoding method that runs two search algorithms in parallel to achieve efficient and accurate decoding. In our evaluation, this search method significantly improved recognition performance without a severe increase in computational cost. We also proposed applying machine learning techniques to combine the outputs of multiple LVCSR models. The proposed technique had advantages over voting schemes such as ROVER, especially when the majority of the participating models are unreliable. Using this technique in a speech-driven Web retrieval task, we improved the recognition accuracy of spoken queries and, in turn, the retrieval accuracy. We also worked on summarizing spoken lectures. To this end, we investigated the relationship between surface linguistic information and human-produced summaries, and identified surface linguistic features that are useful for summarization. We then summarized spoken lectures using these features and compared the results with the human summaries. As a result, we obtained a better F-measure and a k-value comparable to those of the human summaries. Finally, we developed a portable speech recognition module and an interpreter module for a spoken dialogue system. We also developed a dialogue strategy design tool, applied it to Mt. Fuji sightseeing guidance retrieval, literature retrieval, and hotel reservation retrieval, and confirmed its usefulness.
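The abstract reports an F-measure for the lecture summaries but does not say how it was computed. As a minimal sketch, assuming an extractive setting in which both the system and a human annotator select sentences by index, the score could be calculated as below. The function name `f_measure` and the example sentence indices are hypothetical and are not taken from the report.

```python
def f_measure(system: set[int], reference: set[int]) -> float:
    """Balanced F-measure between system-selected and human-selected sentence indices.

    Assumption: summaries are represented as sets of sentence indices.
    """
    overlap = len(system & reference)
    if overlap == 0:
        # Covers empty inputs and disjoint selections.
        return 0.0
    precision = overlap / len(system)    # share of system sentences a human also chose
    recall = overlap / len(reference)    # share of human sentences the system recovered
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: sentence indices picked from one lecture transcript.
system_summary = {0, 2, 5, 7}
human_summary = {0, 2, 3, 7, 9}
score = f_measure(system_summary, human_summary)
print(round(score, 3))
```

Here three of the four system sentences match the human selection, so precision is 0.75 and recall is 0.6.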

Research Products
(13 results)

[Journal Article] An unsupervised speaker adaptation method for lecture-style spontaneous speech recognition using multiple recognition systems (2005)
- Author(s)
  S.Nakagawa
- Journal Title

  Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3
- Description
  From "Final Research Report Summary (Japanese)"
[Journal Article] Improving keyword recognition of spoken queries by combining multiple speech recognizer's outputs for speech-driven WEB retrieval task (2005)
- Author(s)
  M.Matushita
- Journal Title

  Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3
- Description
  From "Final Research Report Summary (Japanese)"
[Journal Article] An unsupervised speaker adaptation method for lecture-style spontaneous speech recognition using multiple recognition systems (2005)
- Author(s)
  Seiichi Nakagawa
- Journal Title

  Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3
- Description
  From "Final Research Report Summary (English)"
[Journal Article] Improving keyword recognition of spoken queries by combining multiple speech recognizer's outputs for speech-driven WEB retrieval task (2005)
- Author(s)
  M.Matushita
- Journal Title

  Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3
- Description
  From "Final Research Report Summary (English)"
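[Journal Article] Large-vocabulary continuous speech recognition using a combination of 1-best-approximation tree-structured lexicon search and linear lexicon search (2004)
- Author(s)
  Norihide Kitaoka
- Journal Title

  Transactions of the Institute of Electronics, Information and Communication Engineers (in Japanese) 87-DII・3

  Pages: 799-807
- Description
  From "Final Research Report Summary (Japanese)"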
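[Journal Article] Combining the outputs of multiple large-vocabulary continuous speech recognition models using machine learning (2004)
- Author(s)
  Takehito Utsuro
- Journal Title

  Transactions of the Institute of Electronics, Information and Communication Engineers (in Japanese) 87-DII・7

  Pages: 1428-1440
- Description
  From "Final Research Report Summary (Japanese)"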
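[Journal Article] Detection and recognition of users' repeated utterances in response to misrecognition by a spoken dialogue system (2004)
- Author(s)
  Norihide Kitaoka
- Journal Title

  Transactions of the Institute of Electronics, Information and Communication Engineers (in Japanese) 87-DII・7

  Pages: 1441-1450
- Description
  From "Final Research Report Summary (Japanese)"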
[Journal Article] Large vocabulary continuous speech recognition using linear lexicon search with N-best approximation and tree lexicon search with 1-best approximation (2004)
- Author(s)
  Norihide Kitaoka
- Journal Title

  Trans.Inst.Elect.Comm.Inform.(in Japanese) 87-D II・3

  Pages: 799-807
- Description
  From "Final Research Report Summary (English)"
[Journal Article] Combining outputs of multiple LVCSR models by machine learning (2004)
- Author(s)
  Takehito Utsuro
- Journal Title

  Trans.Inst.Elect.Comm.Inform.(in Japanese) 87-D II・7

  Pages: 1428-1440
- Description
  From "Final Research Report Summary (English)"
[Journal Article] Detection and recognition of correction utterances on misrecognition of spoken dialog system (2004)
- Author(s)
  Norihide Kitaoka
- Journal Title

  Trans.Inst.Elect.Comm.Inform. 87-D II・7

  Pages: 1441-1450
- Description
  From "Final Research Report Summary (English)"
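[Journal Article] A spoken document retrieval method robust to speech recognition errors and out-of-vocabulary words (2003)
- Author(s)
  Hiromitu Nishizaki
- Journal Title

  Transactions of the Institute of Electronics, Information and Communication Engineers (in Japanese) 86-DII・10

  Pages: 1369-1381
- Description
  From "Final Research Report Summary (Japanese)"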
[Journal Article] Continuous speech recognition using an on-line speaker adaptation method based on automatic speaker clustering (2003)
- Author(s)
  Wei Zhang, Seiichi Nakagawa
- Journal Title

  Trans.Inst.Elect.Comm.Inform. ED-86, 3

  Pages: 464-473
- Description
  From "Final Research Report Summary (English)"
[Journal Article] Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords (2003)
- Author(s)
  Hiromitu Nishizaki
- Journal Title

  Trans.Inst.Elect.Comm.Inform.(in Japanese) 86-D II・10

  Pages: 1369-1381
- Description
  From "Final Research Report Summary (English)"
