• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2004 Fiscal Year Final Research Report Summary

A study on content summarization for large spoken documents and content retrieval through spoken dialogue

Research Project

Project/Area Number 13480095
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Department of Information and Computer Sciences, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) NITTA Tsuneo  Toyohashi University of Technology, Department of Knowledge-based Information Engineering, 大学院・工学研究科, 教授 (70314101)
MASUYAMA Shigeru  Toyohashi University of Technology, Department of Knowledge-based Information Engineering, 工学部, 教授 (60173762)
KITAOKA Norihide  Toyohashi University of Technology, Department of Information and Computer Sciences, 工学部, 講師 (10333501)
KOBAYASHI Satoshi  Shimane University, General Information Processing Center, 総合情報処理センター, 助教授 (90314096)
UTURO Takehito  Kyoto University, Graduate School of Informatics, 情報学研究科, 講師 (90263433)
Project Period (FY) 2001 – 2004
KeywordsSpeech Database / Speech Recognition / Spoken Language / Speech Summarization / Speech Retrieval / Question-Answering / Dictation / Spoken Dialogue
Research Abstract

To develop an accurate large vocabulary continuous speech recognition system for spoken document retrieval in open domain, we proposed a search method using two search algorithms in parallel to achieve efficient and accurate decoding. We evaluated this new search algorithm and obtained significant improvement of recognition performance without severe increase of computational cost We also proposed to apply machine learning techniques to the task of combining outputs of multiple LVCSR models. The proposed technique had advantages over that by voting schemes such as ROVER, especially when the majority of participating models are not reliable. By using this technique, we performed a speech-driven Web retrieval task and improved speech recognition accuracy of spoken queries and then improved retrieval accuracy in speech driven Web retrieval We tried the summarization of spoken lectures. For this purpose, we investigated relations between linguistic surface information and human's results, and we obtained useful surface linguistic information. Next, we summarized spoken lectures based on this information, and compared them with human's results. As a result, we obtained a better F-measure and k-value comparable with human's results. We have developed a portable speech recognition module and an interpreter module in a spoken dialogue system. Furthermore, we also developed a dialogue strategy design tool, applied it to Mt.Fuji sightseeing guidance retrieval, literature retrieval and hotel reservation retrieval and then confirmed the usefulness.

  • Research Products

    (13 results)

All 2005 2004 2003

All Journal Article (13 results)

  • [Journal Article] An supervised speaker adaptation method for lecture-style spontaneous speech recognition using multiple recognition system2005

    • Author(s)
      S.Nakagawa
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Improving key word recognition of spoken queries by combining multiple speech recognizer's outputs for speech-driven WEB retrieval task2005

    • Author(s)
      M.Matushita
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] An supervised speaker adaptation method for lecture style spontaneous speech recognition using multiple recognition system.2005

    • Author(s)
      Seiichi Nakagawa
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Improving keyword recognition of spoken queries by combing multiple speech recognizer's outputs for speech-driven WEB retrieval task.2005

    • Author(s)
      M.Matushita
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.Engrs. ED-88・3

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] 1-best近似木構造辞書探索と線形辞書探索の併用による大語彙連続音声認識2004

    • Author(s)
      北岡教英
    • Journal Title

      電子情報通信学会論文誌 87-DII・3

      Pages: 799-807

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] 機械学習を用いた複数の大語彙連続音声認識モデルの出力の混合2004

    • Author(s)
      宇津呂武仁
    • Journal Title

      電子情報通信学会論文誌 87-DII・7

      Pages: 1428-1440

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] 音声対話システムの誤認識に対するユーザの繰り返し発話の検出と認識2004

    • Author(s)
      北岡教英
    • Journal Title

      電子情報通信学会論文誌 87-DII・7

      Pages: 1441-1450

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Large vocabulary continuous speech recognition using linear lexicons search with N-best approximation and tree lexicon search with 1-best approximation.2004

    • Author(s)
      Norihide Kitaoka
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.(in Japanese) 87-D II・3

      Pages: 799-807

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Combing outputs of multiple LVSCR models by machine learning.2004

    • Author(s)
      Takehito Utsuro
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.(in Japanese) 87-D II・7

      Pages: 1428-1440

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Detection and recognition of correction Utterances on miss-recognition of spoken dialog system.2004

    • Author(s)
      Norihide Kitaoka
    • Journal Title

      Trans.Inst.Elect.Comm.Inform. 87-D II・7

      Pages: 1441-1450

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] 音声認識誤りと未知語に頑健な音声文書検索手法2003

    • Author(s)
      西崎博光
    • Journal Title

      電子情報通信学会論文誌 86-DII・10

      Pages: 1369-1381

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Continuous speech recognition using an one-line speaker adaptation method based on automatic speaker clustering2003

    • Author(s)
      Wei Zhang, Seiichi Nakagawa
    • Journal Title

      Trans.Inst.Elect.Comm.Inform. ED-86, 3

      Pages: 464-473

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Robust spoken documents retrieval methods for miss-recognition and out-of-vocabulary keywords.2003

    • Author(s)
      Hiromitu Nishizaki
    • Journal Title

      Trans.Inst.Elect.Comm.Inform.(in Japanese) 86-D II・10

      Pages: 1369-1381

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2006-07-11  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi