• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Development of a multi-modal dialogue system and a tool for a spoken dialogue system

Research Project

Project/Area Number 08558030
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section展開研究
Research Field Intelligent informatics
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) YAMAMOTO Mikio  University of Tsukuba Intitle of Information Sciences and Electronics, Assosiate, 第3学類, 助教授 (40210562)
KAI Atsuhiko  Toyohashi University of Technology, Faculty of Engineering, Research Assistant, 工学部, 助手 (60283496)
MINEMATSU Nobuaki  Toyohashi University of Technology, Faculty of Engineering, Research Assistant, 工学部, 助手 (90273333)
NITTA Tsuneo  Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授
Project Period (FY) 1996 – 1998
Project Status Completed (Fiscal Year 1998)
Budget Amount *help
¥6,300,000 (Direct Cost: ¥6,300,000)
Fiscal Year 1998: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1997: ¥1,500,000 (Direct Cost: ¥1,500,000)
Fiscal Year 1996: ¥3,900,000 (Direct Cost: ¥3,900,000)
Keywordsspeech recognition / free software / spoken dialogue / multi-modal / portability / マルチモーダルインターフェース / 言語理解 / 質問応答システム
Research Abstract

In this research, we published a continuous speech recognition free software which consists of a clients server architecture and thus a user can effectively use this software as a means of speech input modality for developing a spoken dealogue system or multimodal dialogue system on standard PCs.
In order to realize natural human-macbin interaction, we have developed a multi-modal sightseeing guidance system with 1) speech input / output, 2) touch screen input (on map/in menu) and 3) graphical/text output (map, photograph, menu and dealogue historiy). Furthermore, we implemented an agent interface wiht real face image / animation and recorded speech / synthesized speech to the system, and carried out evaluation experiments which consist of task completions and questionnaires to evaluate the interface and whole system. The evaluation experiments showed the effectiveness.
Recently the study of robustenss and usability for speech recognition and language processing has been established, and speech recognition systems and dialogue systems have been developed to be practical use. But if these systems will become practical, it is important that not only those fundamental techniques but also the techniques of portability and expansibility should be developed.
Based on this consideration, we examined our system in portability by transfering the domain of the system form the Mt. Fuji sightseeing, guidance to the Mikawa sightseeing guidance. Also we designed a domain independent platform of spoken dialogue system for database retrival, and applied the platform to a literature retrieval system.

Report

(4 results)
  • 1998 Annual Research Report   Final Research Report Summary
  • 1997 Annual Research Report
  • 1996 Annual Research Report
  • Research Products

    (31 results)

All Other

All Publications (31 results)

  • [Publications] 甲斐 充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 中川 聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 中川 聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 伊藤 敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 小暮 悟: "音声対話システムの移植性に関する考察" 情報処理学会, 音声言語情報処理研究報告. SLP25. 13-18 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 甲斐 充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 堂下 修司: "音声による人間と機械との対話" オーム社, 383 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 田〓 行則: "音声" 岩波書店, 256 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] A.Denda, T.Ito and S.Nakagawa: "A robust dialogue system with spontaneous speech and touch screen" Proc.Int.Conf.Multimodal Interface-96'. 144-151 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] S.Nakagawa, A.Kai, T.Itoh and M.Ida: "An isolated/continuous speech recognition system on a personal computer" Proc.1997-China-Japan Symposium on Advanced Information Technology. 72-79 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] T.Itoh, A.Denda, S.Kogure and S.Nakagawa: "A robust dialogue system with spontaneous speech understanding and cooperative response" Proc.Interactive Spoken Dialog Systems.57-60 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] A.Kai, Y.Hirose and S.Nakagawa: "Dealing with out-of vocabulary words and speech disfluencies in an N-gram besed speech understanding system" Proc.5th Int.Conf.Spoken Language Processing. 2427-2430 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] K.Hanai, K.Yamamoto, N.Minematsu and S.Nakagawa: "Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context degendercy" Proc.5th Int.Conf.Spoken Langueage Processing. 2935-2938 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] S.Kogure, T.Itoh and S.Nakagawa: "A Semantic interperter for a robust spoken dialogue system" Proc.2nd Int.Conf.Multimodal Interface. II-61-66 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 甲斐 充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

    • Related Report
      1998 Annual Research Report
  • [Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 中川聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 小暮 悟: "音声対話システムの移植性に関する考察" 情報処理学会、音声言語情報処理研究報告. SLP25. 13-18 (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 甲斐 充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 堂下 修司: "音声による人間と機械との対話" オーム社, 383 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 田窪 行則: "音声" 岩波書店, 256 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・3. (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 甲斐充彦: "自然な発話を対象としたパソコン/ワークステーション用連続音声認識ソフトウェア" 日本音響学会秋季研究発表会論文集. (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 山本幹雄: "メニューによりガイドされた文節単位による音声対話システム" 情報処理学会論文誌. 37・4. 461-469 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 山本幹雄: "人間の理解手法を用いたロバストな音声対話システム" 情報処理学会論文誌. 37・4. 471-481 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Min Zhou: "Succeding word prediction for speech recognition based on stochastic language model." Trans.IEICE Inf.& Syst.E79-D・4. 333-341 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 中川聖一: "セグメント統計量を用いた隠れマルコフモデルによる音声認識" 電子情報通信学会論文誌. 79-DII・12. 2032-2038 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 中川聖一: "音声対話システムの構成法とユーザ発話の関係" 電子情報通信学会論文誌. 79-DII・12. 2139-2145 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 伊藤敏彦: "マルチモーダルインターフェースと協調的応答を備えた観光案内対話システムの評価" 情報処理学会シンポジュウム、インタラクション'97. 135-142 (1997)

    • Related Report
      1996 Annual Research Report

URL: 

Published: 1996-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi