Development of a multi-modal dialogue system and a tool for a spoken dialogue system

Research Project

Project/Area Number	08558030
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	展開研究
Research Field	Intelligent informatics
Research Institution	Toyohashi University of Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (20115893)
Co-Investigator(Kenkyū-buntansha)	YAMAMOTO Mikio University of Tsukuba Intitle of Information Sciences and Electronics, Assosiate, 第3学類, 助教授 (40210562) KAI Atsuhiko Toyohashi University of Technology, Faculty of Engineering, Research Assistant, 工学部, 助手 (60283496) MINEMATSU Nobuaki Toyohashi University of Technology, Faculty of Engineering, Research Assistant, 工学部, 助手 (90273333) NITTA Tsuneo Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授
Project Period (FY)	1996 – 1998
Project Status	Completed (Fiscal Year 1998)
Budget Amount *help	¥6,300,000 (Direct Cost: ¥6,300,000) Fiscal Year 1998: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 1997: ¥1,500,000 (Direct Cost: ¥1,500,000) Fiscal Year 1996: ¥3,900,000 (Direct Cost: ¥3,900,000)
Keywords	speech recognition / free software / spoken dialogue / multi-modal / portability / マルチモーダルインターフェース / 言語理解 / 質問応答システム
Research Abstract	In this research, we published a continuous speech recognition free software which consists of a clients server architecture and thus a user can effectively use this software as a means of speech input modality for developing a spoken dealogue system or multimodal dialogue system on standard PCs. In order to realize natural human-macbin interaction, we have developed a multi-modal sightseeing guidance system with 1) speech input / output, 2) touch screen input (on map/in menu) and 3) graphical/text output (map, photograph, menu and dealogue historiy). Furthermore, we implemented an agent interface wiht real face image / animation and recorded speech / synthesized speech to the system, and carried out evaluation experiments which consist of task completions and questionnaires to evaluate the interface and whole system. The evaluation experiments showed the effectiveness. Recently the study of robustenss and usability for speech recognition and language processing has been established, and speech recognition systems and dialogue systems have been developed to be practical use. But if these systems will become practical, it is important that not only those fundamental techniques but also the techniques of portability and expansibility should be developed. Based on this consideration, we examined our system in portability by transfering the domain of the system form the Mt. Fuji sightseeing, guidance to the Mikawa sightseeing guidance. Also we designed a domain independent platform of spoken dialogue system for database retrival, and applied the platform to a literature retrieval system.

Report

(4 results)

1998 Annual Research Report Final Research Report Summary
1997 Annual Research Report
1996 Annual Research Report

Research Products
(31 results)

All Other

All Publications (31 results)

[Publications] 甲斐充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 中川聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 小暮悟: "音声対話システムの移植性に関する考察" 情報処理学会, 音声言語情報処理研究報告. SLP25. 13-18 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 甲斐充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 堂下修司: "音声による人間と機械との対話" オーム社, 383 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 田〓行則: "音声" 岩波書店, 256 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] A.Denda, T.Ito and S.Nakagawa: "A robust dialogue system with spontaneous speech and touch screen" Proc.Int.Conf.Multimodal Interface-96'. 144-151 (1996)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] S.Nakagawa, A.Kai, T.Itoh and M.Ida: "An isolated/continuous speech recognition system on a personal computer" Proc.1997-China-Japan Symposium on Advanced Information Technology. 72-79 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] T.Itoh, A.Denda, S.Kogure and S.Nakagawa: "A robust dialogue system with spontaneous speech understanding and cooperative response" Proc.Interactive Spoken Dialog Systems.57-60 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] A.Kai, Y.Hirose and S.Nakagawa: "Dealing with out-of vocabulary words and speech disfluencies in an N-gram besed speech understanding system" Proc.5th Int.Conf.Spoken Language Processing. 2427-2430 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] K.Hanai, K.Yamamoto, N.Minematsu and S.Nakagawa: "Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context degendercy" Proc.5th Int.Conf.Spoken Langueage Processing. 2935-2938 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] S.Kogure, T.Itoh and S.Nakagawa: "A Semantic interperter for a robust spoken dialogue system" Proc.2nd Int.Conf.Multimodal Interface. II-61-66 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 甲斐充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)
- Related Report
  1998 Annual Research Report
[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 中川聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 小暮悟: "音声対話システムの移植性に関する考察" 情報処理学会、音声言語情報処理研究報告. SLP25. 13-18 (1999)
- Related Report
  1998 Annual Research Report
[Publications] 甲斐充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)
- Related Report
  1998 Annual Research Report
[Publications] 堂下修司: "音声による人間と機械との対話" オーム社, 383 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 田窪行則: "音声" 岩波書店, 256 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)
- Related Report
  1997 Annual Research Report
[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・3. (1998)
- Related Report
  1997 Annual Research Report
[Publications] 甲斐充彦: "自然な発話を対象としたパソコン/ワークステーション用連続音声認識ソフトウェア" 日本音響学会秋季研究発表会論文集. (1997)
- Related Report
  1997 Annual Research Report
[Publications] 山本幹雄: "メニューによりガイドされた文節単位による音声対話システム" 情報処理学会論文誌. 37・4. 461-469 (1996)
- Related Report
  1996 Annual Research Report
[Publications] 山本幹雄: "人間の理解手法を用いたロバストな音声対話システム" 情報処理学会論文誌. 37・4. 471-481 (1996)
- Related Report
  1996 Annual Research Report
[Publications] Min Zhou: "Succeding word prediction for speech recognition based on stochastic language model." Trans.IEICE Inf.& Syst.E79-D・4. 333-341 (1996)
- Related Report
  1996 Annual Research Report
[Publications] 中川聖一: "セグメント統計量を用いた隠れマルコフモデルによる音声認識" 電子情報通信学会論文誌. 79-DII・12. 2032-2038 (1996)
- Related Report
  1996 Annual Research Report
[Publications] 中川聖一: "音声対話システムの構成法とユーザ発話の関係" 電子情報通信学会論文誌. 79-DII・12. 2139-2145 (1996)
- Related Report
  1996 Annual Research Report
[Publications] 伊藤敏彦: "マルチモーダルインターフェースと協調的応答を備えた観光案内対話システムの評価" 情報処理学会シンポジュウム、インタラクション'97. 135-142 (1997)
- Related Report
  1996 Annual Research Report

Development of a multi-modal dialogue system and a tool for a spoken dialogue system

Principal Investigator

NAKAGAWA Seiichi Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

¥6,300,000 (Direct Cost: ¥6,300,000)

Report

Research Products

[Publications] 甲斐 充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

Description

Related Report

[Publications] 中川 聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

Description

Related Report

[Publications] 中川 聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)

Description

Related Report

[Publications] 伊藤 敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)

Description

Related Report

[Publications] 小暮 悟: "音声対話システムの移植性に関する考察" 情報処理学会, 音声言語情報処理研究報告. SLP25. 13-18 (1999)

Description

Related Report

[Publications] 甲斐 充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

Description

Related Report

[Publications] 堂下 修司: "音声による人間と機械との対話" オーム社, 383 (1998)

Description

Related Report

[Publications] 田〓 行則: "音声" 岩波書店, 256 (1998)

Description

Related Report

[Publications] A.Denda, T.Ito and S.Nakagawa: "A robust dialogue system with spontaneous speech and touch screen" Proc.Int.Conf.Multimodal Interface-96'. 144-151 (1996)

Description

Related Report

[Publications] S.Nakagawa, A.Kai, T.Itoh and M.Ida: "An isolated/continuous speech recognition system on a personal computer" Proc.1997-China-Japan Symposium on Advanced Information Technology. 72-79 (1997)

Description

Related Report

[Publications] T.Itoh, A.Denda, S.Kogure and S.Nakagawa: "A robust dialogue system with spontaneous speech understanding and cooperative response" Proc.Interactive Spoken Dialog Systems.57-60 (1997)

Description

Related Report

[Publications] A.Kai, Y.Hirose and S.Nakagawa: "Dealing with out-of vocabulary words and speech disfluencies in an N-gram besed speech understanding system" Proc.5th Int.Conf.Spoken Language Processing. 2427-2430 (1998)

Description

Related Report

[Publications] K.Hanai, K.Yamamoto, N.Minematsu and S.Nakagawa: "Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context degendercy" Proc.5th Int.Conf.Spoken Langueage Processing. 2935-2938 (1998)

Description

Related Report

[Publications] S.Kogure, T.Itoh and S.Nakagawa: "A Semantic interperter for a robust spoken dialogue system" Proc.2nd Int.Conf.Multimodal Interface. II-61-66 (1999)

Description

Related Report

[Publications] 甲斐 充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

Related Report

[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

Related Report

[Publications] 中川聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)

Related Report

[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)

Related Report

[Publications] 小暮 悟: "音声対話システムの移植性に関する考察" 情報処理学会、音声言語情報処理研究報告. SLP25. 13-18 (1999)

Related Report

[Publications] 甲斐 充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

Related Report

[Publications] 堂下 修司: "音声による人間と機械との対話" オーム社, 383 (1998)

Related Report

[Publications] 田窪 行則: "音声" 岩波書店, 256 (1998)

Related Report

[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

Related Report

[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・3. (1998)

Related Report

[Publications] 甲斐充彦: "自然な発話を対象としたパソコン/ワークステーション用連続音声認識ソフトウェア" 日本音響学会秋季研究発表会論文集. (1997)

Related Report

[Publications] 山本幹雄: "メニューによりガイドされた文節単位による音声対話システム" 情報処理学会論文誌. 37・4. 461-469 (1996)

Related Report

[Publications] 山本幹雄: "人間の理解手法を用いたロバストな音声対話システム" 情報処理学会論文誌. 37・4. 471-481 (1996)

Related Report

[Publications] Min Zhou: "Succeding word prediction for speech recognition based on stochastic language model." Trans.IEICE Inf.& Syst.E79-D・4. 333-341 (1996)

Related Report

[Publications] 中川聖一: "セグメント統計量を用いた隠れマルコフモデルによる音声認識" 電子情報通信学会論文誌. 79-DII・12. 2032-2038 (1996)

Related Report

[Publications] 中川聖一: "音声対話システムの構成法とユーザ発話の関係" 電子情報通信学会論文誌. 79-DII・12. 2139-2145 (1996)

Related Report

[Publications] 甲斐充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

[Publications] 中川聖一: "マルチモーダル観光案内対話システム" 人工知能学会誌. 13・2. 241-251 (1998)

[Publications] 中川聖一: "Bigramの使用による話し言葉用確率文脈自由文法の自動学習" 情報処理学会論文誌. 39・3. 575-584 (1998)

[Publications] 伊藤敏彦: "協調的応答を備えた観光案内音声対話システムとその評価" 情報処理学会論文誌. 39・5. 1248-1257 (1998)

[Publications] 小暮悟: "音声対話システムの移植性に関する考察" 情報処理学会, 音声言語情報処理研究報告. SLP25. 13-18 (1999)

[Publications] 甲斐充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

[Publications] 堂下修司: "音声による人間と機械との対話" オーム社, 383 (1998)

[Publications] 田〓行則: "音声" 岩波書店, 256 (1998)

[Publications] 甲斐充彦: "冗長語・言い直しを含む発話のための未知語処理を用いた音声認識システムの評価" 電子情報通信学会論文誌. 80DII-10. 2615-2625 (1997)

[Publications] 小暮悟: "音声対話システムの移植性に関する考察" 情報処理学会、音声言語情報処理研究報告. SLP25. 13-18 (1999)

[Publications] 甲斐充彦: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語処理" 情報処理学会論文誌. 40・4. (1999)

[Publications] 堂下修司: "音声による人間と機械との対話" オーム社, 383 (1998)

[Publications] 田窪行則: "音声" 岩波書店, 256 (1998)