Study for Spontaneous Spoken Dialogue Understanding System

Research Project

Project/Area Number	09480059
Research Category	Grant-in-Aid for Scientific Research (B).
Allocation Type	Single-year Grants
Section	一般
Research Field	Intelligent informatics
Research Institution	CHIBA UNIVERSITY
Principal Investigator	ICHIKAWA Akira Chiba University, Graduate School on Science and Technology, Professor, 自然科学研究科, 教授 (80241933)
Co-Investigator(Kenkyū-buntansha)	HATAOKA Nobuo Hitachi Central Research Laboratory, Chief Research Scientist, 主管研究員 IMIYA Atsushi Chiba University, Department of Engineering, Professor, 工学部, 教授 (10176505) HORIUCHI Yasuo Chiba University, Graduate School on Science and Technology, Assistant, 自然科学研究科, 助手 (30272347)
Project Period (FY)	1997 – 2000
Project Status	Completed (Fiscal Year 2000)
Budget Amount *help	¥12,300,000 (Direct Cost: ¥12,300,000) Fiscal Year 2000: ¥3,200,000 (Direct Cost: ¥3,200,000) Fiscal Year 1999: ¥2,600,000 (Direct Cost: ¥2,600,000) Fiscal Year 1998: ¥2,200,000 (Direct Cost: ¥2,200,000) Fiscal Year 1997: ¥4,300,000 (Direct Cost: ¥4,300,000)
Keywords	spontaneous spoken dialogue understanding system / multi-agent system / reinforcement learning / utterance forecasting / 抑揚情報 / 協調的同時処理手法 / 予測文 / 実時間音声対話インーターフェース / 音声対話インターフェース技術 / 自然対話音声コーパス / ニュース文 / 抑揚 / 句構造 / マルチエージェント方式 / 効率的照合手法 / 強化学習方式 / 音声対話理解 / 発話の維持 / 心理的要因 / 実時間処理法 / プロフィットシェアリング法 / 音声対話 / 自然対話 / 話者交替 / 心理要因 / 抑揚木 / 結合係数
Research Abstract	In a spontaneous spoken dialogue understanding system, real-time response and robustness to the environment are required. To realize these requirements, we propose a multi-agent system as the system architecture. Each agent has its own function, e.g.phoneme recognition, input utterance structure reasoning from prosody, input utterance forecasting, word reasoning, parsing, etc. The output of this system is the result of co-operation of individual agents that adjust their own behavior to the environment or the input data independency. We propose the co-operation processes. A reinforcement learning method is proposed for a phoneme recognition agent as a sample agent, and adopted a continuous dynamic programming technique to deal with continuous phoneme recognition. To clarify the fundamental characteristics of the proposed method, we define some simple quasi conditions for the experiments, and confirm favorable results. The prosodic structure of the input utterance is represented as a tree form and constructed using FO, duration and pause information of each phrase of the utterance. The tree structure shows how strong successive phrases are concerned with each other. The forecasting the next input utterance uses the handling of two status transfer tables ; controlling for the dialogue in progress and for the dialogue in future. The system can be expected to achieve high adaptability to the environment(e.g., variation of speakers and tasks)and robustness. Some application systems(e.g.WWW voice browser)ware developed.

Report

(5 results)

2000 Annual Research Report Final Research Report Summary
1999 Annual Research Report
1998 Annual Research Report
1997 Annual Research Report

Research Products

(38 results)

All Other

All Publications (38 results)

[Publications] 清水智之,山本剛,市川熹: "音声対話処理のためのマルチエージェントシステム"人工知能学会大会予稿集 12回. 506-508 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Horiuchi,Y.and Ichikawa,A.: "Prosodic Structure in Japanese Spontaneous Speech"Proc.of ICSLP'98.. 591-594 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Ichikawa.A. et.Al.: "Evaluation of Annotation Schemes for Japanese Discourse"Proc.of Workshop in "Towards Standards and Tool for Discourse Tagging". 26-34 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Ichikawa,A.,Shimizu,T..,Horiuchi,Y.,: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION"Proc.Of EUROSPEECH'99.. 1107-1110 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] 藤原敦史,堀内靖雄,市川熹: "視覚障害者用WWWブラウジングインタフェースの検討"ヒューマンインタフェース学会論文誌.. Vol.2,No.2. 31-38 (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] 土井信洋,堀内靖雄,市川熹,: "マルチエージェント音声対話システムの内部機構"第9回マルチ・エージェントと協調計算ワークショップ(MACC2000). (HP). (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Shimizu, T., Yamamoto, T., Ichikawa, A.: "Multi Agent System for Spoken Dialogue Understanding(in Japanese)"Proc.of Japanese Artificial Soc.98. 506-509 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Horiuchi, Y.and Ichikawa, A.: "Prosodic Structure in Japanese Spontaneous Speech"Proc.of ICSLP'98. 591-594 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Ichikawa, A.et.al.: "Evaluation of Annotation Schemes for Japanese Discourse"Proc.of Workshop in "Towards Standards and Tool for Discourse Tagging". 26-34 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Ichikawa, A.Shimizu, T.., Horiuchi, Y.: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION"Proc.Of EUROSPEECH'99. 1107-1110 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Horiuchi, Y., Fujiwara, A., Ichikawa, A.: "NEW WWW BROWSER FOR VISUALLY IMPARED PEOPLE"Proc.Of EUROSPEECH'99. 2139-2142 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Fujita, S., Okada, K, Horiuchi, Y., Ichikawa, A.: "Examination for the Performance of the Matching Part of the Voice Dialogue System which merge the Prosody/Intonation Analysis and the Sentence Forseeing(in Japanese)"Technical Report of IEICE, SP99-172. 99. 41-48 (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] SHimizu, H., Horiuchi, Y., Ichikawa, A.: "Speech rate of keywords in spontaneous speech(in Japanese)"Technical Report of JSAI. SIG-SLUD-A003-6. 31-36 (2001)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Ohsuga, T., Horiuchi, Y., Ichikawa, A.: "Improvement on Automatic Detection of Phoneme Boundaries(in Japanese)"Proc.of the Spontaneous Speech Science and Technology Workshop. 143-148 (2001)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] Suzuki, N., Horiuchi, Y., Ichikawa, A.: "Quasi-Real Time Estimation of Tree Structure From Prosody(in Japanese)"Spring Meeting of JSA. Vol.1, 3-6-11. 341-342 (2001)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] 藤原敦史,堀内靖雄,市川熹: ""視覚障害者用WWWブラウジングインタフェースの検討"、"ヒューマンインタフェース学会論文誌. Vol.2,No.2. pp.31-38 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 市川熹,堀内靖雄,土屋俊: ""日本語地図課題コーパス"、"日本音声学会、音声研究、. 4巻2号. pp.4-15 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 大須賀智子,堀内靖雄,市川熹: ""音素境界の自動認識を目指して""人工知能学会言語・音声理解と対話処理研究会. SLUD-A002-4. pp.17-22 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 土井信洋,堀内靖雄,市川熹: ""マルチエージェント音声対話システムの内部機構""第9回マルチ・エージェントと協調計算ワークショップ(MACC2000). 143-148 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 清水久志,堀内靖雄,市川熹: ""自然対話における発話の時間構造の分析""人工知能学会言語・音声理解と対話処理研究会. SLUD-A003. pp.31-36 (2001)
- Related Report
  2000 Annual Research Report
[Publications] 大須賀智子,堀内靖雄,市川熹: ""音素セグメンテーションの自動化に関する検討"、"「話し言葉の科学と工学」. pp.143-148 (2001)
- Related Report
  2000 Annual Research Report
[Publications] Hanae Koiso, Akira Ichikawa, et al.: "An Analysis of Turn-Taking and Backchannels Based on Prosodic and Syntactic Features in Japanese Dialogue"Kingston Press Service LANGUAGE AND SPEECH. 41・3-4. 291-318 (1999)
- Related Report
  1999 Annual Research Report
[Publications] Ichikawa, A., Araki, M., et al.: "Evaluation of Annotation Schemes for Japanese Discourse"ACL Towards Standards and Tools for Disoourse Tagging WS. 26-34 (1999)
- Related Report
  1999 Annual Research Report
[Publications] Y. HORIUCHI, A. ICHIKAWA, et al.: "New WWW Browser for Visually Impaired People Using Interactive Voice Technology"Eurospeech'99. vol. 3. 1107-1110 (1999)
- Related Report
  1999 Annual Research Report
[Publications] A. ICHIKAWA, T. SHIMIZU & Y. HORIUCHI: "Reinforcement Learning for Phoneme Recognition"Eurospeech'99. vol. 5. 2139-2142 (1999)
- Related Report
  1999 Annual Research Report
[Publications] 岡田一秀、堀内靖雄、市川熹: "音声対話システムにおけるメタトピックの予測と応答合成"電子情報通信学会音声研究会. SP99-146. 65-72 (2000)
- Related Report
  1999 Annual Research Report
[Publications] 藤田聡、岡田一秀、堀内靖雄、市川熹: "音韻・抑揚・文章予測を混用した音声対話処理におけるマッチング部の性能検討"電子情報通信学会音声研究会. SP99-. (2000)
- Related Report
  1999 Annual Research Report
[Publications] 堀内靖雄他: "日本語地図課題対話コーパスの設計と特徴" 人工知能学会誌. Vol.14,No.2. 261-272 (1999)
- Related Report
  1998 Annual Research Report
[Publications] Akira ICHIKAWA et al.: "Standardizing Annotation Schemes for Japanese Discourse" Proc.of the 1st Int.Conf.on Language Resources and Evaluation. 28-30. 731-736 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 清水智之他: "音韻認識に対する強化学習の基礎検討" 1998年度春季日本音響学会講演論文集. 3-Q-11. 139-140 (1999)
- Related Report
  1998 Annual Research Report
[Publications] 清水智之他: "音声対話処理のためのマルチエージェントシステム" 1998年度人工知能学会全国大会論文集. 29-05. 506-509 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 堀内靖雄他: "自然対話音声の抑揚木の一検討" 電子情報通信学会音声研究会資料. SP98-3. 17-14 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 市川熹: "音声言語モデルに思うこと" 自然言語処理. Vol.6.No.2. 1-8 (1999)
- Related Report
  1998 Annual Research Report
[Publications] 田村博編共著: "ヒューマンインターフェース「音声言語による対話」" オーム社, 259-263 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 郵政省通信総合研究所編: "通信の百科事典(音声関係の項目)" 丸善, (1998)
- Related Report
  1998 Annual Research Report
[Publications] 中野、仲、市川他、: "日本語地図課題対話コーパスの基礎的統計" 人工知能学会言語・音声理解と対話処理研究会資料. SIG-SLUD-9701. 19-24 (1997)
- Related Report
  1997 Annual Research Report
[Publications] 市川、荒木、石崎、他: "談話タグ標準化の現状" 人工知能学会言語・音声理解と対話処理研究会資料. SIG-SLUD-9703. 41-49 (1997)
- Related Report
  1997 Annual Research Report
[Publications] 講演堀内、高橋、市川: "自然対話音声の抑揚木の一検討" 電子情報通信学会音声研究会資料. 98-01. (1998)
- Related Report
  1997 Annual Research Report

Study for Spontaneous Spoken Dialogue Understanding System

Principal Investigator

ICHIKAWA Akira Chiba University, Graduate School on Science and Technology, Professor, 自然科学研究科, 教授 (80241933)

¥12,300,000 (Direct Cost: ¥12,300,000)

Report

Research Products

[Publications] 清水智之,山本剛,市川熹: "音声対話処理のためのマルチエージェントシステム"人工知能学会大会予稿集 12回. 506-508 (1998)

Description

Related Report

[Publications] Horiuchi,Y.and Ichikawa,A.: "Prosodic Structure in Japanese Spontaneous Speech"Proc.of ICSLP'98.. 591-594 (1998)

Description

Related Report

[Publications] Ichikawa.A. et.Al.: "Evaluation of Annotation Schemes for Japanese Discourse"Proc.of Workshop in "Towards Standards and Tool for Discourse Tagging". 26-34 (1999)

Description

Related Report

[Publications] Ichikawa,A.,Shimizu,T..,Horiuchi,Y.,: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION"Proc.Of EUROSPEECH'99.. 1107-1110 (1999)

Description

Related Report

[Publications] 藤原敦史,堀内靖雄,市川熹: "視覚障害者用WWWブラウジングインタフェースの検討"ヒューマンインタフェース学会論文誌.. Vol.2,No.2. 31-38 (2000)

Description

Related Report

[Publications] 土井信洋,堀内靖雄,市川熹,: "マルチエージェント音声対話システムの内部機構"第9回マルチ・エージェントと協調計算ワークショップ(MACC2000). (HP). (2000)

Description

Related Report

[Publications] Shimizu, T., Yamamoto, T., Ichikawa, A.: "Multi Agent System for Spoken Dialogue Understanding(in Japanese)"Proc.of Japanese Artificial Soc.98. 506-509 (1998)

Description

Related Report

[Publications] Horiuchi, Y.and Ichikawa, A.: "Prosodic Structure in Japanese Spontaneous Speech"Proc.of ICSLP'98. 591-594 (1998)

Description

Related Report

[Publications] Ichikawa, A.et.al.: "Evaluation of Annotation Schemes for Japanese Discourse"Proc.of Workshop in "Towards Standards and Tool for Discourse Tagging". 26-34 (1999)

Description

Related Report

[Publications] Ichikawa, A.Shimizu, T.., Horiuchi, Y.: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION"Proc.Of EUROSPEECH'99. 1107-1110 (1999)

Description

Related Report

[Publications] Horiuchi, Y., Fujiwara, A., Ichikawa, A.: "NEW WWW BROWSER FOR VISUALLY IMPARED PEOPLE"Proc.Of EUROSPEECH'99. 2139-2142 (1999)

Description

Related Report

[Publications] Fujita, S., Okada, K, Horiuchi, Y., Ichikawa, A.: "Examination for the Performance of the Matching Part of the Voice Dialogue System which merge the Prosody/Intonation Analysis and the Sentence Forseeing(in Japanese)"Technical Report of IEICE, SP99-172. 99. 41-48 (2000)

Description

Related Report

[Publications] SHimizu, H., Horiuchi, Y., Ichikawa, A.: "Speech rate of keywords in spontaneous speech(in Japanese)"Technical Report of JSAI. SIG-SLUD-A003-6. 31-36 (2001)

Description

Related Report

[Publications] Ohsuga, T., Horiuchi, Y., Ichikawa, A.: "Improvement on Automatic Detection of Phoneme Boundaries(in Japanese)"Proc.of the Spontaneous Speech Science and Technology Workshop. 143-148 (2001)

Description

Related Report

[Publications] Suzuki, N., Horiuchi, Y., Ichikawa, A.: "Quasi-Real Time Estimation of Tree Structure From Prosody(in Japanese)"Spring Meeting of JSA. Vol.1, 3-6-11. 341-342 (2001)

Description

Related Report

[Publications] 藤原敦史,堀内靖雄,市川熹: ""視覚障害者用WWWブラウジングインタフェースの検討"、"ヒューマンインタフェース学会論文誌. Vol.2,No.2. pp.31-38 (2000)

Related Report

[Publications] 市川熹,堀内靖雄,土屋俊: ""日本語地図課題コーパス"、"日本音声学会、音声研究、. 4巻2号. pp.4-15 (2000)

Related Report

[Publications] 大須賀智子,堀内靖雄,市川熹: ""音素境界の自動認識を目指して""人工知能学会 言語・音声理解と対話処理研究会. SLUD-A002-4. pp.17-22 (2000)

Related Report

[Publications] 土井信洋,堀内靖雄,市川熹: ""マルチエージェント音声対話システムの内部機構""第9回マルチ・エージェントと協調計算ワークショップ(MACC2000). 143-148 (2000)

Related Report

[Publications] 清水久志,堀内靖雄,市川熹: ""自然対話における発話の時間構造の分析""人工知能学会 言語・音声理解と対話処理研究会. SLUD-A003. pp.31-36 (2001)

Related Report

[Publications] 大須賀智子,堀内靖雄,市川熹: ""音素セグメンテーションの自動化に関する検討"、"「話し言葉の科学と工学」. pp.143-148 (2001)

Related Report

[Publications] Hanae Koiso, Akira Ichikawa, et al.: "An Analysis of Turn-Taking and Backchannels Based on Prosodic and Syntactic Features in Japanese Dialogue"Kingston Press Service LANGUAGE AND SPEECH. 41・3-4. 291-318 (1999)

Related Report

[Publications] Ichikawa, A., Araki, M., et al.: "Evaluation of Annotation Schemes for Japanese Discourse"ACL Towards Standards and Tools for Disoourse Tagging WS. 26-34 (1999)

Related Report

[Publications] Y. HORIUCHI, A. ICHIKAWA, et al.: "New WWW Browser for Visually Impaired People Using Interactive Voice Technology"Eurospeech'99. vol. 3. 1107-1110 (1999)

Related Report

[Publications] A. ICHIKAWA, T. SHIMIZU & Y. HORIUCHI: "Reinforcement Learning for Phoneme Recognition"Eurospeech'99. vol. 5. 2139-2142 (1999)

Related Report

[Publications] 岡田一秀、堀内靖雄、市川 熹: "音声対話システムにおけるメタトピックの予測と応答合成"電子情報通信学会音声研究会. SP99-146. 65-72 (2000)

Related Report

[Publications] 藤田 聡、岡田一秀、堀内靖雄、市川 熹: "音韻・抑揚・文章予測を混用した音声対話処理におけるマッチング部の性能検討"電子情報通信学会音声研究会. SP99-. (2000)

Related Report

[Publications] 堀内靖雄 他: "日本語地図課題対話コーパスの設計と特徴" 人工知能学会誌. Vol.14,No.2. 261-272 (1999)

Related Report

[Publications] Akira ICHIKAWA et al.: "Standardizing Annotation Schemes for Japanese Discourse" Proc.of the 1st Int.Conf.on Language Resources and Evaluation. 28-30. 731-736 (1998)

Related Report

[Publications] 清水智之 他: "音韻認識に対する強化学習の基礎検討" 1998年度春季日本音響学会講演論文集. 3-Q-11. 139-140 (1999)

[Publications] 大須賀智子,堀内靖雄,市川熹: ""音素境界の自動認識を目指して""人工知能学会言語・音声理解と対話処理研究会. SLUD-A002-4. pp.17-22 (2000)

[Publications] 清水久志,堀内靖雄,市川熹: ""自然対話における発話の時間構造の分析""人工知能学会言語・音声理解と対話処理研究会. SLUD-A003. pp.31-36 (2001)

[Publications] 岡田一秀、堀内靖雄、市川熹: "音声対話システムにおけるメタトピックの予測と応答合成"電子情報通信学会音声研究会. SP99-146. 65-72 (2000)

[Publications] 藤田聡、岡田一秀、堀内靖雄、市川熹: "音韻・抑揚・文章予測を混用した音声対話処理におけるマッチング部の性能検討"電子情報通信学会音声研究会. SP99-. (2000)

[Publications] 堀内靖雄他: "日本語地図課題対話コーパスの設計と特徴" 人工知能学会誌. Vol.14,No.2. 261-272 (1999)

[Publications] 清水智之他: "音韻認識に対する強化学習の基礎検討" 1998年度春季日本音響学会講演論文集. 3-Q-11. 139-140 (1999)

[Publications] 清水智之他: "音声対話処理のためのマルチエージェントシステム" 1998年度人工知能学会全国大会論文集. 29-05. 506-509 (1998)

[Publications] 堀内靖雄他: "自然対話音声の抑揚木の一検討" 電子情報通信学会音声研究会資料. SP98-3. 17-14 (1998)

[Publications] 市川熹: "音声言語モデルに思うこと" 自然言語処理. Vol.6.No.2. 1-8 (1999)

[Publications] 田村博編共著: "ヒューマンインターフェース「音声言語による対話」" オーム社, 259-263 (1998)

[Publications] 中野、仲、市川他、: "日本語地図課題対話コーパスの基礎的統計" 人工知能学会言語・音声理解と対話処理研究会資料. SIG-SLUD-9701. 19-24 (1997)

[Publications] 市川、荒木、石崎、他: "談話タグ標準化の現状" 人工知能学会言語・音声理解と対話処理研究会資料. SIG-SLUD-9703. 41-49 (1997)

[Publications] 講演堀内、高橋、市川: "自然対話音声の抑揚木の一検討" 電子情報通信学会音声研究会資料. 98-01. (1998)