2000 Fiscal Year Final Research Report Summary

Study for Spontaneous Spoken Dialogue Understanding System

Research Project

Project/Area Number	09480059
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	CHIBA UNIVERSITY
Principal Investigator	ICHIKAWA Akira, Chiba University, Graduate School of Science and Technology, Professor (80241933)
Co-Investigator(Kenkyū-buntansha)	HATAOKA Nobuo, Hitachi Central Research Laboratory, Chief Research Scientist; IMIYA Atsushi, Chiba University, Department of Engineering, Professor (10176505); HORIUCHI Yasuo, Chiba University, Graduate School of Science and Technology, Assistant (30272347)
Project Period (FY)	1997 – 2000
Keywords	spontaneous spoken dialogue understanding system / multi-agent system / reinforcement learning / utterance forecasting / cooperative concurrent processing method / predicted sentence
Research Abstract	A spontaneous spoken dialogue understanding system requires real-time response and robustness to its environment. To meet these requirements, we propose a multi-agent system as the system architecture. Each agent has its own function, e.g., phoneme recognition, inference of the input utterance structure from prosody, input utterance forecasting, word inference, parsing, etc. The output of the system is the result of cooperation among the individual agents, each of which independently adjusts its own behavior to the environment or the input data. We propose the cooperation processes. A reinforcement learning method is proposed for a phoneme recognition agent as a sample agent, and a continuous dynamic programming technique is adopted to deal with continuous phoneme recognition. To clarify the fundamental characteristics of the proposed method, we defined some simple quasi-conditions for the experiments and confirmed favorable results. The prosodic structure of the input utterance is represented as a tree, constructed from the F0, duration, and pause information of each phrase of the utterance. The tree structure shows how strongly successive phrases are related to each other. Forecasting of the next input utterance uses two state transition tables: one controlling the dialogue in progress and one for the future dialogue. The system can be expected to achieve robustness and high adaptability to the environment (e.g., variation of speakers and tasks). Some application systems (e.g., a WWW voice browser) were developed.
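The abstract describes a prosodic tree that records how strongly successive phrases are related, but it does not give the construction procedure. The following is only a minimal sketch under stated assumptions: it takes one boundary score per pair of adjacent phrases (hypothetically derived from pause length, duration, and fundamental-frequency information) and greedily merges the pair separated by the weakest boundary. The function name `build_prosodic_tree`, the scoring convention, and the greedy merge are illustrative assumptions, not the method used in the project.

```python
from typing import List, Tuple, Union

# A tree is either a phrase (leaf) or a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def build_prosodic_tree(phrases: List[str], boundaries: List[float]) -> Tree:
    """Greedy bottom-up merge (hypothetical heuristic, not the report's method).

    boundaries[i] is an assumed boundary strength between phrases[i] and
    phrases[i + 1]; a larger value means a weaker connection.
    """
    if len(boundaries) != len(phrases) - 1:
        raise ValueError("need exactly one boundary score per adjacent pair")
    nodes: List[Tree] = list(phrases)
    scores = list(boundaries)
    while len(nodes) > 1:
        # Index of the weakest boundary, i.e. the most closely related pair.
        i = min(range(len(scores)), key=scores.__getitem__)
        nodes[i:i + 2] = [(nodes[i], nodes[i + 1])]
        del scores[i]
    return nodes[0]


tree = build_prosodic_tree(["A", "B", "C", "D"], [0.1, 0.8, 0.2])
print(tree)
```

In this sketch, phrases merged early sit deep in the tree and are the most closely related, while the strongest boundary ends up as the top-level split.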

Research Products
(15 results)

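[Publications] Shimizu, T., Yamamoto, T., Ichikawa, A.: "Multi-Agent System for Spoken Dialogue Processing (in Japanese)" Proc. of the 12th Annual Conference of the Japanese Society for Artificial Intelligence. 506-508 (1998)
- Description
  From the Final Research Report Summary (Japanese)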
[Publications] Horiuchi, Y. and Ichikawa, A.: "Prosodic Structure in Japanese Spontaneous Speech" Proc. of ICSLP'98. 591-594 (1998)
- Description
  From the Final Research Report Summary (Japanese)
[Publications] Ichikawa, A. et al.: "Evaluation of Annotation Schemes for Japanese Discourse" Proc. of the Workshop "Towards Standards and Tools for Discourse Tagging". 26-34 (1999)
- Description
  From the Final Research Report Summary (Japanese)
[Publications] Ichikawa, A., Shimizu, T., Horiuchi, Y.: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION" Proc. of EUROSPEECH'99. 1107-1110 (1999)
- Description
  From the Final Research Report Summary (Japanese)
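[Publications] Fujiwara, A., Horiuchi, Y., Ichikawa, A.: "A Study of a WWW Browsing Interface for Visually Impaired Users (in Japanese)" Transactions of the Human Interface Society. Vol.2, No.2. 31-38 (2000)
- Description
  From the Final Research Report Summary (Japanese)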
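[Publications] Doi, N., Horiuchi, Y., Ichikawa, A.: "Internal Mechanism of a Multi-Agent Spoken Dialogue System (in Japanese)" 9th Workshop on Multi-Agent and Cooperative Computation (MACC2000). (Web page). (2000)
- Description
  From the Final Research Report Summary (Japanese)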
[Publications] Shimizu, T., Yamamoto, T., Ichikawa, A.: "Multi Agent System for Spoken Dialogue Understanding (in Japanese)" Proc. of Japanese Society for Artificial Intelligence '98. 506-509 (1998)
- Description
  From the Final Research Report Summary (English)
[Publications] Horiuchi, Y. and Ichikawa, A.: "Prosodic Structure in Japanese Spontaneous Speech" Proc. of ICSLP'98. 591-594 (1998)
- Description
  From the Final Research Report Summary (English)
[Publications] Ichikawa, A. et al.: "Evaluation of Annotation Schemes for Japanese Discourse" Proc. of the Workshop "Towards Standards and Tools for Discourse Tagging". 26-34 (1999)
- Description
  From the Final Research Report Summary (English)
[Publications] Ichikawa, A., Shimizu, T., Horiuchi, Y.: "REINFORCEMENT LEARNING FOR PHONEME RECOGNITION" Proc. of EUROSPEECH'99. 1107-1110 (1999)
- Description
  From the Final Research Report Summary (English)
[Publications] Horiuchi, Y., Fujiwara, A., Ichikawa, A.: "NEW WWW BROWSER FOR VISUALLY IMPAIRED PEOPLE" Proc. of EUROSPEECH'99. 2139-2142 (1999)
- Description
  From the Final Research Report Summary (English)
[Publications] Fujita, S., Okada, K., Horiuchi, Y., Ichikawa, A.: "Examination for the Performance of the Matching Part of the Voice Dialogue System which Merges the Prosody/Intonation Analysis and the Sentence Foreseeing (in Japanese)" Technical Report of IEICE, SP99-172. 99. 41-48 (2000)
- Description
  From the Final Research Report Summary (English)
[Publications] Shimizu, H., Horiuchi, Y., Ichikawa, A.: "Speech rate of keywords in spontaneous speech (in Japanese)" Technical Report of JSAI. SIG-SLUD-A003-6. 31-36 (2001)
- Description
  From the Final Research Report Summary (English)
[Publications] Ohsuga, T., Horiuchi, Y., Ichikawa, A.: "Improvement on Automatic Detection of Phoneme Boundaries (in Japanese)" Proc. of the Spontaneous Speech Science and Technology Workshop. 143-148 (2001)
- Description
  From the Final Research Report Summary (English)
[Publications] Suzuki, N., Horiuchi, Y., Ichikawa, A.: "Quasi-Real Time Estimation of Tree Structure From Prosody (in Japanese)" Spring Meeting of JSA. Vol.1, 3-6-11. 341-342 (2001)
- Description
  From the Final Research Report Summary (English)
