2001 Fiscal Year Final Research Report Summary

Study on Enhancement of Spoken Language Processing Using Dialogue Corpus Annotated with Discourse Information

Research Project

Project/Area Number	11480073
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Intelligent informatics
Research Institution	The University of Electro-Communications
Principal Investigator	KUREMATSU Akira The Univ. of Electro-Communications, Graduate School of Electro-Communications, Professor, 大学院・電気通信学研究科, 教授 (90251701)
Co-Investigator(Kenkyū-buntansha)	伝康晴千葉大学, 文学部, 助教授 (70291458) 山下洋一立命館大学, 理工学部, 教授 (80174689) 荒木雅弘京都工芸繊維大学, 工芸学部, 助教授 (50252490) 中里収名桜大学, 国際学部, 助教授 (90257197) 石崎雅人北陸先端科学技術大学院大学, 知識科学研究科, 助教授 (30303340)
Project Period (FY)	1999 – 2001
Keywords	Spoken Dialogue Corpus / Goal Oriented Dialogue / Dialogue Tag / Prosody / Morphological Information / Dialogue Act / Dialogue Segment
Research Abstract	Dialogue corpora are indispensable to speech and language research. We developed a Japanese dialogue corpus annotated with multi-level information. The annotation information consists of speech, transcription delimited by slash unite, prosodic, part of speech, dialogue acts and dialogue segmentation. The corpus consists of 40 goal-oriented dialogues collected at different research groups. The tagging scheme was evaluated On an experimental basis. A method to infer the utterance-unit tag from both the text corpus and its ,morpheme analysis was proposed. The GUI-based annotation environment was developed which enables the users to predict dialogue acts and relevance information using machine laming techniques and to store the annotated data in the XML format. The autonomous model for turn-taking was proposed, predicting the distribution of smooth transitions between speakers. A method of tagging lubricant words which include discourse markers, fillers and acknowledgement tokens. A discourse level tagging tool using Transformation-based Learning from training corpus was developed. A rule based approach to extract dialogue acts and topics from utterances was investigated. Studies on identifying dialogue acts based on prosodic information and key words information were undertaken and the use of prosodic information was shown to be effective for dialogue tagging. Linguistic cues for dialogue act classification based on statistical analysis were explored. A tool to generate dialogue patterns based on automatically generated Voice XML from dialogue corpus. The disagreements of topic boundaries caused by different strategies were analyzed. The high correlation was shown between the degree of topic break and prosodic parameters by analyzing the topic segment tags and prosody.

Research Products
(14 results)

All Other

All Publications (14 results)

[Publications] 中里, 田本, 菊池, 吉村: "課題遂行対話における対話潤滑語の認定"人工知能学会誌. 14. 900-906 (1999)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Araki, M., Ueda, K., Nishimoto, T., Niimi, Y.: "A semantic tagging tool for spoken dialogue corpus"Proceedings of the 6^<th> International Conference on Spoken Language Processing. 4. 720-723 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 小椋, 伝: "談話行為型の認識に役立つ言語情報の同定"人工知能学会研究会資料. SIG-SLUD-9903. 31-36 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 小倉, 須藤, 松永, 石崎: "多レベルの知識を利用した課題遂行対話のためのセグメンテーション分割"人工知能学会研究会資料. SIG-SLUD-A0103. 33-38 (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 大谷, 荒木, 西本, 新美: "対話例からの混合主導型音声対話システムの構築"人工知能学会研究会資料. SIG-SLUD-A0103. 21-26 (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 中村, 榑松: "スケジューリングタスクにおける2進分類木による発話意図分類"人工知能学会研究会資料. SIG-SLUD-A0103. 13-20 (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 石崎, 伝: "談話と対話"東京大学出版会. 239 (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Araki, M., Ueda, K., Nishimoto, T. and Niimi, Y.: "A semantic tagging tool for spoken dialogue corpus"Proceedings of the 6^<th> International Conference on Spoken Language Processing. Vol.4. 720-723 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Araki, M., Kimura, Y., Nishimoto, T. and Niimi, Y.: "Development of a machine learnable discourse tagging tool"Proceedings of 2nd SIGDIAL Workshop on Discourse and Dialogue. 20-25 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Niimi, Y., Oku, T., Nishimoto, T. and Araki, M.: "A rule based approach to extraction of topics and dialog acts in a spoken dialog system"Proceedings of Eurospeech 2001. 2185-2188 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Ichikawa, A., Araki, M., Horiuchi, Y., Ishizaki, M., Itabashi, S., Itoh, T., Kashioka, H., Kato, K., Kikuchi, H., Koiso, H., Kumagai, T., Kurematsu, A., Maekawa, K., Nakazato, S., Tamato, M., Tutiya, S., Yamashita, Y. and Yoshimura, T.: "Evaluation of Annotation Schemes for Japanese Discourse"Proceedings of ACL '99 Workshop on Towards Standards and Tools for Discourse Tagging. 26-34 (1999)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Japanese Discourse Research Initiative: "Japanese Dialogue Corpus of Multi-level Annotation"Proceedings of the 1st SOGdial Workshop on Discourse and Dialogue. 1-8 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kurematsu, A., Hosoki, M., Morimoto, Y.: "Prosody Pattern Recognition and Identification of Utterance Intentions"Proceedings of ISCA Prosody Workshop. 93-96 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Yamashita, Y. and Murai, M.: "An Annotation Scheme of Spoken Dialogues with Topic Break Indexes"Proceedings of the 6^<th> International Conference on Spoken Language Processing. Vol.1. 569-572 (2000)
- Description
  「研究成果報告書概要(欧文)」より

2001 Fiscal Year Final Research Report Summary

Study on Enhancement of Spoken Language Processing Using Dialogue Corpus Annotated with Discourse Information

Principal Investigator

KUREMATSU Akira The Univ. of Electro-Communications, Graduate School of Electro-Communications, Professor, 大学院・電気通信学研究科, 教授 (90251701)

Research Products

[Publications] 中里, 田本, 菊池, 吉村: "課題遂行対話における対話潤滑語の認定"人工知能学会誌. 14. 900-906 (1999)

Description

[Publications] Araki, M., Ueda, K., Nishimoto, T., Niimi, Y.: "A semantic tagging tool for spoken dialogue corpus"Proceedings of the 6^<th> International Conference on Spoken Language Processing. 4. 720-723 (2000)

Description

[Publications] 小椋, 伝: "談話行為型の認識に役立つ言語情報の同定"人工知能学会研究会資料. SIG-SLUD-9903. 31-36 (2000)

Description

[Publications] 小倉, 須藤, 松永, 石崎: "多レベルの知識を利用した課題遂行対話のためのセグメンテーション分割"人工知能学会研究会資料. SIG-SLUD-A0103. 33-38 (2002)

Description

[Publications] 大谷, 荒木, 西本, 新美: "対話例からの混合主導型音声対話システムの構築"人工知能学会研究会資料. SIG-SLUD-A0103. 21-26 (2002)

Description

[Publications] 中村, 榑松: "スケジューリングタスクにおける2進分類木による発話意図分類"人工知能学会研究会資料. SIG-SLUD-A0103. 13-20 (2002)

Description

[Publications] 石崎, 伝: "談話と対話"東京大学出版会. 239 (2001)

Description

[Publications] Araki, M., Ueda, K., Nishimoto, T. and Niimi, Y.: "A semantic tagging tool for spoken dialogue corpus"Proceedings of the 6^<th> International Conference on Spoken Language Processing. Vol.4. 720-723 (2000)

Description

[Publications] Araki, M., Kimura, Y., Nishimoto, T. and Niimi, Y.: "Development of a machine learnable discourse tagging tool"Proceedings of 2nd SIGDIAL Workshop on Discourse and Dialogue. 20-25 (2001)

Description

[Publications] Niimi, Y., Oku, T., Nishimoto, T. and Araki, M.: "A rule based approach to extraction of topics and dialog acts in a spoken dialog system"Proceedings of Eurospeech 2001. 2185-2188 (2001)

Description

Description

[Publications] Japanese Discourse Research Initiative: "Japanese Dialogue Corpus of Multi-level Annotation"Proceedings of the 1st SOGdial Workshop on Discourse and Dialogue. 1-8 (2000)

Description

[Publications] Kurematsu, A., Hosoki, M., Morimoto, Y.: "Prosody Pattern Recognition and Identification of Utterance Intentions"Proceedings of ISCA Prosody Workshop. 93-96 (2001)

Description

[Publications] Yamashita, Y. and Murai, M.: "An Annotation Scheme of Spoken Dialogues with Topic Break Indexes"Proceedings of the 6^<th> International Conference on Spoken Language Processing. Vol.1. 569-572 (2000)

Description