2003 Fiscal Year Final Research Report Summary

Development for speech interface for form -based in formation access services on Web

Research Project

Project/Area Number	13558033
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	展開研究
Research Field	Intelligent informatics
Research Institution	Toyohashi University Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University, Faculty of Engineering, Professor, 工学部, 教授 (20115893)
Co-Investigator(Kenkyū-buntansha)	KAI Atsuhiko Toyohashi University, Faculty of Engineering, Associate Professor, 工学部, 助教授 (60283496) KOBAYASHI Satoshi Toyohashi University, Faculty of Engineering, Research Associate, 工学部, 助手 (90314096) KITAOKA Norihide Toyohashi University, Faculty of Engineering, Lecturer, 工学部, 講師 (10333501) NAKANO Takashi CAI, Co, Department of Enfineering, Research Leader, 技術部・リーダ(研究職) ITOH Toshihilo Shizuoka University, Faculty of Engineering, Research Associate, 情報学部, 助手 (20313926)
Project Period (FY)	2001 – 2003
Keywords	WWW / Information Retrieval / Form-based Input / Speech Input / Pen-based Input / Speech Recognition / Name Input / PDA
Research Abstract	While some speech interface systems have been developed for accessing Web resources, they are limited for accessing some specific contents and they don't provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system, which could be applied to many form-based information retrieval services of the WVVW. In particular, our system was implemented based on a client-server, a Web proxy-centered architecture and employed an information extraction and language processing of HTML documents for providing a general-purpose interface for many form-based WWW contents. We also performed some experiments by 12 subjects for the comparison of the usability under different usage conditions. As a result, the proposed system attained comparative and higher expected usability measures over the pen-touch input method under the condition of an ideal speech recognition performance, and could be expected to achieve the effectivenes … More s or the superiority over a pen touch-only interface in terms of the usability as their usage condition approaches to a realistic PDA usage condition. We also proposed an. interface for a name input based on speech recognition using syllable-based N-gram and a word dictionary, which was frequently required to input into form-based web pages. User first utters a name and then chooses the correct word/syllables by pen touch from word/syllable candidates which were obtained from speech recognition. Name utterance is hard to recognize accurately because of the large vocabulary size, so the system uses continuous syllable recognition with syllable-based N-gram and isolated word recognition with a dictionary containing frequent words. The user can find the correct the answer from word candidates or syllable sequence candidates at a rate of 82-86%, and can input correct name at a rate of 94-96% with syllable selection from the syllable lattice. Some subjects used this interface and felt that it was useful. Less

Research Products
(10 results)

All Other

All Publications (10 results)

[Publications] 松下雅彦: "音声入力によるWeb検索のためのキーワード認識・抽出法の検討"情報処理学会,音声言語情報処理. SLP-48(4). 21-28 (2003)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 押川洋徳: "音節N-gramと単語辞書併用による姓名入力インターフェース"情報処理学会,音声言語情報処理. SLP-49(30). 175-180 (2003)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 竹内真士: "韻律・表層的言語情報を発話タイミング制御に用いた雑認対話システム"情報処理学会,音声言語情報処理. SLP50-14. 87-92 (2004)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 梅田将満: "音声対話システムにおける移植性の高い汎用的意味理解部の構築"情報処理学会,自然言語処理研究会. (2004)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Satoru Kogure: "A development tool for spoken dialogue systems and its evaluation"Lecture Notes in Artificial Intelligence, (Springer). 2166. 373-380 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Masamitsu Umeda: "Interpreter for highly portable spoken dialogue system"Proc. 4-th Sigdial Workshop on Discourse and Dialogue. 105-114 (2003)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)"Information Processing Society of Japan. SLP48, 4. 21-28 (2003)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)"Information Processing Society of Japan. SLP49, 30. 175-180 (2003)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)"Information Processing Society of Japan. SLP50,14. 87-92 (2004)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)"Information Processing Society of Japan. NL160. 93-100 (2004)
- Description
  「研究成果報告書概要(欧文)」より

2003 Fiscal Year Final Research Report Summary

Development for speech interface for form -based in formation access services on Web

Principal Investigator

NAKAGAWA Seiichi Toyohashi University, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

Research Products

[Publications] 松下雅彦: "音声入力によるWeb検索のためのキーワード認識・抽出法の検討"情報処理学会,音声言語情報処理. SLP-48(4). 21-28 (2003)

Description

[Publications] 押川洋徳: "音節N-gramと単語辞書併用による姓名入力インターフェース"情報処理学会,音声言語情報処理. SLP-49(30). 175-180 (2003)

Description

[Publications] 竹内真士: "韻律・表層的言語情報を発話タイミング制御に用いた雑認対話システム"情報処理学会,音声言語情報処理. SLP50-14. 87-92 (2004)

Description

[Publications] 梅田将満: "音声対話システムにおける移植性の高い汎用的意味理解部の構築"情報処理学会,自然言語処理研究会. (2004)

Description

[Publications] Satoru Kogure: "A development tool for spoken dialogue systems and its evaluation"Lecture Notes in Artificial Intelligence, (Springer). 2166. 373-380 (2001)

Description

[Publications] Masamitsu Umeda: "Interpreter for highly portable spoken dialogue system"Proc. 4-th Sigdial Workshop on Discourse and Dialogue. 105-114 (2003)

Description

[Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)"Information Processing Society of Japan. SLP48, 4. 21-28 (2003)

Description

[Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)"Information Processing Society of Japan. SLP49, 30. 175-180 (2003)

Description

[Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)"Information Processing Society of Japan. SLP50,14. 87-92 (2004)

Description

[Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)"Information Processing Society of Japan. NL160. 93-100 (2004)

Description