Development for speech interface for form-based information access services on Web

Research Project

Project/Area Number	13558033
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	Developmental Research
Research Field	Intelligent informatics
Research Institution	Toyohashi University of Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University of Technology, Faculty of Engineering, Professor (20115893)
Co-Investigator(Kenkyū-buntansha)	KAI Atsuhiko Toyohashi University of Technology, Faculty of Engineering, Associate Professor (60283496); KOBAYASHI Satoshi Toyohashi University of Technology, Faculty of Engineering, Research Associate (90314096); KITAOKA Norihide Toyohashi University of Technology, Faculty of Engineering, Lecturer (10333501); NAKANO Takashi CAI Co., Department of Engineering, Research Leader (research staff); ITOH Toshihiko Shizuoka University, Faculty of Engineering (Japanese record: 情報学部, Faculty of Informatics), Research Associate (20313926)
Project Period (FY)	2001 – 2003
Project Status	Completed (Fiscal Year 2003)
Budget Amount	¥6,700,000 (Direct Cost: ¥6,700,000) Fiscal Year 2003: ¥3,100,000 (Direct Cost: ¥3,100,000) Fiscal Year 2002: ¥3,600,000 (Direct Cost: ¥3,600,000)
Keywords	WWW / Information Retrieval / Form-based Input / Speech Input / Pen-based Input / Speech Recognition / Name Input / PDA / Spoken Name Recognition / Home Page / Web Information / Speech Interface / Restatement / Spoken Name Input / Spoken Place-Name Input
Research Abstract	Several speech interface systems have been developed for accessing Web resources, but they are limited to specific content and do not provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system that can be applied to many form-based information retrieval services on the WWW. The system is implemented on a client-server architecture centered on a Web proxy, and it applies information extraction and language processing to HTML documents to provide a general-purpose interface for many form-based WWW contents. We also ran experiments with 12 subjects to compare usability under different usage conditions. Under ideal speech recognition performance, the proposed system attained usability measures comparable to or higher than those of the pen-touch input method, and it can be expected to match or surpass a pen-touch-only interface in usability as the usage conditions approach those of realistic PDA use. We also proposed an interface for name input, which form-based Web pages frequently require, based on speech recognition using a syllable-based N-gram and a word dictionary. The user first utters a name and then chooses, by pen touch, the correct word or syllables from the candidates produced by speech recognition. Name utterances are hard to recognize accurately because of the large vocabulary size, so the system combines continuous syllable recognition using a syllable-based N-gram with isolated word recognition using a dictionary of frequent words. The user can find the correct answer among the word candidates or syllable sequence candidates at a rate of 82-86%, and can input the correct name at a rate of 94-96% by selecting syllables from the syllable lattice. Some subjects used this interface and felt that it was useful.
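The abstract does not say how the proxy extracts form information from HTML documents. As an illustration only, the following minimal sketch uses Python's standard `html.parser` to collect the text fields and selection lists of a page, which a speech front end could then map to prompts and recognition vocabularies. The class `FormFieldExtractor`, the function `extract_form_fields`, and the sample page are hypothetical names introduced here and are not taken from the project.

```python
from html.parser import HTMLParser


class FormFieldExtractor(HTMLParser):
    """Collect text inputs and <select> options found in an HTML page."""

    def __init__(self):
        super().__init__()
        self.fields = []      # one dict per form control, in document order
        self._select = None   # the <select> currently being parsed, if any
        self._option = None   # text pieces of the <option> being parsed

    def _finish_option(self):
        # Close the pending option (if any) and store its visible label.
        if self._option is not None:
            label = "".join(self._option).strip()
            if label:
                self._select["options"].append(label)
            self._option = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "input" and attrs.get("type", "text") == "text":
            self.fields.append(
                {"name": attrs.get("name"), "kind": "text", "options": []})
        elif tag == "select":
            self._select = {
                "name": attrs.get("name"), "kind": "select", "options": []}
        elif tag == "option" and self._select is not None:
            self._finish_option()  # </option> end tags are optional in HTML
            self._option = []

    def handle_data(self, data):
        if self._option is not None:
            self._option.append(data)

    def handle_endtag(self, tag):
        if tag == "option":
            self._finish_option()
        elif tag == "select" and self._select is not None:
            self._finish_option()
            self.fields.append(self._select)
            self._select = None


def extract_form_fields(html):
    """Return the form controls of `html` as a list of dicts."""
    parser = FormFieldExtractor()
    parser.feed(html)
    parser.close()
    return parser.fields


# Hypothetical sample page, used only to exercise the sketch.
SAMPLE_PAGE = """
<form action="/search">
  <input type="text" name="station">
  <select name="day">
    <option>Monday</option>
    <option>Tuesday</option>
  </select>
  <input type="submit" value="Go">
</form>
"""

if __name__ == "__main__":
    for field in extract_form_fields(SAMPLE_PAGE):
        print(field)
```

In a design like this, a selection list yields a small closed vocabulary for the recognizer, while a free-text field such as a name would need the open-vocabulary input described in the abstract.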

Report

(4 results)

2003 Annual Research Report / Final Research Report Summary
2002 Annual Research Report
2001 Annual Research Report

Research Products
(20 results)

All Publications (20 results)

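[Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP-48(4). 21-28 (2003)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  2003 Final Research Report Summary
[Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP-49(30). 175-180 (2003)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP50-14. 87-92 (2004)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)" Information Processing Society of Japan, SIG Natural Language Processing. (2004)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  2003 Final Research Report Summary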
[Publications] Satoru Kogure: "A development tool for spoken dialogue systems and its evaluation" Lecture Notes in Artificial Intelligence (Springer). 2166. 373-380 (2001)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masamitsu Umeda: "Interpreter for highly portable spoken dialogue system" Proc. 4th SIGdial Workshop on Discourse and Dialogue. 105-114 (2003)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)" Information Processing Society of Japan. SLP48, 4. 21-28 (2003)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
[Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)" Information Processing Society of Japan. SLP49, 30. 175-180 (2003)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)" Information Processing Society of Japan. SLP50, 14. 87-92 (2004)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
[Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)" Information Processing Society of Japan. NL160. 93-100 (2004)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  2003 Final Research Report Summary
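[Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP-48(4). 21-28 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP-49(30). 175-180 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)" Information Processing Society of Japan, Spoken Language Processing. SLP50-14. 87-92 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)" Information Processing Society of Japan, SIG Natural Language Processing. (2004)
- Related Report
  2003 Annual Research Report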
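[Publications] Daisuke Yamada, Norihide Kitaoka, Seiichi Nakagawa: "Speech recognition using features of sound source information (in Japanese)" IEEJ Transactions on Electronics, Information and Systems (C). 122-C, 12. 2028-2034 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Norihide Kitaoka, Naoko Kakutani, Seiichi Nakagawa: "Detection and recognition of restated utterances after misrecognition in spoken place-name input for car navigation systems (in Japanese)" IEEJ Transactions on Electronics, Information and Systems (C). 122-C, 12. 2020-2027 (2002)
- Related Report
  2002 Annual Research Report
[Publications] N. Takahashi, S. Nakagawa: "Syllable recognition using syllable-segment statistics and syllable-based HMM" Proc. Int. Conf. Spoken Language Processing. 2633-2636 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Nobutoshi Takahashi, Norihide Kitaoka, Seiichi Nakagawa: "Improvement of the continuous speech recognition system SPOJUS (in Japanese)" Proceedings of the Acoustical Society of Japan Meeting. 3,4,9. 145-146 (2003)
- Related Report
  2002 Annual Research Report
[Publications] Hironori Oshikawa, Norihide Kitaoka, Seiichi Nakagawa: "A speech input interface for entering arbitrary character strings in a Web browser (in Japanese)" Acoustical Society of Japan Spring Meeting. 217-218 (2002)
- Related Report
  2001 Annual Research Report
[Publications] Naoko Kakutani, Norihide Kitaoka, Seiichi Nakagawa: "A method for detecting restated utterances after misrecognition in place-name input for car navigation systems (in Japanese)" Acoustical Society of Japan Spring Meeting. 107-108 (2002)
- Related Report
  2001 Annual Research Report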
