• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2003 Fiscal Year Final Research Report Summary

Development for speech interface for form -based in formation access services on Web

Research Project

Project/Area Number 13558033
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section展開研究
Research Field Intelligent informatics
Research InstitutionToyohashi University Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) KAI Atsuhiko  Toyohashi University, Faculty of Engineering, Associate Professor, 工学部, 助教授 (60283496)
KOBAYASHI Satoshi  Toyohashi University, Faculty of Engineering, Research Associate, 工学部, 助手 (90314096)
KITAOKA Norihide  Toyohashi University, Faculty of Engineering, Lecturer, 工学部, 講師 (10333501)
NAKANO Takashi  CAI, Co, Department of Enfineering, Research Leader, 技術部・リーダ(研究職)
ITOH Toshihilo  Shizuoka University, Faculty of Engineering, Research Associate, 情報学部, 助手 (20313926)
Project Period (FY) 2001 – 2003
KeywordsWWW / Information Retrieval / Form-based Input / Speech Input / Pen-based Input / Speech Recognition / Name Input / PDA
Research Abstract

While some speech interface systems have been developed for accessing Web resources, they are limited for accessing some specific contents and they don't provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system, which could be applied to many form-based information retrieval services of the WVVW. In particular, our system was implemented based on a client-server, a Web proxy-centered architecture and employed an information extraction and language processing of HTML documents for providing a general-purpose interface for many form-based WWW contents. We also performed some experiments by 12 subjects for the comparison of the usability under different usage conditions. As a result, the proposed system attained comparative and higher expected usability measures over the pen-touch input method under the condition of an ideal speech recognition performance, and could be expected to achieve the effectivenes … More s or the superiority over a pen touch-only interface in terms of the usability as their usage condition approaches to a realistic PDA usage condition.
We also proposed an. interface for a name input based on speech recognition using syllable-based N-gram and a word dictionary, which was frequently required to input into form-based web pages. User first utters a name and then chooses the correct word/syllables by pen touch from word/syllable candidates which were obtained from speech recognition. Name utterance is hard to recognize accurately because of the large vocabulary size, so the system uses continuous syllable recognition with syllable-based N-gram and isolated word recognition with a dictionary containing frequent words. The user can find the correct the answer from word candidates or syllable sequence candidates at a rate of 82-86%, and can input correct name at a rate of 94-96% with syllable selection from the syllable lattice. Some subjects used this interface and felt that it was useful. Less

  • Research Products

    (10 results)

All Other

All Publications (10 results)

  • [Publications] 松下雅彦: "音声入力によるWeb検索のためのキーワード認識・抽出法の検討"情報処理学会,音声言語情報処理. SLP-48(4). 21-28 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 押川洋徳: "音節N-gramと単語辞書併用による姓名入力インターフェース"情報処理学会,音声言語情報処理. SLP-49(30). 175-180 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 竹内真士: "韻律・表層的言語情報を発話タイミング制御に用いた雑認対話システム"情報処理学会,音声言語情報処理. SLP50-14. 87-92 (2004)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 梅田将満: "音声対話システムにおける移植性の高い汎用的意味理解部の構築"情報処理学会,自然言語処理研究会. (2004)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Satoru Kogure: "A development tool for spoken dialogue systems and its evaluation"Lecture Notes in Artificial Intelligence, (Springer). 2166. 373-380 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Masamitsu Umeda: "Interpreter for highly portable spoken dialogue system"Proc. 4-th Sigdial Workshop on Discourse and Dialogue. 105-114 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Masahiko Matsushita: "Keyword recognition and extraction for speech-driven Web retrieval task (in Japanese)"Information Processing Society of Japan. SLP48, 4. 21-28 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Hironori Oshikawa: "Speech interface for name input, using syllable N-gram and word dictionary (in Japanese)"Information Processing Society of Japan. SLP49, 30. 175-180 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Masashi Takeuchi: "A spoken dialog system activating the natural response timing using prosodic and linguistic information for chat-like conversation (in Japanese)"Information Processing Society of Japan. SLP50,14. 87-92 (2004)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Masamitsu Umeda: "Construction of highly portable general interpreter for the spoken dialogue system (in Japanese)"Information Processing Society of Japan. NL160. 93-100 (2004)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2005-04-19  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi