
Research on the word recognition method based on voice and lip shape movements in very noisy circumstances

Research Project

Project/Area Number 11650426
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Measurement engineering
Research Institution Tottori University

Principal Investigator

KONISHI Ryosuke  Tottori University, Faculty of Engineering, Department of Electrical and Electronic Engineering, Professor (00032269)

Co-Investigator (Kenkyū-buntansha) SUGAHARA Kazunori  Tottori University, Faculty of Engineering, Department of Electrical and Electronic Engineering, Associate Professor (90149948)
Project Period (FY) 1999 – 2000
Project Status Completed (Fiscal Year 2000)
Budget Amount
¥3,300,000 (Direct Cost: ¥3,300,000)
Fiscal Year 2000: ¥1,200,000 (Direct Cost: ¥1,200,000)
Fiscal Year 1999: ¥2,100,000 (Direct Cost: ¥2,100,000)
Keywords Word recognition / Lip shape movements / Active contour model / High-noise environment / Lip shape extraction / HMM
Research Abstract

Word recognition techniques have been investigated by many researchers. These studies have examined many characteristic parameters of the speech signal and proposed many recognition methods; cepstrum parameters and the hidden Markov model (HMM) are representative examples.
In quiet conditions, these methods recognize speech spoken at natural speed with a relatively high recognition rate. In noisy conditions, however, it is still difficult to achieve a high word recognition rate with methods that depend only on auditory information.
On the other hand, it is well known that humans can understand what another person is saying just by watching mouth movements, without any auditory information. This ability is called lip reading. A word recognition system based on both voice and lip shape movements can therefore be expected to offer not only an effective means of word recognition in very noisy conditions but also an easy method of human-machine communication.
To construct such a system, the following problems must be solved.
1. Lip shapes must be extracted very quickly and accurately from a series of face images.
2. Parameters that describe lip shapes must be investigated.
3. An accurate recognition method is required.
Because of these problems, word recognition systems that use lip shape movements have not been developed until now. In this research, we tried to develop a real-time word recognition system based on voice and lip shape movements on a commercially available personal computer. To achieve fast and accurate operation, a technique called the modified Sampled Active Contour Model (modified SACM) is adopted to extract lip shapes. A new parameter is proposed to describe the extracted lip shapes, and the lip shape movements are recognized by an HMM using the proposed parameter.
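
The HMM recognition step mentioned above can be illustrated with a minimal sketch. It is not the project's implementation: the abstract does not specify the lip shape parameter, the HMM topology, or the vocabulary, so the two-symbol observations (0 = lips closed, 1 = lips open), the two-state models, and the names `log_forward`, `recognize`, and `MODELS` are all assumptions made for illustration. The sketch scores a quantized lip shape sequence against one discrete HMM per word with the scaled forward algorithm and picks the best-scoring word.

```python
import math

def log_forward(obs, start, trans, emit):
    """Log-likelihood of a discrete observation sequence under one HMM
    (forward algorithm with per-step scaling to avoid underflow)."""
    n = len(start)
    log_p = 0.0
    alpha = []
    for t, symbol in enumerate(obs):
        if t == 0:
            alpha = [start[j] * emit[j][symbol] for j in range(n)]
        else:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][symbol]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        log_p += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_p

def recognize(obs, models):
    """Return the word whose HMM gives the observation the highest likelihood."""
    return max(models, key=lambda word: log_forward(obs, *models[word]))

# Hypothetical left-to-right word models: (start, transition, emission).
# Symbols are assumed quantized lip shapes: 0 = closed, 1 = open.
MODELS = {
    "word_a": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.9, 0.1], [0.1, 0.9]]),
    "word_b": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.1, 0.9], [0.9, 0.1]]),
}

if __name__ == "__main__":
    print(recognize([0, 0, 1, 1], MODELS))  # closed then open
    print(recognize([1, 1, 0, 0], MODELS))  # open then closed
```

In a real system the observation symbols would come from the lip shape parameters computed on each extracted contour, and the model probabilities would be trained from recorded utterances rather than written by hand.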

Report

(3 results)
  • 2000 Annual Research Report   Final Research Report Summary
  • 1999 Annual Research Report
  • Research Products

    (21 results)

All Publications (21 results)

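  • [Publications] Kazunori SUGAHARA: "Real-Time Realization of a Lip Reading System on a Personal Computer" Transactions of the Society of Instrument and Control Engineers. 36. 1145-1151 (2000)

    • Description
      From "Final Research Report Summary (Japanese)"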
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Kazunori SUGAHARA: "Personal Computer Based Real Time Lip Reading System" 2000 5th Int. Conf. Signal Processing. 1341-1346 (2000)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2000 Final Research Report Summary
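  • [Publications] Toshimi SHINCHI: "On the Construction of a Word Recognition System Using Both Image and Voice Information" Technical Report of the Institute of Electronics, Information and Communication Engineers. CAS98-66. 37-44 (1999)

    • Description
      From "Final Research Report Summary (Japanese)"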
    • Related Report
      2000 Final Research Report Summary
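  • [Publications] Makoto KISHINO: "Automation of Word Utterance Period Extraction Using Image Information" Proceedings of the 16th Sensing Forum. 45-50 (1999)

    • Description
      From "Final Research Report Summary (Japanese)"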
    • Related Report
      2000 Final Research Report Summary
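  • [Publications] Makoto KISHINO: "Application of Utterance Period Extraction Using Image Information to a Word Recognition System" Institute of Electrical Engineers of Japan, Technical Meeting on Sensor System Application Technology. 25-30 (1999)

    • Description
      From "Final Research Report Summary (Japanese)"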
    • Related Report
      2000 Final Research Report Summary
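  • [Publications] Kazunori SUGAHARA: "Real-Time Realization of a Word Recognition System Incorporating Image Information" Technical Report of the Institute of Electronics, Information and Communication Engineers. PRMU-269. 57-63 (2000)

    • Description
      From "Final Research Report Summary (Japanese)"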
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Kazunori SUGAHARA, Toshimi SHINCHI, Makoto KISHINO, Ryosuke KONISHI: "Real Time Realization of Lip Reading System on the Personal Computer" Transactions of the Society of Instrument and Control Engineers. Vol.36, No.12. 1145-1151 (2000)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Kazunori SUGAHARA, Makoto KISHINO, Ryosuke KONISHI: "Personal Computer Based Real Time Lip Reading" 2000 5th International Conference on Signal Processing Proceedings, Beijing. 1341-1346 (2000)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Makoto KISHINO, Masahiro OKI, Tomoyuki OSAKI, Kazunori SUGAHARA, Ryosuke KONISHI: "A Word Spotting Method by Using Image Data" Proceedings of the 16th SICE Sensing Forum. 45-50 (1999)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Toshimi SHINCHI, Youichi HAYASHIGUCHI, Makoto KISHINO, Kazunori SUGAHARA, Ryosuke KONISHI: "On the Word Recognition System using Image and Voice Information" Technical Report of the Institute of Electronics, Information and Communication Engineers. Vol.CAS98-66. 37-44 (1999)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Makoto KISHINO, Masahiro OKI, Tomoyuki OSAKI, Kazunori SUGAHARA, Ryosuke KONISHI: "Extraction of Word-Speaking Period by Using Image Data and its Application to Real Time Word Recognition System" Technical Report of the Institute of Electrical Engineers of Japan. Vol.PRMU99-269. 57-63 (2000)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Kazunori SUGAHARA, Masanobu WASHIO, Makoto KISHINO, Ryosuke KONISHI: "Symbolic circuit analyzing system using network environment" Technical Report of the Institute of Electronics, Information and Communication Engineers. Vol.PRMU99-269. 57-63 (2000)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2000 Final Research Report Summary
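  • [Publications] Kazunori SUGAHARA: "Real-Time Realization of a Lip Reading System on a Personal Computer" Transactions of the Society of Instrument and Control Engineers. 36. 1145-1151 (2000)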

    • Related Report
      2000 Annual Research Report
  • [Publications] Kazunori SUGAHARA: "Personal Computer Based Real Time Lip Reading System" 2000 5th Int. Conf. Signal Processing. 1341-1346 (2000)

    • Related Report
      2000 Annual Research Report
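  • [Publications] Toshimi SHINCHI: "On the Construction of a Word Recognition System Using Both Image and Voice Information" Technical Report of the Institute of Electronics, Information and Communication Engineers. CAS98-66. 37-44 (1999)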

    • Related Report
      2000 Annual Research Report
  • [Publications] Makoto KISHINO: "Automation of Word Utterance Period Extraction Using Image Information" Proceedings of the 16th Sensing Forum. 45-50 (1999)

    • Related Report
      2000 Annual Research Report
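  • [Publications] Makoto KISHINO: "Application of Utterance Period Extraction Using Image Information to a Word Recognition System" Institute of Electrical Engineers of Japan, Technical Meeting on Sensor System Application Technology. 25-30 (1999)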

    • Related Report
      2000 Annual Research Report
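  • [Publications] Kazunori SUGAHARA: "Real-Time Realization of a Word Recognition System Incorporating Image Information" Technical Report of the Institute of Electronics, Information and Communication Engineers. PRMU-269. 57-63 (2000)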

    • Related Report
      2000 Annual Research Report
  • [Publications] Makoto KISHINO: "Automation of Word Utterance Period Extraction Using Image Information" Proceedings of the 16th SICE Sensing Forum. 45-50 (1999)

    • Related Report
      1999 Annual Research Report
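  • [Publications] Makoto KISHINO: "Application of Word Utterance Period Extraction Using Image Information to a Word Recognition System" Papers of the Institute of Electrical Engineers of Japan, Technical Meeting on Sensor System Application Technology. 25-30 (1999)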

    • Related Report
      1999 Annual Research Report
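  • [Publications] Kazunori SUGAHARA: "Real-Time Realization of a Word Recognition System Incorporating Image Information" Institute of Electronics, Information and Communication Engineers, Technical Committee on Pattern Recognition and Media Understanding. (to be presented). (2000)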

    • Related Report
      1999 Annual Research Report


Published: 1999-04-01   Modified: 2016-04-21  
