• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Fusion of array signal processing and facial image process-ing for hands-free robust communication and authentication.

Research Project

Project/Area Number 08680443
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field 情報システム学(含情報図書館学)
Research InstitutionOsaka Electro-Communication University

Principal Investigator

MATSUMURA Masafumi  Osaka Electro-Communication University, Faculty of Information Science and Technology, Associate Profes-sor., 情報工学部, 助教授 (80209618)

Project Period (FY) 1996 – 1998
Project Status Completed (Fiscal Year 1998)
Budget Amount *help
¥2,400,000 (Direct Cost: ¥2,400,000)
Fiscal Year 1998: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 1997: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 1996: ¥1,300,000 (Direct Cost: ¥1,300,000)
KeywordsAuthentication / Array signal processing / Color image processing / Facial image / 3D shape of vocal tract / Articulatory model / Dialog system / hands-free robust communication / HSV色空間 / 音声生成過程 / 三次元声道形状 / 有限要素法 / カラー顔画像 / 人物認識 / 口唇形状 / 調音結合 / ホルマント周波数
Research Abstract

Human speech sounds with speaker characteristics involve complex interactions of the larynx, nasal cavity, oral cavity, and oropharynx. Authentication technique can be used to verify the identity claimed by people accessing systems ; that is, it enables access control of various services by voice and face iniages.
In conventional approach, however, there is few study on fusion of array signal processing and facial image processing for hands-free robust communication and authentication. The primary aim of our research is to develop new techniques for automatically recognizing who is speaking by using speaker-specific information included in speech wave and facial image. Intelligent audiovisual sensing system with four-microphone-array and CCD camera has been developed for hands-free robust communication and authentication. Array signal processing performed accurate estimation of mouth position. An articulatory model of three-dimensional vocal tract obtained by magnetic resonance imaging was proposed for extraction of speaker-specific information. A Color image processing technique was proposed for face segmentation and facial feature detection. Our group has successfully developed and tested the intelligent audiovisual sensing system for hands-free robust communication and authentication. The proposed techniques have wide application in the areas of speech communication and in the study of human interaction.

Report

(4 results)
  • 1998 Annual Research Report   Final Research Report Summary
  • 1997 Annual Research Report
  • 1996 Annual Research Report
  • Research Products

    (15 results)

All Other

All Publications (15 results)

  • [Publications] 新川 拓也: "磁気共鳴映像法を用いた摩擦音発声時の三次元形状の計測" 電気学会論文誌C. Vol.118-C No.718. 1060-1065 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 新川 拓也: "有限要素法を用いた摩擦音発生時の声道内呼気流の推定" 電気学会論文誌C. 発表予定.

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] T.Niikawa: "Accurate measurement of three-dimensional shapes of vocal tract and dental crown using magnetic resonance imaging" Third Joint meeting of Acoustical society of America and Japan. 2pSC13. 867-872 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] T.Niikawa: "Acoustic Characteristics of three-dimensional vocal tract shapes measured by MRI during vowel production" Hokkaido Workshop on speech production. 1-2. 8-9 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] T.Niikawa: "FEM analysis of aspirated air flow in three-dimentional vocal tract during fricative consonant phonation" 5th International Conferene on Spoken Language Processing. Fr1R15. 3127-3130 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Takuya Niikawa, Masafumi Matsumura, Koji Shimizu, Yasuji Hashimoto, Takashi Tachimura and Takashi Wada: "Measurement of three-dimensional shapes of vocal tract using magnetic resonance imaging." Trans.IEE of Japan. Vol.118-C,No.7/8. 1060-1065 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Takuya Niikawa, Takashi Tachimura, Takeshi Wada, Masafumi Matsumura Hiroshi Umeo: "FEM analysis aspi-rated air flow in three-dimensional vocal tract dur-ing fricative consonant phonation." Trans.IEE of Japan. (To appear).

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Takuya Niikawa, Masafumi Matsumura, Takashi Tachimura, Takeshi Wada, Koji Shimizu, and Yasuji Hashimoto: "Accurate measurement of three-dimensional shapes of vocal tract and dental crown using mag-netic resonance imaging : Japanese fricative conso-nants." Third Joint meeting of Acoustical society of America and acoustical so-ciety of Japan. 2pSC13. 867-872 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Takuya Niikawa, Eri Kawano, Masafumi Matsumura, Takashi Tachimura, and Takeshi Wada: "Acoustic char-acteristics of three-dimensional vocal tract chapes measured by MRI during vowel production." Hokkaido Workshop on Speech Production. 1-2 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Takuya Niikawa, Masafumi Matsumura, Takashi Tachimura, and Takeshi Wada: "FEM analysis of aspi-rated air flow in three-dimensional vocal tract dur-ing fricative consonant phonation." 5th International Conference on Spoken Language Process-ing, Fr1R15. 3127-3130 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 新川拓也: "磁気共鳴影像法を用いた摩擦音発声時の三次元声道形状の計測" 電気学会論文誌. 118巻718号. 1060-1065 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Masafumi Matsumura: "Cantilever-type force-sensor-mounted platal plate for measuring palatolingual contact stress and pattern during speech phonation." Internation Conference on Spoken Language Processing. Tu4C4. (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Takuya Niikawa: "FEM analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonation." International Conference on Spoken Language Processing.Fr1R15. (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Takuya Niikawa: "Accurate measurement of three dimensional shapes of uocal tract and dental crown using magnetic resonance imaging" The Journal of the Acoustical Society of America. 100,No.4 Pt2. 2658-2658 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Masafumi Matsumura: "Measurement of palatolingnal contact stress and pattern doning consorant production using a force-sensor-mounted palatal plate" The Journal of the Acoustical Society of America. 100,No.4 Pt2. 2660-2660 (1996)

    • Related Report
      1996 Annual Research Report

URL: 

Published: 1996-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi