• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1997 Fiscal Year Final Research Report Summary

A Study on Extraction of Robust Parameters of Individuality for Speaker Recognition Based on Acoustic Theory of Speech Production

Research Project

Project/Area Number 08650420
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field 情報通信工学
Research InstitutionUtsunomiya University

Principal Investigator

YANG Chang-Sheng  Utsunomiya University Faculty of Engineering, Assistant, 工学部, 助手 (80272219)

Co-Investigator(Kenkyū-buntansha) KASUYA Hideki  Utsunomiya University Faculty of Engineering, Professor, 工学部, 教授 (20006240)
Project Period (FY) 1996 – 1997
KeywordsMR Images / Vocal Tract Shapes / Vocal Tract Length / Individuality / Vocal Tract Parameters / Source Parameters / Source-Filter Model / Sub-Space Method
Research Abstract

In order to elucidate the relation among the anatomical structure of the vocal tract, the acoustic parameters, and the auditory perception, we have developed a new method to measure three dimensional vocal tract shapes from magnetic resonance (MR) images of sustained vowels. The dimensions of the vocal tract shapes of five Japanese vowels phonated by three adult males, three females and a boy are measured from the MR images. Differences of vocal tract shapes and acoustic parameters among male, female and child are systematically investigated. The result showed that the non-uniformity in the dimensions of the vocal tract shape had no essential effect on the acoustic characteristics.
In this research individualities of the vocal tract shape of vowels measured from MR images of males and females were discussed. Differences in dimensions of the vocal tract of the subjects and their effects on acoustic characteristics were investigated. Perceptual similarity tests of vowel quality showed tha … More t normalization of vowels from females to males could be made by relying largely on the vocal tract length.
Vowels of the males were carefully compared at the articulatory and acoustic levels. The result suggested that, for an identical vowel of males, the "invariance" of the phonation may be acoustic parameters (the first three formant frequencies F1, F2 and F3, which are important in auditory perception) rather than the articulatory simulation. It was also showed that the higher formant frequencies (F4, F5) are stable factors of speaker individuality.
To stably estimate speaker individual parameters from a speech signal, a source-filter model was introduced to represent the speech production, in which a speech signal is regarded as the output of a filter (the vocal tract) excited by a sound source.A novel speech analysis method was proposed to analyze the model parameters by using a direct subspace-based state-space system identification algorithm. Experimental results showed that not only the vocal tract parameters including the higher formant frequencies but also the source parameters can be estimated quite well.
In addition to speaker recognition, the results of this project can be expected to be applied in almost all the speech research areas, such as synthesis, perception, voice conversion, coding, recognition. Less

  • Research Products

    (12 results)

All Other

All Publications (12 results)

  • [Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conferenceon Acoustics,Speech.and Signal Processing. (印刷中). (1998)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集集. I. 291-292 (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 楊 長盛, 粕谷 英樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集集. I. 259-260 (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by manetic resonanceimages" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing. (in print.). (1998)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] C.S.Yang and H.Kasuya: "A study on estimation of poles and zeros for voiced consonant by using subspace method" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 283-284 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Estimation of source and vocal tract parameters by using subspace method" Proceeding of The 1997 Autumn Meeting of the Acoustical Society of Japan. 291-292 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel : evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics, Information and Communication Engineers. SP96-120. 43-48 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of vowels" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 259-260 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. Vol.2. 949-952 (1996)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1999-03-16  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi