• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A Study on Extraction of Robust Parameters of Individuality for Speaker Recognition Based on Acoustic Theory of Speech Production

Research Project

Project/Area Number 08650420
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field 情報通信工学
Research InstitutionUtsunomiya University

Principal Investigator

YANG Chang-Sheng  Utsunomiya University Faculty of Engineering, Assistant, 工学部, 助手 (80272219)

Co-Investigator(Kenkyū-buntansha) KASUYA Hideki  Utsunomiya University Faculty of Engineering, Professor, 工学部, 教授 (20006240)
Project Period (FY) 1996 – 1997
Project Status Completed (Fiscal Year 1997)
Budget Amount *help
¥2,000,000 (Direct Cost: ¥2,000,000)
Fiscal Year 1997: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 1996: ¥1,400,000 (Direct Cost: ¥1,400,000)
KeywordsMR Images / Vocal Tract Shapes / Vocal Tract Length / Individuality / Vocal Tract Parameters / Source Parameters / Source-Filter Model / Sub-Space Method / 声道形状 / 声道長 / ホルマント周波数 / 音質(phonetic quality) / 個人性特徴
Research Abstract

In order to elucidate the relation among the anatomical structure of the vocal tract, the acoustic parameters, and the auditory perception, we have developed a new method to measure three dimensional vocal tract shapes from magnetic resonance (MR) images of sustained vowels. The dimensions of the vocal tract shapes of five Japanese vowels phonated by three adult males, three females and a boy are measured from the MR images. Differences of vocal tract shapes and acoustic parameters among male, female and child are systematically investigated. The result showed that the non-uniformity in the dimensions of the vocal tract shape had no essential effect on the acoustic characteristics.
In this research individualities of the vocal tract shape of vowels measured from MR images of males and females were discussed. Differences in dimensions of the vocal tract of the subjects and their effects on acoustic characteristics were investigated. Perceptual similarity tests of vowel quality showed tha … More t normalization of vowels from females to males could be made by relying largely on the vocal tract length.
Vowels of the males were carefully compared at the articulatory and acoustic levels. The result suggested that, for an identical vowel of males, the "invariance" of the phonation may be acoustic parameters (the first three formant frequencies F1, F2 and F3, which are important in auditory perception) rather than the articulatory simulation. It was also showed that the higher formant frequencies (F4, F5) are stable factors of speaker individuality.
To stably estimate speaker individual parameters from a speech signal, a source-filter model was introduced to represent the speech production, in which a speech signal is regarded as the output of a filter (the vocal tract) excited by a sound source.A novel speech analysis method was proposed to analyze the model parameters by using a direct subspace-based state-space system identification algorithm. Experimental results showed that not only the vocal tract parameters including the higher formant frequencies but also the source parameters can be estimated quite well.
In addition to speaker recognition, the results of this project can be expected to be applied in almost all the speech research areas, such as synthesis, perception, voice conversion, coding, recognition. Less

Report

(3 results)
  • 1997 Annual Research Report   Final Research Report Summary
  • 1996 Annual Research Report
  • Research Products

    (18 results)

All Other

All Publications (18 results)

  • [Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conferenceon Acoustics,Speech.and Signal Processing. (印刷中). (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集集. I. 291-292 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 楊 長盛, 粕谷 英樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集集. I. 259-260 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by manetic resonanceimages" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing. (in print.). (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "A study on estimation of poles and zeros for voiced consonant by using subspace method" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 283-284 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Estimation of source and vocal tract parameters by using subspace method" Proceeding of The 1997 Autumn Meeting of the Acoustical Society of Japan. 291-292 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel : evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics, Information and Communication Engineers. SP96-120. 43-48 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of vowels" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 259-260 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. Vol.2. 949-952 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics,Speech,and Signal Processing. (印刷中). (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集. I. 291-292 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 楊長盛、粕谷秀樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集. I. 259-260 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)

    • Related Report
      1997 Annual Research Report

URL: 

Published: 1996-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi