A Study on Extraction of Robust Parameters of Individuality for Speaker Recognition Based on Acoustic Theory of Speech Production

Research Project

Project/Area Number	08650420
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	情報通信工学
Research Institution	Utsunomiya University
Principal Investigator	YANG Chang-Sheng Utsunomiya University Faculty of Engineering, Assistant, 工学部, 助手 (80272219)
Co-Investigator(Kenkyū-buntansha)	KASUYA Hideki Utsunomiya University Faculty of Engineering, Professor, 工学部, 教授 (20006240)
Project Period (FY)	1996 – 1997
Project Status	Completed (Fiscal Year 1997)
Budget Amount *help	¥2,000,000 (Direct Cost: ¥2,000,000) Fiscal Year 1997: ¥600,000 (Direct Cost: ¥600,000) Fiscal Year 1996: ¥1,400,000 (Direct Cost: ¥1,400,000)
Keywords	MR Images / Vocal Tract Shapes / Vocal Tract Length / Individuality / Vocal Tract Parameters / Source Parameters / Source-Filter Model / Sub-Space Method / 声道形状 / 声道長 / ホルマント周波数 / 音質(phonetic quality) / 個人性特徴
Research Abstract	In order to elucidate the relation among the anatomical structure of the vocal tract, the acoustic parameters, and the auditory perception, we have developed a new method to measure three dimensional vocal tract shapes from magnetic resonance (MR) images of sustained vowels. The dimensions of the vocal tract shapes of five Japanese vowels phonated by three adult males, three females and a boy are measured from the MR images. Differences of vocal tract shapes and acoustic parameters among male, female and child are systematically investigated. The result showed that the non-uniformity in the dimensions of the vocal tract shape had no essential effect on the acoustic characteristics. In this research individualities of the vocal tract shape of vowels measured from MR images of males and females were discussed. Differences in dimensions of the vocal tract of the subjects and their effects on acoustic characteristics were investigated. Perceptual similarity tests of vowel quality showed tha … More t normalization of vowels from females to males could be made by relying largely on the vocal tract length. Vowels of the males were carefully compared at the articulatory and acoustic levels. The result suggested that, for an identical vowel of males, the "invariance" of the phonation may be acoustic parameters (the first three formant frequencies F1, F2 and F3, which are important in auditory perception) rather than the articulatory simulation. It was also showed that the higher formant frequencies (F4, F5) are stable factors of speaker individuality. To stably estimate speaker individual parameters from a speech signal, a source-filter model was introduced to represent the speech production, in which a speech signal is regarded as the output of a filter (the vocal tract) excited by a sound source.A novel speech analysis method was proposed to analyze the model parameters by using a direct subspace-based state-space system identification algorithm. Experimental results showed that not only the vocal tract parameters including the higher formant frequencies but also the source parameters can be estimated quite well. In addition to speaker recognition, the results of this project can be expected to be applied in almost all the speech research areas, such as synthesis, perception, voice conversion, coding, recognition. Less

Report

(3 results)

1997 Annual Research Report Final Research Report Summary
1996 Annual Research Report

Research Products

(18 results)

All Other

All Publications (18 results)

[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conferenceon Acoustics,Speech.and Signal Processing. (印刷中). (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] 楊長盛, 粕谷英樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] 楊長盛, 粕谷英樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集集. I. 291-292 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] 楊長盛, 粕谷英樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集集. I. 259-260 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by manetic resonanceimages" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing. (in print.). (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "A study on estimation of poles and zeros for voiced consonant by using subspace method" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 283-284 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Estimation of source and vocal tract parameters by using subspace method" Proceeding of The 1997 Autumn Meeting of the Acoustical Society of Japan. 291-292 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel : evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics, Information and Communication Engineers. SP96-120. 43-48 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of vowels" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 259-260 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. Vol.2. 949-952 (1996)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1997 Final Research Report Summary
[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics,Speech,and Signal Processing. (印刷中). (1998)
- Related Report
  1997 Annual Research Report
[Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)
- Related Report
  1997 Annual Research Report
[Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集. I. 291-292 (1997)
- Related Report
  1997 Annual Research Report
[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)
- Related Report
  1997 Annual Research Report
[Publications] 楊長盛、粕谷秀樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集. I. 259-260 (1997)
- Related Report
  1997 Annual Research Report
[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)
- Related Report
  1997 Annual Research Report

A Study on Extraction of Robust Parameters of Individuality for Speaker Recognition Based on Acoustic Theory of Speech Production

Principal Investigator

YANG Chang-Sheng Utsunomiya University Faculty of Engineering, Assistant, 工学部, 助手 (80272219)

¥2,000,000 (Direct Cost: ¥2,000,000)

Report

Research Products

[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conferenceon Acoustics,Speech.and Signal Processing. (印刷中). (1998)

Description

Related Report

[Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

Description

Related Report

[Publications] 楊 長盛, 粕谷 英樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集集. I. 291-292 (1997)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)

Description

Related Report

[Publications] 楊 長盛, 粕谷 英樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集集. I. 259-260 (1997)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by manetic resonanceimages" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing. (in print.). (1998)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "A study on estimation of poles and zeros for voiced consonant by using subspace method" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 283-284 (1998)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Estimation of source and vocal tract parameters by using subspace method" Proceeding of The 1997 Autumn Meeting of the Acoustical Society of Japan. 291-292 (1997)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel : evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics, Information and Communication Engineers. SP96-120. 43-48 (1997)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of vowels" Proceeding of The 1998 Spring Meeting of the Acoustical Society of Japan. 259-260 (1997)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. Vol.2. 949-952 (1996)

Description

Related Report

[Publications] C.S.Yang and H.Kasuya: "Automatic estimation of formant and voice source parameters using a subspace based algorithm" Proceeding of IEEE International Conference on Acoustics,Speech,and Signal Processing. (印刷中). (1998)

Related Report

[Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

Related Report

[Publications] 楊長盛、粕谷秀樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集. I. 291-292 (1997)

Related Report

[Publications] C.S.Yang and H.Kasuya: "Invariance and individuality of the vowel:evidence from articulatory and acoustic observations" Technical Report of the Institute of Electronics,Information and Communication Engineers. SP96-120. 43-48 (1997)

Related Report

[Publications] 楊長盛、粕谷秀樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集. I. 259-260 (1997)

Related Report

[Publications] C.S.Yang and H.Kasuya: "Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images" Proceeding of International Conference on Spoken Language Processing. 2. 949-952 (1996)

Related Report

[Publications] 楊長盛, 粕谷英樹: "部分空間法を用いた有声子音の極・零の推定に関する検討" 日本音響学会平成10年度春季研究発表会講演論文集. I. 283-284 (1998)

[Publications] 楊長盛, 粕谷英樹: "部分空間法を用いた音源・声道パラメータの推定法" 日本音響学会平成9年度秋季研究発表会講演論文集集. I. 291-292 (1997)

[Publications] 楊長盛, 粕谷英樹: "母音の不変性と個人性" 日本音響学会平成9年度春季研究発表会講演論文集集. I. 259-260 (1997)