Investigations on active perception in speech production and hearing systems

Research Project

Project/Area Number	11650425
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Measurement engineering
Research Institution	Wakayama University
Principal Investigator	KAWAHARA Hideki Wakayama University, Faculty of Systems Engineering, Professor, システム工学部, 教授 (40294300)
Project Period (FY)	1999 – 2001
Project Status	Completed (Fiscal Year 2001)
Budget Amount *help	¥3,400,000 (Direct Cost: ¥3,400,000) Fiscal Year 2001: ¥1,300,000 (Direct Cost: ¥1,300,000) Fiscal Year 2000: ¥1,200,000 (Direct Cost: ¥1,200,000) Fiscal Year 1999: ¥900,000 (Direct Cost: ¥900,000)
Keywords	fundamental frequency / group delay / general theory of motion control / high-quality speech analysis / high-quality speech synthesis / perception-to-production interactions / fixed-point / active control model / 不動 / 音声分析 / 音声合成 / 音声変換 / 声帯振動 / タイミング抽出 / 運動制御 / 歌声音声 / 発声聴覚相互作用 / 基本周波数抽出 / 感度解析 / 擬似白色信号 / 交換聴覚フィードバック / スプライン関数
Research Abstract	Precise procedures for source information extraction were invented for providing an indispensable basis for further investigations. The major research goal, development of an F0 (fundamental frequency) control model based on an integration of findings of the head investigator in his previous investigations and "general theory of motor control" proposed by Kawato et.al., was yielded using data acquired by the new procedures. A working prototype of the model, a scat rendering model, was implemented based on a very high-quality speech analysis, modification and synthesis method, STRAIGHT, developed by the head investigator. A part of these accomplishments was presented at the 141st meeting of the Acoustical Society of America and gave a strong impression on the participants. The impact resulted in an invitation to the 143rd meeting, which will be held in June 2002, as an invited talker. The new source information extraction procedures (for frequency domain, time domain and aperiodicity attributes) were also applied for developing procedures for voice quality analysis and resynthesis. The procedures were introduced at MAVEBA, an international conference held in Italy and also gave a strong impact. A typical example to demonstrate the power of the procedures is a paper presented at EUROSPEECH'2002 in Denmark. It illustrated that the new procedures are precise enough to detect systematic F0 glitches around consonant-vowel and vowel-consonant transitions and also powerful enough to penetrate into the origin of the phenomenon. In spite of the fact that the glitches are not caused by active interactions between perception and production, the finding and investigations have scientific importance and illustrates the usefulness of the procedures. Finally, the findings further suggest that a new, still vague, auditory information representation based on fixed-point concept will be emerged as an instantiation of a generalized auditory processing principle.

Report

(4 results)

2001 Annual Research Report Final Research Report Summary
2000 Annual Research Report
1999 Annual Research Report

Research Products
(16 results)

All Other

All Publications (16 results)

[Publications] 河原英紀, 片寄晴弘: "高品質音声分析変換合成システムSTRAIGHTを用いたスキャット生成研究の提案"情報処理学会論文誌. 43. 208-218 (2002)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Osamu Fujimura, Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc. MAVEBA. (CD-ROM). (2001)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Parham Zolfaghari: "Systematic FO glitches around vowel nasal transitions"Proc. EUROSPEECH'2001. 5. 2459-2462 (2001)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara et al.: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay"Proc. ICSLP'2000. 1. 664-667 (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara et al.: "Auditory event detection based on a time-domain fixed point analysis"Proc. WESTPRAC VII. 1. 255-258 (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara et al.: "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of FO and periodicity"Proc. EUROSPEECH'1999. 6. 2781-2784 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara and Haruhiro Katayose: "Scat generation research program based on STRAIGHT, a high-quality speech analysis, modification and synthesis system"Journal of the Information Processing Society of Japan. 43. 208-218 (2002)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Osamu Fujimura and Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc. MAVEBA, Italy. (CD-ROM). (2002)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara and Parham Zolfaghari: "Systematic F0 glitches around vowel nasal transitions"Proc. EUROSPEECH'2001. 5. 2459-2462 (2001)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Yoshinori Atake and Parham Zolfaghari: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay"Proc. ICSLP'2000. 1. 664-667 (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Parham Zolfaghari and Yoshinori Atake: "Auditory event detection based on a time-domain fixed point analysis"Proc. WESTPRAC VII. 1. 255-258 (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] Hideki Kawahara, Haruhiro Katayose, Alain de Cheveigne and Roy Patterson: "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity"Proc. EUROSPEECH'99. 6. 2781-2784 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2001 Final Research Report Summary
[Publications] 河原英紀, 片寄晴弘: "高品質音声分析変換合成システムSTRAIGHTを用いたスキャット生成研究の提案"情報処理学会論文誌. 43・2. 208-218 (2002)
- Related Report
  2001 Annual Research Report
[Publications] Hideki Kawahara, Osamu Fujimura, Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc.MAVEBA. (CD-ROM). (2001)
- Related Report
  2001 Annual Research Report
[Publications] Hideki Kawahara, Parham Zolfaghari: "Systematic FO glitches around vowel nasal transitions"Proc.EUROSPEECH'2001. 2459-2642 (2001)
- Related Report
  2001 Annual Research Report
[Publications] 河原英紀他: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to average group delay"Proc.ICSLP-2000. IV. 664-667 (2000)
- Related Report
  2000 Annual Research Report

Investigations on active perception in speech production and hearing systems

Principal Investigator

KAWAHARA Hideki Wakayama University, Faculty of Systems Engineering, Professor, システム工学部, 教授 (40294300)

¥3,400,000 (Direct Cost: ¥3,400,000)

Report

Research Products

[Publications] 河原英紀, 片寄晴弘: "高品質音声分析変換合成システムSTRAIGHTを用いたスキャット生成研究の提案"情報処理学会論文誌. 43. 208-218 (2002)

Description

Related Report

[Publications] Hideki Kawahara, Osamu Fujimura, Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc. MAVEBA. (CD-ROM). (2001)

Description

Related Report

[Publications] Hideki Kawahara, Parham Zolfaghari: "Systematic FO glitches around vowel nasal transitions"Proc. EUROSPEECH'2001. 5. 2459-2462 (2001)

Description

Related Report

[Publications] Hideki Kawahara et al.: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay"Proc. ICSLP'2000. 1. 664-667 (2000)

Description

Related Report

[Publications] Hideki Kawahara et al.: "Auditory event detection based on a time-domain fixed point analysis"Proc. WESTPRAC VII. 1. 255-258 (2000)

Description

Related Report

[Publications] Hideki Kawahara et al.: "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of FO and periodicity"Proc. EUROSPEECH'1999. 6. 2781-2784 (1999)

Description

Related Report

[Publications] Hideki Kawahara and Haruhiro Katayose: "Scat generation research program based on STRAIGHT, a high-quality speech analysis, modification and synthesis system"Journal of the Information Processing Society of Japan. 43. 208-218 (2002)

Description

Related Report

[Publications] Hideki Kawahara, Osamu Fujimura and Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc. MAVEBA, Italy. (CD-ROM). (2002)

Description

Related Report

[Publications] Hideki Kawahara and Parham Zolfaghari: "Systematic F0 glitches around vowel nasal transitions"Proc. EUROSPEECH'2001. 5. 2459-2462 (2001)

Description

Related Report

[Publications] Hideki Kawahara, Yoshinori Atake and Parham Zolfaghari: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay"Proc. ICSLP'2000. 1. 664-667 (2000)

Description

Related Report

[Publications] Hideki Kawahara, Parham Zolfaghari and Yoshinori Atake: "Auditory event detection based on a time-domain fixed point analysis"Proc. WESTPRAC VII. 1. 255-258 (2000)

Description

Related Report

[Publications] Hideki Kawahara, Haruhiro Katayose, Alain de Cheveigne and Roy Patterson: "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity"Proc. EUROSPEECH'99. 6. 2781-2784 (1999)

Description

Related Report

[Publications] 河原英紀, 片寄晴弘: "高品質音声分析変換合成システムSTRAIGHTを用いたスキャット生成研究の提案"情報処理学会論文誌. 43・2. 208-218 (2002)

Related Report

[Publications] Hideki Kawahara, Osamu Fujimura, Jo Estill: "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"Proc.MAVEBA. (CD-ROM). (2001)

Related Report

[Publications] Hideki Kawahara, Parham Zolfaghari: "Systematic FO glitches around vowel nasal transitions"Proc.EUROSPEECH'2001. 2459-2642 (2001)

Related Report

[Publications] 河原英紀 他: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to average group delay"Proc.ICSLP-2000. IV. 664-667 (2000)

Related Report

[Publications] 河原英紀他: "Accurate vocal event detection method based on a fixed-point analysis of mapping from time to average group delay"Proc.ICSLP-2000. IV. 664-667 (2000)