Project/Area Number |
15300061
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Wakayama University |
Principal Investigator |
IRINO Toshio Wakayama university, Faculty of Systems Engineering, Professor, システム工学部, 教授 (20346331)
|
Co-Investigator(Kenkyū-buntansha) |
KAWAHARA Hideki Wakayama university, Faculty of Systems Engineering, Professor, システム工学部, 教授 (40294300)
AKAGI Masato Japan Advanced Institute of Science and Technology, School of Information Science, Professor, 情報科学研究科, 教授 (20242571)
UNOKI Masashi Japan Advanced Institute of Science and Technology, School of Information Science, Associate Professor, 情報科学研究科, 助教授 (00343187)
NISHIURA Takanobu Ritsumeikan university, College of Information Science and Engineering, Associate Professor, 情報理工学部, 助教授 (70343275)
BANNO Hideki Meijo university, Faculty of Science and Technology, Associate Professor, 理工学部, 講師 (20335003)
|
Project Period (FY) |
2003 – 2005
|
Project Status |
Completed (Fiscal Year 2005)
|
Budget Amount *help |
¥15,100,000 (Direct Cost: ¥15,100,000)
Fiscal Year 2005: ¥3,300,000 (Direct Cost: ¥3,300,000)
Fiscal Year 2004: ¥4,800,000 (Direct Cost: ¥4,800,000)
Fiscal Year 2003: ¥7,000,000 (Direct Cost: ¥7,000,000)
|
Keywords | Computational theory of audition / Gammachirp auditory filter / Scale theory / Wavelet-Mellin transform / High quality vocoder, STRAIGHT / Speech signal processing / Auditory scene analysis / Speech segregation, enhancement / マイクロホンアレー / 聴覚ボコーダ / STRAIGHT |
Research Abstract |
The purpose of this project was to construct computational theories of audition and to apply them to various sound signal processing techniques. There were four sub-goals : 1) The development of reliable auditory filterbanks ; 2) The experimental verification of a computational theory of auditory size-shape perception ; 3) The development of speech processing methods using vocoders ; and 4) Examining various speech applications and surveying their principles. 1)We determined a reasonable parameter set of compressive gammachirp (cGC) auditory filters by quantitatively fitting them to a thousand of human masking data. A dynamic version of the cGC filterbank (dcGC) was designed to account for the nonlinear characteristics of the cochlea within an analysis/synthesis framework, which is essential for various applications. The dcGC software will be available to the public on a web page as a de facto standard for auditory peripheral processing. 2)We performed perceptual experiments supporting t
… More
he computational theory of auditory size-shape perception. The experiments would be a milestone for the fixture studies in this new field. We organized a special session entitled "Size information in speech and animal calls" at the spring meeting of the Acoustical Society of America (ASA) in May 2005. We invited twelve speakers (seven from our group) to present the related works. 3)We improved an auditory vocoder for speech segregation and evaluated it with both objective and subjective measurements. We developed accurate and robust fundamental frequency methods that are important for this system. We improved the high quality analysis/synthesis system, STRAIGHT, applied it to auditory morphing, and developed a real-time version. 4)We developed several speech/acoustic processing methods : a speech recognition model using phoneme hypotheses ; a multi-microphone speech enhancement system ; a new robust acoustic measurement signal called "warped-TSP" ; and methods to deal with emotional speech signals and unexpected noise in a speech dialog system. We began using a statistical method to identify the neural circuits for directional sensitivity. Less
|