Project/Area Number |
08650418
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
情報通信工学
|
Research Institution | Tohoku UNIVERSITY |
Principal Investigator |
KOHATA Minoru Tohoku university, Garduate school of engineering, Associate Prof., 大学院・工学研究科, 助教授 (30186720)
|
Project Period (FY) |
1996 – 1997
|
Project Status |
Completed (Fiscal Year 1997)
|
Budget Amount *help |
¥1,700,000 (Direct Cost: ¥1,700,000)
Fiscal Year 1997: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 1996: ¥1,200,000 (Direct Cost: ¥1,200,000)
|
Keywords | speech coding / Very low bit coding / fractal systems / perceptual coding / auditory filter |
Research Abstract |
A very low bit rate speechcoder at 1.2kbpsis newly proposedln this coder which is similar to the LPC vocoder, speechsignal is synthesizedusing sinusoids whose frequenciesare multiple of the fundamental frequency, and whose amplitudes are adaptively modulated based on auditory perceptualcharacteristics, in order to improve the quality. In this study, the basics of the new sinusoids synthesizerare introduced, thenhow to modulate the sinusoids based on perceptual characteristics is explained. The auditory perceptual characteristics are simulated by Gammatonefilters. And a new segment coder of spectral information (LSP coefficients) using DP matchingis also proposedtoreduce the total bit rate to 1.2kbps. According to MOS test (5 grades), the subjectivequality of the synthesized speech is improved by 1.0 compared with that of the simple LPC vocoder.
|