A Study on Efficient Speech Coding Systems with Auditory Filters

Research Project

Project/Area Number	11650392
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	情報通信工学
Research Institution	Chiba Institute of Technology
Principal Investigator	KOHATA Minoru Faculty of Engineering, Chiba Institute of Technology, Prof., 工学部, 教授 (30186720)
Project Period (FY)	1999 – 2000
Project Status	Completed (Fiscal Year 2000)
Budget Amount *help	¥1,700,000 (Direct Cost: ¥1,700,000) Fiscal Year 2000: ¥500,000 (Direct Cost: ¥500,000) Fiscal Year 1999: ¥1,200,000 (Direct Cost: ¥1,200,000)
Keywords	Speech coding / Auditory filter / Vocoder / Harmonic coding / Subjective speech quality / Information compression / ハーモニックコーディング
Research Abstract	In this study, a very low bit speech coder at 1.2 kbps is newly proposed. Like the LPC vocoder, it requures few types of information (power, pitch, and spectral information), but its quality is far superuor. In the proposed vocoder, the synthesized speech quality is improved based on auditory perceptualcharacterustics. The synthesis method is one of harmonic coding, using sinusoids whose frequencies are multiples of the fundamental frequency, where the amplitudes of the sinusoids are adaptively modulated using Gammatone filters as a perceptual weighting filter. The sinusoids' phases are also adjusted so as to maximize the perceptual quality. In order to reduce the total bit rate to 1.2 kbps, a new segment coder for spectral information (LSP coefficients) using DP matching is also proposed. The quality of the synthesized speech is considerably improved compared with that of the simple LPC vocoder, according to MOS and other preference tests.

Report

(3 results)

2000 Annual Research Report Final Research Report Summary
1999 Annual Research Report

Research Products
(4 results)

All Other

All Publications (4 results)

[Publications] M.Kohata,I,Mitsuya,M.Suzuki, S.Makino: "Efficient segment quantization of LSP coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. 2000.3. 826-829 (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] M.Kohata, I.Mitsuya, M.Suzuki, S.Mokino: "Efficient segment quantization of LSP coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. vol.2000.3. 826-829 (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2000 Final Research Report Summary
[Publications] M.Kohata,I.Mitsuya,M.Suzuki,S.Makino: "Efficient segment quantization of LSP Coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. 2000・3. 826-829 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 木幡稔: "正弦波重畳型ボコーダによる1.2kbit/s音声符号化方式"電子情報通信学会論文誌. J82-D-II 3__-. 340-349 (1999)
- Related Report
  1999 Annual Research Report

A Study on Efficient Speech Coding Systems with Auditory Filters

Principal Investigator

KOHATA Minoru Faculty of Engineering, Chiba Institute of Technology, Prof., 工学部, 教授 (30186720)

¥1,700,000 (Direct Cost: ¥1,700,000)

Report

Research Products

[Publications] M.Kohata,I,Mitsuya,M.Suzuki, S.Makino: "Efficient segment quantization of LSP coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. 2000.3. 826-829 (2000)

Description

Related Report

[Publications] M.Kohata, I.Mitsuya, M.Suzuki, S.Mokino: "Efficient segment quantization of LSP coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. vol.2000.3. 826-829 (2000)

Description

Related Report

[Publications] M.Kohata,I.Mitsuya,M.Suzuki,S.Makino: "Efficient segment quantization of LSP Coefficients for very low bit speech coding"Proc.Int.Conf.on Spoken Language Processing. 2000・3. 826-829 (2000)

Related Report

[Publications] 木幡 稔: "正弦波重畳型ボコーダによる1.2kbit/s音声符号化方式"電子情報通信学会論文誌. J82-D-II 3__-. 340-349 (1999)

Related Report

[Publications] 木幡稔: "正弦波重畳型ボコーダによる1.2kbit/s音声符号化方式"電子情報通信学会論文誌. J82-D-II 3__-. 340-349 (1999)