• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Study on a Speech Recognition-Synthesis System Based on Mel Cepstral Acoustic Processing and Multi-Level Knowledge Processing.

Research Project

Project/Area Number 61460131
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 電子通信系統工学
Research InstitutionTokyo Institute of Technology

Principal Investigator

IMAI Satoshi  Tokyo Institute of Technology, 精密工学研究所, 教授 (50016763)

Co-Investigator(Kenkyū-buntansha) FURUICHI Chieko  Tokyo Institute of Technology, 精密工学研究所, 助手 (90016783)
Project Period (FY) 1986 – 1987
Project Status Completed (Fiscal Year 1987)
Budget Amount *help
¥6,600,000 (Direct Cost: ¥6,600,000)
Fiscal Year 1987: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 1986: ¥5,800,000 (Direct Cost: ¥5,800,000)
KeywordsSpeech processing / Recognition-synthesis system / Segmentation into phonemic unit / Mel-cepstrum / Acoustic processing / Knowledge processing / Unbiased log spectral estimator / Improved cepstral methob / Log spectrum / Distance measure / Reside signal / Pattern Matching / 合成 / セグメンテーション / トップダウン / リンク構造
Research Abstract

It is shown through this research that the speech recognition-sythesis system based on the mel cepstral processing and the multi-level knowledge processing is very effective for establishing a natural human machine communication system.
The acoustic processor is an important component because the success of a speech recognition system mainly depends upon the performance of the acoustic-phonetic processor. We proposed an unbiased estimator of the log spectrum for the advanced acoustic processing. The unbiased log spectral estimation technique can extract an accurate and stable spectral envelope.
Using the several segmen tation parameters based on the unbiased log spectral estimate of speech signal, the segmentation of continuous Japanese speech into phonemic units can be successfully performed. The dynamic segmentation parameters obtained bt a qbtained by a quasi-derivative operation from the spectral envelope perameter are sufficiently stable for detecting phonemic boundaries. The perfor … More mance of the segmentation system was evaluated by processing the continuous, reading-rate speech samples uttered by 3 female and 3 male speakers. The segmentation error is 3.6%, consisting of 1.98% missed ans 1.58% extra for 1012 nominal count of phonemic units. The segmentation system is available to the reconition-synthesis system as a subsystem.
We compared the ceptral or mel cepstral distance measure with the traditional LPC cdistance measure. Form the the distance measure comparison, it is clarified that mel cepstral distance is much more effective in the word recoghition than the LPC cepstral or WLR distance measure.
For the purpose of realization of the rule-synthesis system for high quality speech, the lack of formulation for excitation source is a serious problem. In order to realize an intelligible and very high quality speech synthesis, an excitation signal with good properties is needed to replace the usual impulse train and M-sequence. We proposed a method of generating excitation signal with the spectral envelope and level according to the result obtained through the very short time spectral analysis. Less

Report

(2 results)
  • 1987 Final Research Report Summary
  • 1986 Annual Research Report
  • Research Products

    (22 results)

All Other

All Publications (22 results)

  • [Publications] 今井聖: 電子情報通信学会論文誌. J70-A. 471-480 (1987)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] IMAI, Satoshi: Proc. EUSIPCO-88. Sept. 4. (1988)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] 秋田昌憲: 電子通信学会論文誌. J69-A. 1464-1466 (1986)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] 今井聖: 電子情報通信学会論文誌.

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] 小林隆夫: 電子通信学会論文誌. J69-A. 1431-1438 (1986)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] 古市千枝子: 電子情報通信学会論文誌.

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] IMAI,Satoshi and FURUICHI,Chieko: "Unbiased Qestimation of log Spectrum." Trans,IEICE. J70-A. 471-480 (1987)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] TOKUDA,Keiichi KOBAYASHI,Takao and IMAI,Satoshi: "Cepstral Analysis with non-Uniform Spectral weighting for Spectral Envelope Extraction." Trans. IEICE. J70-A. 652-959 (1987)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] IMAI,Satoshi and FURUICHI,Chieko: "Unbiased Estimator of log Spectrum and its Application to Speech Signal Processing." Proc. EUSIPCO-88. (1988)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] AKITA,Masanori and IMAI,Satoshi: "Comparison of Weighting Functions for the Segmaentation of Sequences of Vowels" Trans.IECE. J69-A. 1464-1466 (1986)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] IMAI,Satoshi and FURUICHI,Chieko: "Segmentation of Continuous Speech into Phonemic Units." Trans. IEICE. (J71-A). ((1988))

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] KOBAYASHI,Takao and IMAI,Satoshi: "Generalized Cepstral Distance Measures" Trans.IEICE. J69-A. 1431-1438 (1986)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] KOBAYASHI,Takao KONDO,Shunichi and IMAI,Satoshi: "Evaluation of Generalized Cepstral Distance Measures for Isolated Word Recognition." Trans. IEICE. J71-A. (8) (1988)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] TOKUDA,Keiich, KObAYAShi,TAKaO and IMAI,Satoshi: "Recursion Formura for Calculation of Mel Gemeralized Cepstrum Coefficients." Trans. IEICE. J71-A. 128-131 (1988)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] FURUICHI,Chieko and IMAI,Satoshi: "Excitation Signal Generation for Rule-Synthesis of High-Quality Speech." Trans. IECICE. (J71-A). (8) ((1988))

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] Lee,Yanghee KOBAYASHI,Takao FURUICHI,Chieko and IMAI,Satoshi: "A conversion Rule for Phonetic Alternants in Korean Speech Synthesis-by-Rule." Trans. IECICE. J71-A. (9) (1988)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1987 Final Research Report Summary
  • [Publications] 秋田昌憲: 電子通信学会論文誌. J69-D. 1450-1458 (1986)

    • Related Report
      1986 Annual Research Report
  • [Publications] 秋田昌憲: 電子通信学会論文誌. J69-A. 1464-1466 (1986)

    • Related Report
      1986 Annual Research Report
  • [Publications] 小林隆夫: 電子通信学会論文誌. J69-7. 1431-1438 (1986)

    • Related Report
      1986 Annual Research Report
  • [Publications] 謝景棠: 電子通信学会技術報告. SP-86-85. 41-47 (1987)

    • Related Report
      1986 Annual Research Report
  • [Publications] 今井聖: 電子通信学会論文誌. J70-A. 10 (1987)

    • Related Report
      1986 Annual Research Report
  • [Publications] 古市千枝子: 電子通信学会論文誌. 9

    • Related Report
      1986 Annual Research Report

URL: 

Published: 1987-03-31   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi