Word Recognition using A Two-Dimensional Mel-cepstrum under Noisy Environments.

Research Project

Project/Area Number 63550253
Research Category

Grant-in-Aid for General Scientific Research (C)

Allocation Type Single-year Grants
Research Field Electronic and communication systems engineering
Research Institution Nagoya Institute of Technology

Principal Investigator

KITAMURA Tadashi  Nagoya Institute of Technology, Faculty of Engineering, Associate Professor (60114865)

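Co-Investigator (Kenkyū-buntansha) HAYAHARA Etsuro  Nagoya Institute of Technology, Faculty of Engineering, Professor (80024214)
山田 由之  Nagoya Institute of Technology, Faculty of Engineering, Research Associate (50024253)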
Project Period (FY) 1988 – 1989
Project Status Completed (Fiscal Year 1989)
Budget Amount
¥2,100,000 (Direct Cost: ¥2,100,000)
Fiscal Year 1989: ¥200,000 (Direct Cost: ¥200,000)
Fiscal Year 1988: ¥1,900,000 (Direct Cost: ¥1,900,000)
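Keywords noise / word recognition / two-dimensional mel-cepstrum / Japanese digit / dynamic features of spectra / word speech recognition under noise / digit speech / speech recognition under noise / human auditory characteristics / mel frequency / temporal variation information of spectra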
Research Abstract

The purpose of this research is to offer a new method for word recognition under noisy environments. The study uses white noise generated by computer simulation and colored noise recorded at Nagoya Station. A speaker-independent word recognition method for the ten Japanese digits using a two-dimensional mel-cepstrum (TDMC) is proposed. The TDMC is defined as the two-dimensional Fourier transform of mel-frequency-scaled logarithmic spectra in the frequency and time domains, and it consists of average features and dynamic features of the two-dimensional mel-log spectra. The experimental results of this study are as follows.
1. Speech analysis-synthesis system using a TDMC and its evaluation: The structure of a speech analysis-synthesis system using a TDMC is proposed in order to study the size of the TDMC needed to synthesize good-quality speech. It is shown that the required region of the TDMC extends to frequencies of less than about 10 Hz.
2. Reference patterns robust to variation in the signal-to-noise ratio (SNR) of input speech: A single set of TDMCs computed from noise-added reference patterns with a desired SNR is used for word recognition under noisy environments. Experimental results show that recognition using this reference pattern set is more effective than a conventional method.
3. Distance measures for word recognition robust to variation in the SNR of input speech: Distance measures using a combination of dynamic and average features of the TDMC are proposed. It is shown that dynamic features are more important than average features for word recognition under noisy environments.
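
As a rough illustration of the definition in the abstract, the sketch below computes a TDMC as the two-dimensional Fourier transform of a mel-scaled log spectrogram and combines average and dynamic features in one distance. It is not the project's implementation. The mel formula, frame sizes, the reading of the zero time-frequency row as the "average" features, the squared-difference distance, and every function and parameter name (`mel_log_spectrogram`, `tdmc`, `split_features`, `combined_distance`, `weight`) are assumptions made for illustration only.

```python
import numpy as np


def hz_to_mel(f_hz):
    # A common mel-scale formula (assumption; the abstract does not give the warping).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_log_spectrogram(signal, sample_rate, frame_len=256, hop=128, n_mel=32):
    """Return an array (frames x n_mel) of log power spectra on a uniform mel axis."""
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    log_power = np.log(np.abs(np.fft.rfft(frames, axis=1)) ** 2 + 1e-10)
    freqs_hz = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    # Sample each frame's log spectrum at frequencies equally spaced in mel.
    top_mel = float(hz_to_mel(sample_rate / 2.0))
    target_hz = mel_to_hz(np.linspace(0.0, top_mel, n_mel))
    return np.stack([np.interp(target_hz, freqs_hz, row) for row in log_power])


def tdmc(mel_log_spec):
    """2-D Fourier transform over time (axis 0) and mel frequency (axis 1)."""
    return np.fft.fft2(mel_log_spec) / mel_log_spec.size


def split_features(coeffs, n_time=4, n_quef=8):
    """Split a low-order TDMC region into (average, dynamic) parts (assumed split)."""
    average = coeffs[0, :n_quef]         # zero time-frequency row: time-averaged spectrum
    dynamic = coeffs[1:n_time, :n_quef]  # nonzero time-frequency rows: spectral change
    return average, dynamic


def combined_distance(coeffs_a, coeffs_b, weight=0.7):
    """Weighted sum of squared differences; `weight` scales the dynamic term."""
    avg_a, dyn_a = split_features(coeffs_a)
    avg_b, dyn_b = split_features(coeffs_b)
    d_avg = np.sum(np.abs(avg_a - avg_b) ** 2)
    d_dyn = np.sum(np.abs(dyn_a - dyn_b) ** 2)
    return weight * d_dyn + (1.0 - weight) * d_avg
```

In this sketch, a `weight` above 0.5 emphasizes the dynamic features, loosely mirroring result 3; the value 0.7 is arbitrary. An input word would be matched by computing `combined_distance` against each reference TDMC and choosing the smallest value.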

Report

(3 results)
  • 1989 Annual Research Report   Final Research Report Summary
  • 1988 Annual Research Report
  • Research Products

    (23 results)


All Publications (23 results)

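  • [Publications] Yoshinori Asamura, Tadashi Kitamura: "Speech Analysis-Synthesis System Using A Two-Dimensional Mel-Cepstrum" IEICE Technical Report (Speech). SP88-47. 17-24 (1988)

    • Description
      From "Summary of Research Results Report (Japanese)"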
    • Related Report
      1989 Final Research Report Summary
  • [Publications] T.KITAMURA and E.HAYAHARA: "Word recognition using a two-dimensional mel-cepstrum in noisy environments" J.Acoust.Soc.Am.Suppl.1. Vol.84. PPP6 (1988)

    • Description
      From "Summary of Research Results Report (Japanese)"
    • Related Report
      1989 Final Research Report Summary
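  • [Publications] Yoshinori Asamura, 秋野秀之, Tadashi Kitamura: "Analysis and Synthesis of Monosyllables Using a Two-Dimensional Mel-Cepstrum" IEICE Technical Report (Speech). SP88-127. 41-48 (1989)

    • Description
      From "Summary of Research Results Report (Japanese)"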
    • Related Report
      1989 Final Research Report Summary
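  • [Publications] Tadashi Mizutani, Tadashi Kitamura: "On Methods Making Reference Patterns and Distance Measures in Digit Speech Recognition in Noisy Environments" IEICE Technical Report (Speech). SP88-121. 39-45 (1989)

    • Description
      From "Summary of Research Results Report (Japanese)"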
    • Related Report
      1989 Final Research Report Summary
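  • [Publications] Tadashi Kitamura, Keiichi Katayanagi: "Digit Recognition Using Static and Dynamic Features of A Two-Dimensional Mel-Cepstrum" Trans. IEICE (A). J72-A. 640-647 (1989)

    • Description
      From "Summary of Research Results Report (Japanese)"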
    • Related Report
      1989 Final Research Report Summary
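  • [Publications] Tadashi Kitamura, Etsuro Hayahara: "Speaker-Dependent Digit Word Recognition in Noisy Environments Using Dynamic Features of A Two-Dimensional Mel-Cepstrum" Trans. IEICE (D). J72-D-II. 1242-1247 (1989)

    • Description
      From "Summary of Research Results Report (Japanese)"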
    • Related Report
      1989 Final Research Report Summary
  • [Publications] Yoshinori Asamura, Tadashi Kitamura: "Speech Analysis-Synthesis System Using A Two-Dimensional Mel-Cepstrum" IEICE Technical Report SP88-47, pp.17-24, 1988.

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      1989 Final Research Report Summary
  • [Publications] Tadashi Kitamura, Etsuro Hayahara: "Word Recognition Using A Two-Dimensional Mel-Cepstrum under Noisy Environments" J.Acoust.Soc.Am.Suppl.1, Vol.84, PPP6, 1988.

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      1989 Final Research Report Summary
  • [Publications] Tadashi Mizutani, Tadashi Kitamura: "On Methods Making Reference Patterns and Distance Measures in Digit Speech Recognition in Noisy Environments." IEICE Technical Report SP88-121, pp.39-45, 1989.

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      1989 Final Research Report Summary
  • [Publications] Tadashi Kitamura, Keiichi Katayanagi: "Digit Recognition Using Static and Dynamic Features of A Two-Dimensional Mel-Cepstrum." Trans.IEICE, Vol.J72-A, No.4, pp.640-647, 1989.

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      1989 Final Research Report Summary
  • [Publications] Tadashi Kitamura, Etsuro Hayahara: "Speaker-Dependent Digit Word Recognition in Noisy Environments Using Dynamic Features of A Two-Dimensional Mel-Cepstrum." Trans.IEICE, Vol.J72-D-II, No.8, pp.1242-1247, 1989.

    • Description
      From "Summary of Research Results Report (English)"
    • Related Report
      1989 Final Research Report Summary
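  • [Publications] Tadashi Kitamura, Tadashi Mizutani: "Digit Speech Recognition under Noise Using Spectral Variation" Proceedings of the Acoustical Society of Japan Spring Meeting, FY Heisei 1. Heisei 1-03. 113-114 (1989)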

    • Related Report
      1989 Annual Research Report
  • [Publications] Tadashi Kitamura, Keiichi Katayanagi: "Digit Recognition Using Static and Dynamic Features of A Two-Dimensional Mel-Cepstrum" Trans. IEICE (A). J72-A. 640-647 (1989)

    • Related Report
      1989 Annual Research Report
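  • [Publications] Tadashi Kitamura, Etsuro Hayahara: "Speaker-Dependent Digit Word Recognition in Noisy Environments Using Dynamic Features of A Two-Dimensional Mel-Cepstrum" Trans. IEICE (D). J72-D-II. 1242-1247 (1989)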

    • Related Report
      1989 Annual Research Report
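  • [Publications] 嶋崎靖彦, Tadashi Kitamura: "Speaker-Independent Digit Speech Recognition under Noise" Proceedings of the Tokai-Section Joint Conference of Electrical-Related Societies, FY Heisei 1. Heisei 1-10. 422 (1989)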

    • Related Report
      1989 Annual Research Report
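  • [Publications] Tadashi Kitamura, Tadashi Mizutani: "Digit Speech Recognition under Noise Using Multiple Templates" Proceedings of the Acoustical Society of Japan Autumn Meeting, FY Heisei 1. Heisei 1-10. 65-66 (1989)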

    • Related Report
      1989 Annual Research Report
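  • [Publications] Tadashi Kitamura, 嶋崎靖彦: "Speaker-Independent Digit Speech Recognition under Noise Using Dynamic Features of Spectra" Proceedings of the Acoustical Society of Japan Spring Meeting, FY Heisei 2. Heisei 2-03. 5-6 (1990)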

    • Related Report
      1989 Annual Research Report
  • [Publications] Tadashi Kitamura: Proceedings of the Acoustical Society of Japan Autumn Meeting, FY Showa 63. Showa 63-10. 59-60 (1988)

    • Related Report
      1988 Annual Research Report
  • [Publications] Yoshinori Asamura: IEICE Technical Report (Speech). SP88-47. 17-24 (1988)

    • Related Report
      1988 Annual Research Report
  • [Publications] Tadashi Mizutani: IEICE Technical Report (Speech). SP88-121. 39-45 (1989)

    • Related Report
      1988 Annual Research Report
  • [Publications] Yoshinori Asamura: IEICE Technical Report (Speech). SP88-127. 41-48 (1989)

    • Related Report
      1988 Annual Research Report
  • [Publications] Tadashi Kitamura: Proceedings of the Acoustical Society of Japan Spring Meeting, FY Heisei 1. Heisei 1-03. 3-4 (1989)

    • Related Report
      1988 Annual Research Report
  • [Publications] Tadashi Kitamura: IEICE 1989 Spring National Convention. Heisei 1-03. A-23 (1989)

    • Related Report
      1988 Annual Research Report


Published: 1988-04-01   Modified: 2016-04-21  
