• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A Study for Utilizing the Linguistic Information in Phoneme Recognition to Understand Continuous Speech

Research Project

Project/Area Number 03452173
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 情報工学
Research InstitutionChiba Institute of Technology

Principal Investigator

KIDO Ken'iti  Chiba Inst. of Tech., Engineering, Prof., 工学部, 教授 (30006209)

Co-Investigator(Kenkyū-buntansha) MAKINO Shozo  Tokyo Univ., Research Center for Applied Information Sciences, Associate Prof., 応用情報学研究センタ, 助教授 (00089806)
ARAI Shuichi  Chiba Inst. of Tech., Engineering, Associate Prof., 工学部, 講師 (20212590)
UKIGAI Masahiro  Chiba Inst. of Tech., Engineering, Associate Prof., 工学部, 助教授 (80118695)
SUGAWARA Kenji  Chiba Inst. of Tech., Engineering, Prof., 工学部, 教授 (00137853)
MIIDA Yoshiro  Chiba Inst. of Tech., Engineering, Prof., 工学部, 教授 (10083859)
伊與田 光宏  千葉工業大学, 工学部, 助教授 (90160069)
Project Period (FY) 1991 – 1993
Project Status Completed (Fiscal Year 1993)
Budget Amount *help
¥6,800,000 (Direct Cost: ¥6,800,000)
Fiscal Year 1993: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1992: ¥2,000,000 (Direct Cost: ¥2,000,000)
Fiscal Year 1991: ¥3,900,000 (Direct Cost: ¥3,900,000)
KeywordsContinuous Speech Recognition / Speech Recognition / Phoneme Recognition / Speaker Independent / Linguistic Information
Research Abstract

In this study, we proposed 2 higher performance phoneme recognition methodsand the continuous speech recognition method utilizing the linguistic information around the target phoneme.
At first, we proposed MR-HMM (Multi-Resolution HMM) based on Wavelet transform, which is able to control the time-frequency resolution. The WTD (Wavelet transform Tree Data) is proposed to represent the time-frequency space in scalogram that is obtained through Wavelet transform. Using this WTD structure, we proposed the State merge Algorithm stucying MR-HMM, it enables the high recognition rate.
Next, we proposed the phoneme recognition method using the 9 acoustic features besides the cepstrum parameters that is most popular but not enough. In general, it is necessary for using the several kinds of acoustic parameters to analyze what parameters are suitable for the specified phoneme recognition. But, the proposed method enables using the several kinds of parameters except that. We proposed the Membership Scale to enable applying the linear discriminant method that is for 2 category discrimination to the multi category discrimination. Using this method, the linguistic recognition stage can get the reliability of the results from the acoustical recognition stage.
Finally, we proposed the new linguistic recognition method, that uses the co-occurative relationship of the words in one sentence. This method doesn't use the grammatical knowledge, so the task fre speech is available. Combining this linguistic recognition method with the acoustic recognition methods mentioned above, the misrecognition in the acoustical recognition stage can be controlled by the linguistic rrecognition stage. From the experimental results, we confirmed the effectiveness of the proposed recognition methods.

Report

(4 results)
  • 1993 Annual Research Report   Final Research Report Summary
  • 1992 Annual Research Report
  • 1991 Annual Research Report
  • Research Products

    (24 results)

All Other

All Publications (24 results)

  • [Publications] 柵橋健二: "異常発声音の評価を目的とした音声分析表示法の予備的検討" 電子情報通信学会技術研究会資料. EA93-33. 17-23 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] 大内康裕: "正常および異常音声の第1・第2フォルマント平面における比較" 日本音響学会秋季研究発表会講演論文集. 593-594 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] 柵橋健二: "正常および異常音声のフォルマント周波数の時間遷移パターンによる比較" 日本音響学会秋季研究発表会講演論文集. 595-596 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] Shozo Makino: "Speech to Text Conversion System Based on Phoneme Recognition" Annals of Applied Information Science. 18. 51-65 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] 栗原世治: "各種音響パラメータが保持する個人性情報の分析" 日本音響学会秋季研究発表会講演論文集. 645-646 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] 小林淳: "動詞、名詞のスポッティングによる会話文の認識" 日本音響学会秋季研究発表会講演論文集. 175-176 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1993 Final Research Report Summary
  • [Publications] 棚橋健二: "異常発声音の評価を目的とした音声分析表示法の予備的検討" 電子情報通信学会技術研究会資料. EA93-33. 17-23 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 大内康裕: "正常および異常音声の第1・第2フォルマント平面における比較" 日本音響学会秋期研究発表会講演論文集. 593-594 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 棚橋健二: "正常および異常音声のフォルマント周波数の時間遷移パターンによる比較" 日本音響学会秋期研究発表会講演論文集. 595-596 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] Shozo Makino: "Speech to Text Conversion System Based on Phoneme Recognition" Annals of Applied Information Science. 18. 51-65 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 栗原世治: "各種音響パラメータが保持する個人性情報の分析" 日本音響学会秋期研究発表会講演論文集. 645-646 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 小林淳: "動詞、名詞のスポッティングによる会話文の認識" 日本音響学会秋期研究発表会講演論文集. 175-176 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 張 中: "ホルマントを用いた中国単語母音の分析と認識" 日本音響学会誌. 47. 281-288 (1991)

    • Related Report
      1992 Annual Research Report
  • [Publications] 伊藤 彰則: "機能語予測CYK法による日本語文音声の統語処理" 電子情報通信学会誌. J74-D11,9. 1147-1155 (1991)

    • Related Report
      1992 Annual Research Report
  • [Publications] 熊切 義博: "短時間FFTによる音声分析ディスプレイ装置" 日本音響学会春期研究発表会講演論文集. 1-5-17. 431-432 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 牧野 正三: "Recognition on phonemes in continuous speech using a modified LVQ2 method" Journal Acoustic Society Japan. Vol.13. 351-360 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 荒井 秀一: "A Network for Phenome Recognition by Spectral Local Peaks" Proc.14th International Congress on Acoustics. G-4-1. 877-878 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 張 中: "調音結合モデルに基づく中国語音声認識システムの音素認識" 電子情報通信学会誌. J74-D11,9. 1156-1164 (1991)

    • Related Report
      1992 Annual Research Report
  • [Publications] 張 中: "ホルマントを用いた中国語単母音の分析と認識" 日本音響学会誌. 47. 281-288 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 伊藤 彰則: "機能語予測CYK法による日本語文音声の統語処理" 電子情報通信学会誌. J74ーDII,9. 1147-1155 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 張 中: "調音結合モデルを基づく中国語音声認識システムの音素認識" 電子情報通信学会誌. J74ーDII,9. 1156-1164 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 古賀 秀昭: "性別判定と多数決を用いたロ-カルピ-クによる単語中母音の認識" 日本音響学会秋季研究発表会講演論文集. 1ー5ー9. 17-18 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 棚橋 健二: "短時間FFTによる音声分析ー母音分析による検討ー" 日本音響学会春季研究発表会講演論文集. 2ーQー1. 159-160 (1992)

    • Related Report
      1991 Annual Research Report
  • [Publications] 熊切 義博: "短時間FFTによる音声分析ディスプレイ装置" 日本音響学会学季研究発表会講演論文集. 1ー5ー17. 431-432 (1992)

    • Related Report
      1991 Annual Research Report

URL: 

Published: 1991-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi