A Study for Utilizing the Linguistic Information in Phoneme Recognition to Understand Continuous Speech

Research Project

Project/Area Number	03452173
Research Category	Grant-in-Aid for General Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	Information Engineering
Research Institution	Chiba Institute of Technology
Principal Investigator	KIDO Ken'iti, Chiba Inst. of Tech., Faculty of Engineering, Prof. (30006209)
Co-Investigator(Kenkyū-buntansha)	MAKINO Shozo, Tokyo Univ., Research Center for Applied Information Sciences, Associate Prof. (00089806)
	ARAI Shuichi, Chiba Inst. of Tech., Faculty of Engineering, Associate Prof. [講師] (20212590)
	UKIGAI Masahiro, Chiba Inst. of Tech., Faculty of Engineering, Associate Prof. (80118695)
	SUGAWARA Kenji, Chiba Inst. of Tech., Faculty of Engineering, Prof. (00137853)
	MIIDA Yoshiro, Chiba Inst. of Tech., Faculty of Engineering, Prof. (10083859)
	IYODA Mitsuhiro (伊與田光宏), Chiba Inst. of Tech., Faculty of Engineering, Associate Prof. (90160069)
Project Period (FY)	1991 – 1993
Project Status	Completed (Fiscal Year 1993)
Budget Amount	¥6,800,000 (Direct Cost: ¥6,800,000); Fiscal Year 1993: ¥900,000 (Direct Cost: ¥900,000); Fiscal Year 1992: ¥2,000,000 (Direct Cost: ¥2,000,000); Fiscal Year 1991: ¥3,900,000 (Direct Cost: ¥3,900,000)
Keywords	Continuous Speech Recognition / Speech Recognition / Phoneme Recognition / Speaker Independent / Linguistic Information
Research Abstract	In this study, we proposed two higher-performance phoneme recognition methods and a continuous speech recognition method that utilizes the linguistic information around the target phoneme. First, we proposed the MR-HMM (Multi-Resolution HMM), which is based on the wavelet transform and can control the time-frequency resolution. We proposed the WTD (Wavelet transform Tree Data) structure to represent the time-frequency space of the scalogram obtained through the wavelet transform. Using this WTD structure, we proposed a state-merging algorithm for training the MR-HMM, which enables a high recognition rate. Next, we proposed a phoneme recognition method that uses nine acoustic features in addition to the cepstrum parameters, which are the most widely used but are not sufficient on their own. In general, using several kinds of acoustic parameters requires analyzing which parameters are suitable for recognizing each specific phoneme; the proposed method makes it possible to use several kinds of parameters without that analysis. We also proposed the Membership Scale, which allows the linear discriminant method, designed for two-category discrimination, to be applied to multi-category discrimination. With this method, the linguistic recognition stage can obtain the reliability of the results from the acoustic recognition stage. Finally, we proposed a new linguistic recognition method that uses the co-occurrence relationships among the words in a sentence. Because this method does not use grammatical knowledge, it can handle task-free speech. By combining this linguistic recognition method with the acoustic recognition methods described above, misrecognitions in the acoustic recognition stage can be kept in check by the linguistic recognition stage. The experimental results confirmed the effectiveness of the proposed recognition methods.
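
The abstract does not say how the co-occurrence relationships are scored or how they are combined with the acoustic results, so the following is only a minimal sketch of the general idea, not the method used in the project. It assumes a toy table of word-pair counts, a `log(1 + count)` score, and an additive combination with acoustic log-scores; the names `COOCCURRENCE`, `cooccurrence_score`, `best_sentence`, and `weight` are hypothetical.

```python
from itertools import combinations, product
from math import log

# Hypothetical counts of how often two words appear in the same sentence (toy data).
COOCCURRENCE = {
    frozenset(("train", "station")): 8,
    frozenset(("train", "arrive")): 6,
    frozenset(("station", "arrive")): 5,
    frozenset(("rain", "station")): 1,
}


def cooccurrence_score(words, counts=COOCCURRENCE):
    """Sum log(1 + count) over every unordered pair of words in the sentence."""
    return sum(
        log(1 + counts.get(frozenset(pair), 0))
        for pair in combinations(words, 2)
    )


def best_sentence(candidates, weight=1.0):
    """Pick one word per position.

    candidates: one list per word position, each holding
    (word, acoustic_log_score) tuples from the acoustic stage.
    Returns the word list that maximizes
    acoustic score + weight * co-occurrence score.
    No grammar is consulted; only word co-occurrence is used.
    """
    best_words, best_total = None, float("-inf")
    for combo in product(*candidates):
        words = [word for word, _ in combo]
        total = sum(score for _, score in combo)
        total += weight * cooccurrence_score(words)
        if total > best_total:
            best_words, best_total = words, total
    return best_words


# The acoustic stage slightly prefers "rain" over "train" at the first position.
candidates = [
    [("rain", -1.0), ("train", -1.2)],
    [("station", -0.5)],
    [("arrive", -0.7)],
]
print(best_sentence(candidates))              # co-occurrence favors "train"
print(best_sentence(candidates, weight=0.0))  # acoustic score only
```

With `weight=0.0` the selection falls back to the acoustic ranking alone, which makes it easy to see the effect of the co-occurrence term. The exhaustive search over all combinations is only practical for short candidate lists like this toy case.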

Report

(4 results)

1993 Annual Research Report / Final Research Report Summary
1992 Annual Research Report
1991 Annual Research Report

Research Products
(24 results)

All Publications (24 results)

[Publications] 棚橋健二: "異常発声音の評価を目的とした音声分析表示法の予備的検討" 電子情報通信学会技術研究会資料. EA93-33. 17-23 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] 大内康裕: "正常および異常音声の第1・第2フォルマント平面における比較" 日本音響学会秋季研究発表会講演論文集. 593-594 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] 棚橋健二: "正常および異常音声のフォルマント周波数の時間遷移パターンによる比較" 日本音響学会秋季研究発表会講演論文集. 595-596 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] Shozo Makino: "Speech to Text Conversion System Based on Phoneme Recognition" Annals of Applied Information Science. 18. 51-65 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] 栗原世治: "各種音響パラメータが保持する個人性情報の分析" 日本音響学会秋季研究発表会講演論文集. 645-646 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] 小林淳: "動詞、名詞のスポッティングによる会話文の認識" 日本音響学会秋季研究発表会講演論文集. 175-176 (1993)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1993 Final Research Report Summary
[Publications] 棚橋健二: "異常発声音の評価を目的とした音声分析表示法の予備的検討" 電子情報通信学会技術研究会資料. EA93-33. 17-23 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 大内康裕: "正常および異常音声の第1・第2フォルマント平面における比較" 日本音響学会秋季研究発表会講演論文集. 593-594 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 棚橋健二: "正常および異常音声のフォルマント周波数の時間遷移パターンによる比較" 日本音響学会秋季研究発表会講演論文集. 595-596 (1993)
- Related Report
  1993 Annual Research Report
[Publications] Shozo Makino: "Speech to Text Conversion System Based on Phoneme Recognition" Annals of Applied Information Science. 18. 51-65 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 栗原世治: "各種音響パラメータが保持する個人性情報の分析" 日本音響学会秋季研究発表会講演論文集. 645-646 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 小林淳: "動詞、名詞のスポッティングによる会話文の認識" 日本音響学会秋季研究発表会講演論文集. 175-176 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 張中: "ホルマントを用いた中国語単母音の分析と認識" 日本音響学会誌. 47. 281-288 (1991)
- Related Report
  1992 Annual Research Report
[Publications] 伊藤彰則: "機能語予測CYK法による日本語文音声の統語処理" 電子情報通信学会誌. J74-D-II, 9. 1147-1155 (1991)
- Related Report
  1992 Annual Research Report
[Publications] 熊切義博: "短時間FFTによる音声分析ディスプレイ装置" 日本音響学会春季研究発表会講演論文集. 1-5-17. 431-432 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 牧野正三: "Recognition of phonemes in continuous speech using a modified LVQ2 method" Journal of the Acoustical Society of Japan. Vol.13. 351-360 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 荒井秀一: "A Network for Phoneme Recognition by Spectral Local Peaks" Proc. 14th International Congress on Acoustics. G-4-1. 877-878 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 張中: "調音結合モデルに基づく中国語音声認識システムの音素認識" 電子情報通信学会誌. J74-D-II, 9. 1156-1164 (1991)
- Related Report
  1992 Annual Research Report
[Publications] 張中: "ホルマントを用いた中国語単母音の分析と認識" 日本音響学会誌. 47. 281-288 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 伊藤彰則: "機能語予測CYK法による日本語文音声の統語処理" 電子情報通信学会誌. J74-D-II, 9. 1147-1155 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 張中: "調音結合モデルに基づく中国語音声認識システムの音素認識" 電子情報通信学会誌. J74-D-II, 9. 1156-1164 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 古賀秀昭: "性別判定と多数決を用いたローカルピークによる単語中母音の認識" 日本音響学会秋季研究発表会講演論文集. 1-5-9. 17-18 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 棚橋健二: "短時間FFTによる音声分析 ―母音分析による検討―" 日本音響学会春季研究発表会講演論文集. 2-Q-1. 159-160 (1992)
- Related Report
  1991 Annual Research Report
[Publications] 熊切義博: "短時間FFTによる音声分析ディスプレイ装置" 日本音響学会春季研究発表会講演論文集. 1-5-17. 431-432 (1992)
- Related Report
  1991 Annual Research Report
