A study on the conkersion from a sentence speech to a kanji-kana string using phoneme recognition, syntax and semantics processings

Research Project

Project/Area Number	59420031
Research Category	Grant-in-Aid for General Scientific Research (A)
Allocation Type	Single-year Grants
Research Field	電子通信系統工学
Research Institution	TOHOKU UNIVERSITY
Principal Investigator	KIDO Ken'iti Professor, Research Center for Applied Information Sciences, Tohoku University, 国立大学(その他), 教授 (30006209)
Co-Investigator(Kenkyū-buntansha)	SUZUKI Yoiti Research Associate, Reseach Institute of Electrical Communication, Tohoku Univer, 電気通信研究所, 助手 (20143034) MIWA Jouji Associate Professor, Faculty of Engineering, Iwate University, 工学部, 助教授 (60125664) ABE Masato Research Associate, Faculty of Engineering, Tohoku University, 工学部, 助手 (00159443) MAKINO Shozo Research Associate, Research Center for Applied Information Sciences, Tohoku Uni, 応用情報学研究センター, 助手 (00089806)
Project Period (FY)	1984 – 1986
Project Status	Completed (Fiscal Year 1986)
Budget Amount *help	¥22,100,000 (Direct Cost: ¥22,100,000) Fiscal Year 1986: ¥3,000,000 (Direct Cost: ¥3,000,000) Fiscal Year 1985: ¥3,000,000 (Direct Cost: ¥3,000,000) Fiscal Year 1984: ¥16,100,000 (Direct Cost: ¥16,100,000)
Keywords	Speech Recognition / Natural language processing / Syntax processing / Semantic processing / Speaker-independent / 単語スポッテイング / 音声 / 自動認識 / 音声認識 / 意味 / 構文 / 音声データベース / 単語音声 / 文章音声
Research Abstract	We have developed a Japanese dictation system which can convert a continuous speech uttered by unspecified speaker to a Kanji-Kana string. The system is composed of an acoustic processing part, a spotting part of Bunsetsu-like units and a syntax and semantic processing part. Our object is to recognize a sentence speech whose syntax and semantic structures are syntactically and semantically reasonable. In the acoustic processing part, an input speech is analized by a 29 channel band-pass filter bank. Segment features are extracted from short-time spectra using time-spectrum patterns and then converted to a phoneme string. In the spotting part of Bunsetsu-like units, Bunsetsu-like units are spotted from a phoneme string using a syntactic driven continuous DP can dominantly reduce the amounts of computation and storage necessary to spot Bunsetsu-like units. Inthe syntactic and semantic processing part, functional features are given to Bunsetsu-like unit candidates from lexical items, where functional features contains syntactic and semantic information. Possibility of concatenation between two adjacent Bunsetsu-like units is checked based on the functional features and then the two adjacent units make a larger unit if satisfying syntactic and semantic reasonability, and thus a complete sentence is finally made. When a speaker utterd a sentence by a Bunsetsu-like unit, 60% of sentence recognition score is obtained in case of 85% phoneme recognition score. 80% of sentence recognition score is obtained in case of 95% phoneme recognition score. Refining the rules in syntactic and semantic processing part can improve the sentence recognition score.

Report

(2 results)

1986 Final Research Report Summary
1985 Annual Research Report

Research Products
(29 results)

All Other

All Publications (29 results)

[Publications] 牧野正三,本間茂,城戸健一: 日本音響学会英文誌. 6. 171-180 (1985)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 牧野正三,城戸健一: Speech Communication. Vol.5No.2. 225-237 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 安倍正人,金敬泰,城戸健一: 日本音響学会英文誌. Vol.7. 269-277 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 金千徳,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 239-247 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 金敬泰,牧野正三,城戸健一: 日本音響学会英文誌. Vol.7No.5. 325-334 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 金井浩,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 219-228 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 伊藤努,野戸広之,嶋明弘,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 187-195 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 岡田美智男,牧野正三,城戸健一: 日本音響学会音声研究会資料. S84-26. 199-206 (1984)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 小坂哲夫,岡田美智男,松尾広,城戸健一: 日本音響学会音声研宮会資料. S85-53. 405-412 (1985)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 小坂哲夫,城風敏彦,岡田美智男,城戸健一: 電子通信学会技術報告. EA85-32. 1-8 (1985)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 岡田美智男,牧野正三,城戸健一: 電子通信学会技術報告. EA85-34. 17-24 (1985)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 岡田美智男,伊藤彰則,松尾広,牧野正三: 電子通信学会技術報告. SP86-33. 49-56 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 城戸健一: "音声の合成と認識" オーム社, 112 (1986)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] S. Makino; S. Homma; K. Kido: "Speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary" J. Acoust. Soc. Jpn., (E). 6. 171-180 (1985)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] S. Makino; K. Kido: "Recognition of phonemes using time-spectrum pattern" Speech Communication. 5, No.2. 225-237 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] M. Abe; C. K. Kim; K. Kido: "Investigation of the effect of a time window on the accuracy of an estimated impulse response" J. Acoust. Soc. Jpn.,(E). 7. 269-277 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] C. K. Kim; M. Abe; K. Kido: "Investigation on the method for the estimation of impulse response using a rectangular pulse" J. Acoust. Soc. Jpn.,(E). 7. 239-247 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] K. T. Kim; S. Makino; K. Kido: "Recognition of stop consonants in Japanese words using spectral local peaks" J. Acoust. Soc. Jpn.,(E). 7. 325-334 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] T. Itoh; H. Noto; H. Shima; M. Abe; K. Kido: "A method of estimating the contribution factors of a sound source, using envelopes of band-passed signals" J. Acoust. Soc. Jpn.,(E). 7. 187-195 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] Michio OKADA; Shozo MAKINO; Ken-iti KIDO: "Normalization of coarticulation between a plosive and its succeeding phoneme in the recognition of plosive consonants." Transactions of the Committee on Speech Research The Acoustical Society of Japan. S84-26. 199-206 (1984)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] Tetsuo KOSAKA; Michio OKADA; Hiroshi MATSUO; Ken-iti KIDO: "Detection of segment type features for continuous speech recognition" Transactions of the Committee on Speech Research The Acoustical Society of Japan. S85-53. 405-412 (1985)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] Tetsuo KOSAKA; Toshihiko SHIROKAZE; Michio OKADA; Ken-iti KIDO: "Acoustic Characteristics of Devocalized Vowels, Long Vowels and "Mora" Phonemes in Spoken Words." Technical Report I. E. C. E.EA85-32. 1-8 (1985)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] Michio OKADA; Shozo MAKINO; Ken-iti KIDO: "Recognition of Voiced Plosives using Multiple Regression Model" Technical Report I. E. C. E.EA85-34. 17-24 (1985)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] Ken-iti Kido: OHM. Speaker synthesis and recognition, 112 (1986)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1986 Final Research Report Summary
[Publications] 日本音響学会英文誌. Vol.6-3. (1985)
- Related Report
  1985 Annual Research Report
[Publications] 日本音響学会誌. 41-12. (1985)
- Related Report
  1985 Annual Research Report
[Publications] 電子通信学会技術報告. EA85-32. (1985)
- Related Report
  1985 Annual Research Report
[Publications] 電子通信学会技術報告. EA85-34. (1985)
- Related Report
  1985 Annual Research Report
[Publications] 日本音響学音声研究会資料. S85-53. (1985)
- Related Report
  1985 Annual Research Report

A study on the conkersion from a sentence speech to a kanji-kana string using phoneme recognition, syntax and semantics processings

Principal Investigator

KIDO Ken'iti Professor, Research Center for Applied Information Sciences, Tohoku University, 国立大学(その他), 教授 (30006209)

¥22,100,000 (Direct Cost: ¥22,100,000)

Report

Research Products

[Publications] 牧野正三,本間茂,城戸健一: 日本音響学会英文誌. 6. 171-180 (1985)

Description

Related Report

[Publications] 牧野正三,城戸健一: Speech Communication. Vol.5No.2. 225-237 (1986)

Description

Related Report

[Publications] 安倍正人,金敬泰,城戸健一: 日本音響学会英文誌. Vol.7. 269-277 (1986)

Description

Related Report

[Publications] 金千徳,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 239-247 (1986)

Description

Related Report

[Publications] 金敬泰,牧野正三,城戸健一: 日本音響学会英文誌. Vol.7No.5. 325-334 (1986)

Description

Related Report

[Publications] 金井浩,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 219-228 (1986)

Description

Related Report

[Publications] 伊藤努,野戸広之,嶋明弘,安倍正人,城戸健一: 日本音響学会英文誌. Vol.7. 187-195 (1986)

Description

Related Report

[Publications] 岡田美智男,牧野正三,城戸健一: 日本音響学会音声研究会資料. S84-26. 199-206 (1984)

Description

Related Report

[Publications] 小坂哲夫,岡田美智男,松尾広,城戸健一: 日本音響学会音声研宮会資料. S85-53. 405-412 (1985)

Description

Related Report

[Publications] 小坂哲夫,城風敏彦,岡田美智男,城戸健一: 電子通信学会技術報告. EA85-32. 1-8 (1985)

Description

Related Report

[Publications] 岡田美智男,牧野正三,城戸健一: 電子通信学会技術報告. EA85-34. 17-24 (1985)

Description

Related Report

[Publications] 岡田美智男,伊藤彰則,松尾広,牧野正三: 電子通信学会技術報告. SP86-33. 49-56 (1986)

Description

Related Report

[Publications] 城戸健一: "音声の合成と認識" オーム社, 112 (1986)

Description

Related Report

[Publications] S. Makino; S. Homma; K. Kido: "Speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary" J. Acoust. Soc. Jpn., (E). 6. 171-180 (1985)

Description

Related Report

[Publications] S. Makino; K. Kido: "Recognition of phonemes using time-spectrum pattern" Speech Communication. 5, No.2. 225-237 (1986)

Description

Related Report

[Publications] M. Abe; C. K. Kim; K. Kido: "Investigation of the effect of a time window on the accuracy of an estimated impulse response" J. Acoust. Soc. Jpn.,(E). 7. 269-277 (1986)

Description

Related Report

[Publications] C. K. Kim; M. Abe; K. Kido: "Investigation on the method for the estimation of impulse response using a rectangular pulse" J. Acoust. Soc. Jpn.,(E). 7. 239-247 (1986)

Description

Related Report

[Publications] K. T. Kim; S. Makino; K. Kido: "Recognition of stop consonants in Japanese words using spectral local peaks" J. Acoust. Soc. Jpn.,(E). 7. 325-334 (1986)

Description

Related Report

[Publications] T. Itoh; H. Noto; H. Shima; M. Abe; K. Kido: "A method of estimating the contribution factors of a sound source, using envelopes of band-passed signals" J. Acoust. Soc. Jpn.,(E). 7. 187-195 (1986)

Description

Related Report

[Publications] Michio OKADA; Shozo MAKINO; Ken-iti KIDO: "Normalization of coarticulation between a plosive and its succeeding phoneme in the recognition of plosive consonants." Transactions of the Committee on Speech Research The Acoustical Society of Japan. S84-26. 199-206 (1984)

Description

Related Report

[Publications] Tetsuo KOSAKA; Michio OKADA; Hiroshi MATSUO; Ken-iti KIDO: "Detection of segment type features for continuous speech recognition" Transactions of the Committee on Speech Research The Acoustical Society of Japan. S85-53. 405-412 (1985)

Description

Related Report

[Publications] Tetsuo KOSAKA; Toshihiko SHIROKAZE; Michio OKADA; Ken-iti KIDO: "Acoustic Characteristics of Devocalized Vowels, Long Vowels and "Mora" Phonemes in Spoken Words." Technical Report I. E. C. E.EA85-32. 1-8 (1985)

Description

Related Report

[Publications] Michio OKADA; Shozo MAKINO; Ken-iti KIDO: "Recognition of Voiced Plosives using Multiple Regression Model" Technical Report I. E. C. E.EA85-34. 17-24 (1985)

Description

Related Report

[Publications] Ken-iti Kido: OHM. Speaker synthesis and recognition, 112 (1986)

Description

Related Report

[Publications] 日本音響学会英文誌. Vol.6-3. (1985)

Related Report