
1991 Fiscal Year Final Research Report Summary

Creation of Japanese Speech Corpus for Speech Processing Research

Research Project

Project/Area Number 01850068
Research Category

Grant-in-Aid for Developmental Scientific Research

Allocation Type Single-year Grants
Research Field Electronic communication systems engineering
Research Institution University of Tsukuba

Principal Investigator

ITAHASHI Shuichi  University of Tsukuba, Institute of Information Sciences and Electronics, Professor (70151454)

Co-Investigator (Kenkyū-buntansha)

KUREMATSU Akira  ATR Interpreting Telephone Research Laboratories, President
MAKINO Shuzo  Tohoku University, Research Center for Applied Information Sciences, Associate Professor (00089806)
KOBATAKE Hidefumi  Tokyo University of Agriculture and Technology, Faculty of Technology, Professor (80013720)
SHIRAI Katsuhiko  Waseda University, Department of Electrical Engineering, Professor (10063702)
FUJISAKI Hiroya  Science University of Tokyo, Faculty of Engineering, Professor (80010776)
Project Period (FY) 1989 – 1991
Keywords CD-ROM / DAT / Japanese / Noise / Speech / Speech corpus / Speech database / Speech processing
Research Abstract

The speech material comprised 216 phonetically balanced words, 110 monosyllables, 70 short sentences, 11 interrogative sentences, 7 sentences for speech quality measurement, one folk tale, weather forecast sentences, and narrative sentences. Speech samples based on this material were recorded onto digital audio tape (DAT) from ten male and ten female speakers. Noise in a computer room was also recorded to investigate the influence of noise on speech processing.
Two hours of noise data were recorded onto DAT under 4 conditions that varied the number and kind of machines in operation.
The speech data of the 20 speakers (4 utterances per item, 2 hours per speaker, 40 hours in all) were checked by listening, and the data of 12 speakers (6 male and 6 female, 24 hours in all) with good speech and recording quality were selected. Master tapes for the speech database were produced with start IDs and program numbers assigned to the major items so that the necessary items can be retrieved easily. Checklists describing pronunciation and recording conditions in detail were prepared. For the 7 kinds of continuous speech data mentioned above, one utterance of good quality was selected from the 4 repetitions and recorded onto CD-ROM. This is believed to be the first attempt to create a CD-ROM speech database of Japanese sentence speech. The best utterances of the 110 monosyllables and the 216 phonetically balanced words were then also recorded onto CD-ROM.
One of the major objectives of a speech database is to support the development, comparison, and evaluation of techniques for speech analysis, synthesis, and recognition. Several kinds of speech analysis and recognition experiments were therefore performed. Some methods proved useful for speech/non-speech discrimination and for speech recognition in noisy environments.
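The recording totals quoted in the abstract can be cross-checked with a short calculation. The sketch below is only an illustration: the figures come from the abstract, while the constant and function names are assumptions introduced here and are not part of the project.

```python
# Figures taken from the abstract; all names are illustrative assumptions.
SPEAKERS_RECORDED = 20   # ten male and ten female speakers
SPEAKERS_SELECTED = 12   # six male and six female speakers kept after listening checks
HOURS_PER_SPEAKER = 2    # recording time per speaker


def total_hours(speakers: int, hours_per_speaker: int = HOURS_PER_SPEAKER) -> int:
    """Return the total recording time in hours for a group of speakers."""
    return speakers * hours_per_speaker


recorded_hours = total_hours(SPEAKERS_RECORDED)
selected_hours = total_hours(SPEAKERS_SELECTED)
selected_fraction = selected_hours / recorded_hours

print(f"recorded: {recorded_hours} h, selected: {selected_hours} h "
      f"({selected_fraction:.0%} of the recorded material)")
```

The computed totals agree with the 40 hours and 24 hours stated in the abstract.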

  • Research Products

    (70 results)


All Publications (70 results)

  • [Publications] K. SHIRAI, H. FUJISAKI, S. ITAHASHI: "Speech database projects in Japan - Present and Future -" Proc. ESCA Workshop on Speech Input/Output Assessment and Speech Databases. 2.4.1-2.4.4 (1989)

    • Description
      From the Final Research Report Summary (Japanese)
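  • [Publications] HUR Kang-In, ITAHASHI Shuichi, ITO Masahiro: "Formant extraction of Korean monophthong vowels using spectral moments" Proceedings of the 1990 Spring Meeting of the Acoustical Society of Japan. 3-4-3. 273-274 (1990)

    • Description
      From the Final Research Report Summary (Japanese)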
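  • [Publications] Liao (廖), Makino, Kido: "A study of the detection and recognition of plosive consonants using temporal change, local peaks, and slope of the spectrum" Journal of the Acoustical Society of Japan. Vol. 45, No. 9. 499-506 (1989)

    • Description
      From the Final Research Report Summary (Japanese)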
  • [Publications] Endo, Makino, Kido: "Phoneme recognition using LVQ2" IEICE Technical Report. SP89-50. 33-40 (1989)

    • Description
      From the Final Research Report Summary (Japanese)
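  • [Publications] Fujisaki, Hirose, Asano: "Configuration of a terminal-analog speech synthesizer suited to high-quality speech synthesis" Proceedings of the Fall Meeting of the Acoustical Society of Japan. 3-P-2. 279-280 (1989)

    • Description
      From the Final Research Report Summary (Japanese)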
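  • [Publications] Fujisaki, Hirose, Asano: "Synthesis by rule of Japanese speech using a formant synthesizer" Proceedings of the Spring Meeting of the Acoustical Society of Japan. 1-4-6. 189-190 (1990)

    • Description
      From the Final Research Report Summary (Japanese)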
  • [Publications] K. Shirai, N. Aoki, N. Hosaka: "Multi-level clustering of acoustic features for phoneme recognition based on mutual information" IEEE ICASSP. Vol. 1. 604-607 (1989)

    • Description
      From the Final Research Report Summary (Japanese)
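  • [Publications] K. Shirai, N. Aoki: "Phoneme recognition by hierarchical clustering of acoustic features based on mutual information" Transactions of the IEICE D-II. Vol. J72-D-II, No. 8. 1207-1214 (1989)

    • Description
      From the Final Research Report Summary (Japanese)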
  • [Publications] H. Kobatake and A. Ishida: "Speech/Nonspeech Discrimination under Nonstationary Noise Environments" Proc. ICASSP89. 365-368 (1989)

    • Description
      From the Final Research Report Summary (Japanese)
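  • [Publications] A. Ishida, H. Kobatake: "Speech classification in real environments" Proceedings of the Spring Meeting of the Acoustical Society of Japan. 1-2 (1990)

    • Description
      From the Final Research Report Summary (Japanese)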
  • [Publications] A. Kurematsu et al.: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Proceedings of the ESCA Workshop on Speech I/O Assessment. 2.3.1-2.3.4 (1989)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Shuichi ITAHASHI: "Recent Speech Database Projects in Japan" Proc. International Conference on Spoken Language Processing (ICSLP90), Kobe, Paper 24.1. 1081-1084 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Shuichi ITAHASHI: "On Speech Database Efforts in Japan" Preprints of International Symposium on International Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese /i, , l/ by Chinese and Japanese Speakers" Proc. International Conference on Spoken Language Processing (ICSLP90), Kobe, Paper 15.10. 629-632 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Hiroya FUJISAKI and S. SATO: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc. International Conference on Spoken Language Processing (ICSLP90), Kobe, Paper 11.14. 473-476 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Hiroya FUJISAKI: "Influence of Context and Knowledge on the Perception of Continuous Speech" Proc. International Conference on Spoken Language Processing (ICSLP90), Kobe, Paper 10.9. 417-420 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Katsuhiko SHIRAI: "Speaker Adaptable Phoneme Recognition Selecting Reliable Acoustic Features based on Mutual Information" Proc. of ICSLP90 - International Conference on Spoken Language Processing. 353-356 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] Katsuhiko SHIRAI: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc. of ICSLP90 - International Conference on Spoken Language Processing. 345-348 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] K. GYOUTOKU, (H. KOBATAKE): "Maximum likelihood estimation of speech waveform under nonstationary noise environments" Proc. ICSLP90. 1149-1152 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
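  • [Publications] T. Shirokaze, (S. Makino): "Extraction of fundamental frequency taking into account continuity over the whole utterance" Transactions of the IEICE. J73-A. 1537-1539 (1990)

    • Description
      From the Final Research Report Summary (Japanese)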
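  • [Publications] H. Matsuo, (S. Makino): "Spoken word recognition using a verification method based on a phoneme duration model" Transactions of the IEICE. J73-D-II. 1936-1944 (1990)

    • Description
      From the Final Research Report Summary (Japanese)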
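  • [Publications] T. Shirokaze, (S. Makino), K. Kido: "A large-scale distributed speech database 'K-DB' with a function for automatic acquisition of speech information" Transactions of the Information Processing Society of Japan. 32. 62-70 (1991)

    • Description
      From the Final Research Report Summary (Japanese)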
  • [Publications] A. Kurematsu: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Speech Communication. Vol. 9, No. 4. 357-357 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] A. Kurematsu: "A Perspective of telephone interpretation research" Proceedings of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)

    • Description
      From the Final Research Report Summary (Japanese)
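  • [Publications] T. Morimoto, (A. Kurematsu): "Spoken language translation toward realizing an automatic telephone interpretation system" Proceedings of Info-Japan '90 (International Conference Commemorating the 30th Anniversary of the Information Processing Society of Japan). 553-560 (1990)

    • Description
      From the Final Research Report Summary (Japanese)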
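  • [Publications] S. Itahashi: "A noise database and the DAT edition of the Japanese common speech data" Journal of the Acoustical Society of Japan. 47. 951-953 (1991)

    • Description
      From the Final Research Report Summary (Japanese)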
  • [Publications] S. Itahashi: "Creating Speech Corpora for Speech Science and Technology" IEICE Transactions. E74. 1906-1910 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] K. Shirai, E. Kitagawa, T. Endo: "Optimal Construction of Context Sensitive Quantizer For Phoneme Recognition in Continuous Speech" Proc. Eurospeech91. 14.1. 405-408 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] K. Shirai: "Feature extraction in speech recognition" Journal of the IEICE. 73. 1269-1275 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] H. Kobatake, K. Gyoutoku, L. Sheng: "Enhancement of Noisy Speech by Maximum Likelihood Estimation" Proc. ICASSP91. 973-976 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
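  • [Publications] A. Ishida, H. Kobatake: "Speech/non-speech discrimination in real environments" Journal of the Acoustical Society of Japan. 47. 911-917 (1991)

    • Description
      From the Final Research Report Summary (Japanese)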
  • [Publications] S. Makino, A. Ito, M. Endo, K. Kido: "A Japanese Text Dictation System Based on Phoneme Recognition and a Dependency Grammar" IEICE Transactions. E74. 1773-1782 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
  • [Publications] T. Morimoto, K. Shikano, K. Kogure, H. Iida, A. Kurematsu: "Integration of Speech Recognition and Language Processing in a Japanese to English Spoken Language Translation System" IEICE Transactions. E74. 1889-1991 (1991)

    • Description
      From the Final Research Report Summary (Japanese)
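  • [Publications] A. Kurematsu: "Speech information processing for automatic interpreting telephony" Journal of the Japanese Society for Artificial Intelligence, ATR special issue. (1991)

    • Description
      From the Final Research Report Summary (Japanese)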
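  • [Publications] H. Fujisaki, K. Hirose, N. Takahashi: "Statistical distribution of fundamental frequency in common Japanese and dialect speech" Proceedings of the Meeting of the Acoustical Society of Japan. 251-252 (1991)

    • Description
      From the Final Research Report Summary (Japanese)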
  • [Publications] K. Shirai, H. Fujisaki and S. Itahashi: "Speech database projects in Japan - Present and Future -" Proc. ESCA Workshop on Speech Input/Output Assessment and Speech Databases. (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] HUR Kang-In, S. Itahashi and M. Ito: "Formant extraction of eight Korean vowels using spectrum moments" Preprints, Spring Meeting, Acous. Soc. Jpn. Paper 3-4-3. 273-274 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] Ma and S. Makino: "Detection of plosive consonants utilizing temporal change, local peaks and slope of speech spectrum" J. Acous. Soc. Jpn. 45, 7. 499-506 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] Endo, S. Makino and K. Kido: "Phoneme recognition using LVQ2" Tech. Rep. IEICE. SP89-50. 33-40 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Fujisaki, K. Hirose and Y. Asano: "Configuration of terminal analogue synthesizer for high quality speech synthesis" Prep. Fall Meeting, Acous. Soc. Jpn. Paper 3-P-2. 279-280 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Fujisaki, K. Hirose and Y. Asano: "Rule synthesis of speech using terminal analog synthesizer" Prep. Spring Meeting, Acous. Soc. Jpn. Paper 1-4-6. 189-190 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai, N. Aoki and N. Hosaka: "Multi-level clustering of acoustic features for phoneme recognition based on mutual information" Proc. ICASSP89. 604-607 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai and N. Aoki: "Phoneme recognition by hierarchical clustering of acoustic features based on mutual information" Trans. IEICE. J72-D-II, 8. 1207-1214 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Kobatake and A. Ishida: "Speech/nonspeech discrimination for speech recognition under real life noise environments" Proc. ICASSP89. 365-368 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Ishida and H. Kobatake: "Speech discrimination under real life environments" Prep. Spring Meeting, Acous. Soc. Jpn. Paper 1-3-1. 1-2 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Kurematsu: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" Proceedings of ESCA Workshop on Speech I/O Assessment. Paper 2, 3. 1-4 (1989)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] S. Itahashi: "Recent Speech Database Projects in Japan" Proc. ICSLP90, Kobe. Paper 24.1. 1081-1084 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] S. Itahashi: "On Speech Database Efforts in Japan" Prep. Intn'l Symp. on Intn'l Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese /i, , l/ by Chinese and Japanese Speakers" Proc. ICSLP90, Kobe. Paper 15.10. 629-632 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Fujisaki, S. Sato: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc. ICSLP90, Kobe. Paper 11.14. 473-476 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Fujisaki: "Influence of Context and Knowledge on the Perception of Continuous Speech" Proc. ICSLP90, Kobe. Paper 10.9. 417-420 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai: "Speaker Adaptable Phoneme Recognition Selecting Reliable Acoustic Features based on Mutual Information" Proc. ICSLP90, Kobe. Paper 9.2. 353-356 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc. of ICSLP90. Paper 8.9. 345-348 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Gyoutoku and H. Kobatake: "Maximum likelihood estimation of speech waveform under nonstationary noise environments" Proc. ICSLP90. Paper 25.10. 1149-1152 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] T. Shirokaze and S. Makino: "Extraction of fundamental frequency using temporal continuity over an input speech" Trans. IEICE. J73-A. 1537-1539 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Matsuo and S. Makino: "Spoken word recognition using a verification method based on phoneme duration model" Trans. IEICE. J73-D-II. 1936-1944 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] T. Shirokaze, S. Makino and K. Kido: "A large scale distributed speech database "K-DB" with an acquisition system of speech information" Trans. Inform. Proc. Soc. Jpn. 32. 62-70 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Kurematsu: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" Speech Communication. 9, 4. 357-357 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Kurematsu: "A Perspective of telephone interpretation research" Proceedings of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] T. Morimoto and A. Kurematsu: "Spoken language translation toward realizing an automatic telephone interpretation system" Proceedings of Info-Japan '90. 553-560 (1990)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] S. Itahashi: "A noise database and Japanese common speech data corpus" J. Acous. Soc. Jpn. 47, 12. 951-953 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] S. Itahashi: "Creating Speech Corpora for Speech Science and Technology" IEICE Transactions. E74, 7. 1906-1910 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai, E. Kitagawa and T. Endo: "Optimal Construction of Context Sensitive Quantizer For Phoneme Recognition in Continuous Speech" Proc. Eurospeech91. Paper 14.1. 405-408 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] K. Shirai: "Feature extraction in speech recognition" Trans. IEICE. 73. 1269-1275 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Kobatake, K. Gyoutoku and L. Sheng: "Enhancement of Noisy Speech by Maximum Likelihood Estimation" Proc. ICASSP91. 973-976 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Ishida and H. Kobatake: "Speech/non-speech discrimination under real life environments" J. Acous. Soc. Jpn. 47, 12. (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] S. Makino, A. Ito, M. Endo and K. Kido: "A Japanese Text Dictation System Based on Phoneme Recognition and a Dependency Grammar" IEICE Transactions. E74, 7. 1773-1782 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] T. Morimoto, K. Shikano, K. Kogure, H. Iida and A. Kurematsu: "Integration of Speech Recognition and Language Processing in a Japanese to English Spoken Language Translation System" IEICE Transactions. E74, 7. 1889-1991 (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] A. Kurematsu: "Speech processing for automatic interpreting telephony" J. Jpn. Soc. Artif. Intel. (1991)

    • Description
      From the Final Research Report Summary (English)
  • [Publications] H. Fujisaki, K. Hirose and N. Takahashi: "Statistical distribution of voice fundamental frequencies in Common Japanese and dialect Japanese" Prep. Fall Meeting, Acous. Soc. Jpn. Paper 2-6-7. 251-252 (1991)

    • Description
      From the Final Research Report Summary (English)

Published: 1993-03-16  
