Creation of Japanese Speech Corpus for Speech Processing Research

Research Project

Project/Area Number	01850068
Research Category	Grant-in-Aid for Developmental Scientific Research
Allocation Type	Single-year Grants
Research Field	電子通信系統工学
Research Institution	University of Tsukuba
Principal Investigator	ITAHASHI Shuichi Univ. of Tsukuba, Institute of Information Sciences and Electronics, Professor, 電子・情報工学系, 教授 (70151454)
Co-Investigator(Kenkyū-buntansha)	KUREMATSU Akira ATR Interpreting Telephone Research Laboratories, President, 社長 MAKINO Shuzo Tohoku University, Research Center for Applied Information Sciences, Associate P, 応情研センター, 助教授 (00089806) KOBATAKE Hidefumi Tokyo University of Agriculture and Technology, Faculty of Technology, Professor, 工学部, 教授 (80013720) SHIRAI Katsuhiko Waseda University, Department of Electrical Engineering, Professor, 理工学部, 教授 (10063702) FUJISAKI Hiroya Science University of Tokyo, Faculty of Engineering, Professor, 基礎工学部, 教授 (80010776) 城戸健一東北大学, 応用情報学研研究センター長, 教授 (30006209)
Project Period (FY)	1989 – 1991
Project Status	Completed (Fiscal Year 1991)
Budget Amount *help	¥24,800,000 (Direct Cost: ¥24,800,000) Fiscal Year 1991: ¥6,200,000 (Direct Cost: ¥6,200,000) Fiscal Year 1990: ¥8,000,000 (Direct Cost: ¥8,000,000) Fiscal Year 1989: ¥10,600,000 (Direct Cost: ¥10,600,000)
Keywords	CD-ROM / DAT / Japanese / Noise / Speech / Speech corpus / Speech database / Speech processing / 音声デ-タベ-ス / CD-ROM / 音声情報処理
Research Abstract	Speech material was chosen including photetically ballanced 216 words, 110 monosyllables, 70 short sentences, 11 interrogative sentences, 7 sentences for speech quality measurement, one folk tale, weather forecast sentences and narrative sentences. Speech samples were recorded onto digital audio tapes (DAT) based on the above material with ten male and ten female speakers. Noise sound in a computer room was recorded to investigate influence of noise to speech processing. Noise data of two hour duration was recorded onto DAT under 4 conditions of varying numbers and kinds of working machines. Speech data of the 20 speakers mentioned above (4 utterances for each item, 2 hours for each speaker, 40 hours in all) was checked by hearing and those of good speech and recording quality of 12 speakers (6 male and 6 female speakers, 24 hours in all) was selected among them. Master tapes for speech database was produced with start ID's and program numbers assigned to major items so that necessary items can be retrieved easily. Check lists were prepared which describes pronunciation and recording conditions in detail. One utterance of good quality from 4 repetitions was selected for 7 kinds of continuous speech data mentioned above and they were recorded on to CD-ROM. This would be the first attempt to create a CD-ROM speech database of Japanese sentence speech. Then best utterances of 110 monosyllables and phonetically balanced 216 words were also recorded onto CD-ROM. One of the major objectives of speech databases is to utilize them to develop various techniques of speech analysis, synthesis and recognition and to compare and evaluate them. Therefore, several kinds of speech analysis and recognition experiments have been performed. Some methods were proved to be useful for speech/non-speech discrimination and speech recognition under noise environments.

Report

(4 results)

1991 Annual Research Report Final Research Report Summary
1990 Annual Research Report
1989 Annual Research Report

Research Products
(107 results)

All Other

All Publications (107 results)

[Publications] K.SHIRAI,H,FUJISAKI,S.ITAHASHI: "Speech database projects in Japan -Present and Future-" Proc.ESCA Workshop on Speech Input/Output Assesment and Speech Databases. 2,4,1-2,4,4 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 許康仁,板橋秀一,伊藤正弘: "スペクトルモ-メントによる韓国語の単母音のホルマント抽出" 日本音響学会平成2年度春季研究発表会講演論文集. 3ー4ー3. 273-274 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 廖,牧野,城戸: "スペクトルの時間変化、ロ-カルピ-ク、傾斜を利用した破裂子音の検出と認識の検討" 日本音響学会誌. 45巻9号. 499-506 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 遠藤,牧野,城戸: "LVQ2を用いた音素認識" 電子情報通信学会技術報告. SP89ー50. 33-40 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 藤崎,広瀬,浅野: "高品質の音声合成に適したタ-ミナル・アナログ型音声合成器の構成" 日本音響学会秋季研究発表会講演論文集. 3ーPー2. 279-280 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 藤崎,広瀬,浅野: "ホルマント合成器による日本語音声の規則合成" 日本音響学会春季研究発表会講演論文集. 1ー4ー6. 189-190 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 白井克彦,青木紀將,保坂尚樹: "MULTIーLEVEL CLUSTERING OF ACOUSTIC FEATURES FOR PHONEME RECOGNITION BASED ON MUTUAL INFORMATION" IEEE ICASSP. Vol.1. 604-607 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 白井克彦,青木紀將: "音響特徴量の情報量に基づく階層的クラスタリングによる音韻認識" 電子情報通信学会論文誌DーII Vol.J72ーDーII. No.8. 1207-1214 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H.Kobatake and A.Ishida: "Speech/Nonspeech Discrimination under Nonstationary Noise Environments" Proc.ICASSP89. 365-368 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 石田明,小畑秀文: "実環境下における音声分類" 日本音響学会春期講演論文集. 1-2 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 榑松明他: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Proceeding of ESCA Workshop on Speech I/O Assessment. 2,3,1-2,3,4 (1989)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Shuichi ITAHASHI: "Recent Speech Database Projects in Japan" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 24.1. 1081-1084 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Shichi ITAHASHI: "On Speech Database Efforts in Japan" Preperints of International Symposium on International Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese/i,,l/by Chinese and Japanese Speakers" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 15.10. 629-632 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Hiroya FUJISAKI and S.SATO: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 11.14. 473-476 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Hiroya FUJISAKI: "Influence of context and knowkedge on the Perception of Continuous Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 10.9. 417-420 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Katsuhiko SHIRAI: "Speaker Adaptable Phoneme Recognition Selecting Reliable Acoustic Features based on Mutual Information" Proc.of ICSLP90-International conference on Spoken Language Processing. 353-356 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Katsuhiko SHIRAI: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc.of ICSLP90-Internatiional Conference on Spoken Lanaguage Processing. 345-348 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K.GYOUTOKU,(H.KOBATAKE): "Maximum likelohood estimation of speech waveform under nonstatonary noise environments" Proc.ICSLP90. 1149-1152 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 城風敏彦,(牧野正三): "発話全体の連続性を考慮した基本周波数の抽出" 電子情報通信学会論文誌. J73ーA. 1537-1539 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 松尾広,(牧野正三): "音素の持続時間モデルに基づく検証法を用いた単語音声認識" 電子情報通信学会論文誌. J73ーDーII. 1936-1944 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 城風敏彦,(牧野正三),城戸健一: "音声情報の自動獲得機能を持つ分散型大規模音声デ-タベ-ス「KーDB」" 情報処理学会論文誌. 32. 62-70 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 榑松明: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" SPEECH COMMUNICATION. Volum9 No.4. 357-357 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 榑松明: "A Perspective of telephone interpretation research" Proceeding of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 森元逞,(榑松明): "Spoken language translation toward realizing an automatic telephone interpretation system" Proceeding of InfoーJapan'90(情報処理学会30周年記念国際会議). 553-560 (1990)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 板橋秀一: "騒音デ-タベ-スと日本語共通音声デ-タDAT版" 日本音響学会誌. 47. 951-953 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S.Itahashi: ""Creating Speech Corpora for Speech Science and Technology"" IEICE Transactions. E74. 1906-1910 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K.Shirai,E.Kitagawa,T.Endo: ""Optimal Construction of Context Sensitive Quantizer For Phoneme Recognition in Continuous Speech"" Proc.Eurospeech91. 14.1. 405-408 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 白井克彦: "「音声認識における特徴抽出」" 電子情報通信学会誌. 73. 1269-1275 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H.Kobatake,K.Gyoutoku,L.Sheng: ""Enhancement of Noisy Speech by Maximum Likelihood Estimation"" Proc.ICASSP91. 973-976 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 石田明,小畑秀文: "「実環境下での音声/非音声の判別」" 日本音響学会誌. 47. 911-917 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S.Makino,A.Ito,M.Endo,K.Kido: ""A Japanese Text Dictation System Based on Phoneme Recognition and a Dependency Grammar"" IEICE Transactions. E74. 1773-1782 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] T.Morimoto,K.shikano,K.Kogure,H.Iida,A.Kurematsu: ""Integration of Speech Recognition and Language Processing in a Japanese to English Spoken Language Translation System"" IEICE Transactions. E74. 1889-1991 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 榑松明: "「自動翻訳電話のための音声情報処理」" 人工知能学会誌「ATR特集号」. (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 藤崎博也,広瀬啓吉,高橋登: "共通語および方言音声における基本周波数の統計的分布" 日本音響学会講演論文集. 251-252 (1991)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai, H. Fujisaki and S. Itahashi: "Speech database projects in Japan - Present and Future -" Proc. ESCA Workshop on Speech Input/output Assesment and Speech Databases. (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] HUR Kang-In, S. Itahashi and M. Ito: "Formant extraction of eight Korean vowels using spectrum moments" Preprints, Spring Meeting, Acous. Soc. Jpn.Paper 3-4-3. 273-274 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Ma and S. Makino: "Detection of plosive consonants utilizing temporal change, local peaks and slpoe of speech spectrum" J. Acous. soc. Jpn.45, 7. 499-506 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Endo, S. Makino and K. Kido: "Phoneme recognition using LVQ2" Tech. Rep. IEICE. SP89-50. 33-40 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Fujisaki, K. Hirose and Y. Asano: "Configuration of terminal analogue synthesizer for high quality speech synthesis" Prep. Fall Meeting, Acous. Soc. Jpn.Paper 3-P-2. 279-280 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Fujisaki, K. Hirose and Y. Asano: "Rule synthesis of speech using terminal analog synthesizer" Prep, Spring Meeting, Acous. Soc. Jpn.Paper 1-4-6. 189-190 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai, N. Aoki and N. Hosaka: "Multi-level clustering of acoustic features for phoneme recognition based on mutual information" Proc. ICASSP89. 604-607 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai and N. Aoki: "Phoneme recognition by hierarchical clustering of acoustic features based on mutual information" Trans. IEICE.J72-D-II, 8. 1207-1214 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Kobatake and A. Ishida: "Speech/nonspeech discrimination for speech recognition under real life Noise Environments" Proc. ICASSP89. 365-368 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Ishida and H. Kobatake: "Speech discrimination under real life environments" Prep. Spring Meeting, Acous. Soc. Jpn.Paper 1-3-1. 1-2 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Kurematsu: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" Proceeding of ESCA Workshop on Speech I/O Assessment. Paper 2, 3. 1-4 (1989)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S. Itahashi: "Recent Speech Database Projects in Japan" Proc. ICSLP90, Kobe. Paper 24.1. 1081-1084 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S. Itahashi: "On Speech Database Efforts in Japan" Prep. Intn'l Symp. on Intn'l Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese /i, , l/ by Chinese and Japanese Speakers" Proc. ICSLP90, Kobe. Paper 15.10. 629-632 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Fujisaki, S. Sato: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc. ICSLP90, Kobe. Paper 11.14. 473-476 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Fujisaki: "Influence of Context and Knowkedge on the Perception of Continuous Speech" Proc. ICSLP90, Kobe. Paper 10.9. 417-420 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai: "Speaker Adaptable Phoneme Recognition Selection Reliable Acoustic Features based on Mutual Information" Proc. ICSLP90, Kobe. Paper 9.2. 353-356 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc. of ICSLP90. Paper 8.9. 345-348 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Gyoutoku and H. Kobatake: "Maximum likelihood estimation of speech waveform under nonstationary noise environments" Proc. ICSLP90. Paper 25.10. 1149-1152 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] T. Shirokaze and S. Makino: "Extraction of fundamental frequency using temporal continuity over an input speech" Trans. IEICE.J73-A. 1537-1539 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Matsuo and S. Makino: "Spoken word recognition using a verification method based on phoneme duration model" Trans. IEICE.J-73-D-II. 1936-1944 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] T. Shirokaze, S. Makino and K. Kido: "A Large sale distributed speech database "K-DB" with an acquisition system of speech information" Trans. Inform. Proc. Soc. Jpn.32. 62-70 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Kurematsu: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" Speech Communication. 9, 4. 357-357 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Kurematsu: "A Perspective of telephone interpretation research" Proceeding of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] T. Morimoto and A. Kurematsu: "Spoken language translation toward realizing an automatic telephone interpretation system" Proceeding of Info-Japan '90. 553-560 (1990)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S. Itahashi: "A noise database and Japanese common speech data corpus" J. Acous. Soc. Jpn.47, 12. 951-953 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S. Itahashi: "Creating Speech Corpora for Speech Science and Technology" IEICE Transactions. E74, 7. 1906-1910 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai, E. Kitagawa and T. Endo: "Optimal Construction of Context Sensitive Quantizer For Phoneme Recognition in Continuous Speech" Proc. Neurospeech91. Paper 14.1. 405, 408 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] K. Shirai: "Feature extraction in speech recognition" Trans. IEICE.73. 1269-1275 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Kobatake, K. Gyoutoku and L. Sheng: "Enhancement of Noisy speech by Maximum Likelihood Estimation" Proc. ICASSP91. 973-976 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Ishida and H. Kobatake: "Speech/non-speech discrimination under real life environments" J. Acous. Soc. Jpn.47, 12. (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] S. Makino, A. Ito, M. Endo and K. Kido: "A Japanese Text Dictation System Based on Phoneme Recognition and a Dependency Grammar" IEICE Transactions. E74, 7. 1773-1782 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] T. Morimoto, K. Shikano, K. Kogure, H. Iida and A. Kurematsu: "Integration of Speech Recognition and Language Processing in a Japanese to English Spoken Language Translation System" IEICE Transactions. E74, 7. 1889-1991 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] A. Kurematsu: "Speech processing for automatic interpreting telephony." J. Jpn. Soc. Artif. Intel.(1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] H. Fujisaki, K. Hirose and N. Takahashi: "Statistical distribution of voice fundamental frequencies in Common Japanese and dialect Japanese" Prep. Fall Meeting, Acous. Soc. Jpn.Paper 2-6-7. 251-252 (1991)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1991 Final Research Report Summary
[Publications] 板橋秀一: "騒音デ-タベ-スと日本語共通音声デ-タDAT版" 日本音響学会誌. 47. 951-953 (1991)
- Related Report
  1991 Annual Research Report
[Publications] S.Itahashi: ""Creating Speech Corpora for Speech Science and Technology"" IEICE Transactions. E74. 1906-1910 (1991)
- Related Report
  1991 Annual Research Report
[Publications] K.Shirai,E.Kitagawa, T.Endo: ""Optimal Construction of Context Sensitive Quantizer For Phoneme Recognition in Continuous Speech"" Proc.Eurospeech91. 14.1. (1991)
- Related Report
  1991 Annual Research Report
[Publications] 白井克彦: "「音声認識における特徴抽出」" 電子情報通信学会誌. 73. 1269-1275 (1991)
- Related Report
  1991 Annual Research Report
[Publications] H.Kobatake,K.Gyoutoku, L.Sheng: ""Enhancement of Noisy Speech by Maximum Likelihood Estimation"" Proc.ICASSP91. 973-976 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 石田明,小畑秀文: "「実環境下での音声/非音声の判別」" 日本音響学会誌. 47. 911-917 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 城風敏彦,牧野正三,城戸健一: "「音声情報の自動獲得機能を持つ分散型大規模音声デ-タベ-スK-DV」" 情報処理学会論文誌. 32. 62-70 (1991)
- Related Report
  1991 Annual Research Report
[Publications] S.Makino,A.Ito, M.Endo,K.Kido: ""A Japanese Text Dictation System Based on Phoneme Recognition and a Dependency Grammar"" IEICE Transactions. E74. 1773-1782 (1991)
- Related Report
  1991 Annual Research Report
[Publications] T.Morimoto,K.shikano, K.Kogure,H.Iida, A.Kurematsu: ""Integration of Speech Recognition and Language Processing in a Japanese to English Spoken Language Translation System"" IEICE Transactions. E74. 1889-1991 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 榑松明: "「自動翻訳電話のための音声情報処理」" 人工知能学会誌「ATR特集号」. (1991)
- Related Report
  1991 Annual Research Report
[Publications] 藤崎博也,広瀬啓吉,高橋登: "共通語および方言音声における基本周波数の統計的分布" 日本音響学会講演論文集. 251-252 (1991)
- Related Report
  1991 Annual Research Report
[Publications] Shuichi ITAHASHI: "Recent Speech Database Projects in Japan" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 24.1. 1081-1084 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Shichi ITAHASHI: "On Speech Database Efforts in Japan" Preprints of International Symposium on International Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese/i,,l/by Chinese and Japanese Speakers" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 15.10. 629-632 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Hiroya FUJISAKI and S.SATO: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 11.14. 473-476 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Hiroya FUJISAKI: "Influence of context and knowkedge on the Perception of Continuous Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 10.9. 417-420 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Katsuhiko SHIRAI: "Speaker Adaptable Phoneme Recognition Selecting Reliable Acoustic Features based on Mutual Information" Proc.of ICSLP90ーInternational conference on Spoken Language Processing. 353-356 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Katsuhiko SHIRAI: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc.of ICSLP90ーInternational conference on Spoken Language Processing. 345-348 (1990)
- Related Report
  1990 Annual Research Report
[Publications] K.GYOUTOKU,(H.KOBATAKE): "Maximum likelihood estimation of speech waveform under nonstationary noise environments" Proc.ICSLP90. 1149-1152 (1990)
- Related Report
  1990 Annual Research Report
[Publications] Hidefumi KOBATAKE: "Enhancement of noisy speech by maximum likelihood estimation" Proc.ICASSP90. (1991)
- Related Report
  1990 Annual Research Report
[Publications] 城風敏彦、(牧野正三): "発話全体の連続性を考慮した基本周波数の抽出" 電子情報通信学会論文誌. J73ーA. 1537-1539 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 松尾広、(牧野正三): "音素の持続時間モデルに基づく検証法を用いた単語音声認識" 電子情報通信学会論文誌. J73ーDーII. 1936-1944 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 城風敏彦、(牧野正三) 城戸健一: "音声情報の自動獲得機能を持つ分散型大規模音声デ-タベ-ス「KーDB」" 情報処理学会論文誌. 32. 62-70 (1991)
- Related Report
  1990 Annual Research Report
[Publications] 榑松明: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" SPEECH COMMUNICATION. Volume 9 No.4. 357-357 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 榑松明: "A Perspective of telephone interpretation research" Proceedign of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 森元逞(榑松明): "Spoken language translation toward realizing an automatic telephone interpretation system" Proceeding of InfoーJapan'90(情報処理学会30周年記念国際会議). 553-560 (1990)
- Related Report
  1990 Annual Research Report
[Publications] K.SHIRAI,H.FUJISAKI,S.ITAHASHI: "Speech database projects in Japan-Present and Future-" Proc.ESCA Workshop on Speech Input/Output Assesment and Speech Databases. 2,4,1-2,4,4 (1989)
- Related Report
  1989 Annual Research Report
[Publications] 許康仁,板橋秀一,伊藤正弘: "スペクトルモ-メントによる韓国語の単母音のホルマント抽出" 日本音響学会平成2年度春季研究発表会講演論文集. 3ー4ー3. (1990)
- Related Report
  1989 Annual Research Report
[Publications] 廖、牧野、城戸: "スペクトルの時間変化、ロ-カルピ-ク、傾斜を利用した破裂子音の検出と認識の検討" 日本音響学会誌. 45巻9号. 499-506 (1989)
- Related Report
  1989 Annual Research Report
[Publications] 遠藤、牧野、城戸: "LVQ2を用いた音素認識" 電子情報通信学会技術報告. SP89ー50. 33-40 (1989)
- Related Report
  1989 Annual Research Report
[Publications] 藤崎、広瀬、浅野: "高品質の音声合成に適したタ-ミナル・アナログ型音声合成器の構成" 日本音響学会秋季研究発表会講演論文集. 3ーPー2. (1989)
- Related Report
  1989 Annual Research Report
[Publications] 藤崎、広瀬、浅野: "ホルマント合成器による日本語音声の規則合成" 日本音響学会春季研究発表会講演論文集. (1990)
- Related Report
  1989 Annual Research Report
[Publications] 白井克彦、青木紀將、保坂尚樹: "MULTI-LEVEL CLUSTERING OF ACOUSTIC FEATURES FOR PHONEME RECOGNITION BASED ON MUTUAL INFORMATION" IEEE ICASSP. Vol.1. 604-607 (1989)
- Related Report
  1989 Annual Research Report
[Publications] 白井克彦、青木紀將: "音響特徴量の情報量に基づく階層的クラスタリングによる音韻認識" 電子情報通信学会論文誌Dー11 Vol.J72ーDー11. No.8. 1207-1214 (1989)
- Related Report
  1989 Annual Research Report
[Publications] H.Kobatake and A.Ishida: "Speech/Nonspeech Discrimination under Nonstationary Noise Environments" Proc.of the Third Symp.on Advanced Man-Machine Interface Through Spoken Language. 8.1-8.7 (1989)
- Related Report
  1989 Annual Research Report
[Publications] 石田明、小畑秀文: "実環境下における音声分類" 日本音響学会春期講演論文集. (1990)
- Related Report
  1989 Annual Research Report
[Publications] 榑松明他: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Proceeding of ESCA Workshop on Speech I/O Assessment. 2,3,1-2,3,4 (1989)
- Related Report
  1989 Annual Research Report

Creation of Japanese Speech Corpus for Speech Processing Research

Principal Investigator

ITAHASHI Shuichi Univ. of Tsukuba, Institute of Information Sciences and Electronics, Professor, 電子・情報工学系, 教授 (70151454)

¥24,800,000 (Direct Cost: ¥24,800,000)

Report

Research Products

[Publications] K.SHIRAI,H,FUJISAKI,S.ITAHASHI: "Speech database projects in Japan -Present and Future-" Proc.ESCA Workshop on Speech Input/Output Assesment and Speech Databases. 2,4,1-2,4,4 (1989)

Description

Related Report

[Publications] 許 康仁,板橋 秀一,伊藤 正弘: "スペクトルモ-メントによる韓国語の単母音のホルマント抽出" 日本音響学会平成2年度春季研究発表会講演論文集. 3ー4ー3. 273-274 (1990)

Description

Related Report

[Publications] 廖,牧野,城戸: "スペクトルの時間変化、ロ-カルピ-ク、傾斜を利用した破裂子音の検出と認識の検討" 日本音響学会誌. 45巻9号. 499-506 (1989)

Description

Related Report

[Publications] 遠藤,牧野,城戸: "LVQ2を用いた音素認識" 電子情報通信学会技術報告. SP89ー50. 33-40 (1989)

Description

Related Report

[Publications] 藤崎,広瀬,浅野: "高品質の音声合成に適したタ-ミナル・アナログ型音声合成器の構成" 日本音響学会秋季研究発表会講演論文集. 3ーPー2. 279-280 (1989)

Description

Related Report

[Publications] 藤崎,広瀬,浅野: "ホルマント合成器による日本語音声の規則合成" 日本音響学会春季研究発表会講演論文集. 1ー4ー6. 189-190 (1990)

Description

Related Report

[Publications] 白井 克彦,青木 紀將,保坂 尚樹: "MULTIーLEVEL CLUSTERING OF ACOUSTIC FEATURES FOR PHONEME RECOGNITION BASED ON MUTUAL INFORMATION" IEEE ICASSP. Vol.1. 604-607 (1989)

Description

Related Report

[Publications] 白井 克彦,青木 紀將: "音響特徴量の情報量に基づく階層的クラスタリングによる音韻認識" 電子情報通信学会論文誌DーII Vol.J72ーDーII. No.8. 1207-1214 (1989)

Description

Related Report

[Publications] H.Kobatake and A.Ishida: "Speech/Nonspeech Discrimination under Nonstationary Noise Environments" Proc.ICASSP89. 365-368 (1989)

Description

Related Report

[Publications] 石田 明,小畑 秀文: "実環境下における音声分類" 日本音響学会春期講演論文集. 1-2 (1990)

Description

Related Report

[Publications] 榑松 明他: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Proceeding of ESCA Workshop on Speech I/O Assessment. 2,3,1-2,3,4 (1989)

Description

Related Report

[Publications] Shuichi ITAHASHI: "Recent Speech Database Projects in Japan" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 24.1. 1081-1084 (1990)

Description

Related Report

[Publications] Shichi ITAHASHI: "On Speech Database Efforts in Japan" Preperints of International Symposium on International Coordination and Standardization of Speech Database and Assessment Techniques for Speech Input/Output. 57-63 (1990)

Description

Related Report

[Publications] Jingxu CUI: "A Comparison of the Articulation of the Chinese/i,,l/by Chinese and Japanese Speakers" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 15.10. 629-632 (1990)

Description

Related Report

[Publications] Hiroya FUJISAKI and S.SATO: "Proposal and Evaluation of a New Scheme for Reliable Pitch Extraction of Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 11.14. 473-476 (1990)

Description

Related Report

[Publications] Hiroya FUJISAKI: "Influence of context and knowkedge on the Perception of Continuous Speech" Proc.International Conference on Spoken Language Processing (ICSLP90),Kobe,Paper 10.9. 417-420 (1990)

Description

Related Report

[Publications] Katsuhiko SHIRAI: "Speaker Adaptable Phoneme Recognition Selecting Reliable Acoustic Features based on Mutual Information" Proc.of ICSLP90-International conference on Spoken Language Processing. 353-356 (1990)

Description

Related Report

[Publications] Katsuhiko SHIRAI: "Speech Synthesis Using Superposition of Sinusoidal Waves Generated by Synchronized Oscillators" Proc.of ICSLP90-Internatiional Conference on Spoken Lanaguage Processing. 345-348 (1990)

Description

Related Report

[Publications] K.GYOUTOKU,(H.KOBATAKE): "Maximum likelohood estimation of speech waveform under nonstatonary noise environments" Proc.ICSLP90. 1149-1152 (1990)

Description

Related Report

[Publications] 城風 敏彦,(牧野 正三): "発話全体の連続性を考慮した基本周波数の抽出" 電子情報通信学会論文誌. J73ーA. 1537-1539 (1990)

Description

Related Report

[Publications] 松尾 広,(牧野 正三): "音素の持続時間モデルに基づく検証法を用いた単語音声認識" 電子情報通信学会論文誌. J73ーDーII. 1936-1944 (1990)

Description

Related Report

[Publications] 城風 敏彦,(牧野 正三),城戸 健一: "音声情報の自動獲得機能を持つ分散型大規模音声デ-タベ-ス「KーDB」" 情報処理学会論文誌. 32. 62-70 (1991)

Description

Related Report

[Publications] 榑松 明: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" SPEECH COMMUNICATION. Volum9 No.4. 357-357 (1990)

Description

Related Report

[Publications] 榑松 明: "A Perspective of telephone interpretation research" Proceeding of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)

Description

Related Report

[Publications] 森元 逞,(榑松 明): "Spoken language translation toward realizing an automatic telephone interpretation system" Proceeding of InfoーJapan'90(情報処理学会30周年記念国際会議). 553-560 (1990)

Description

[Publications] 許康仁,板橋秀一,伊藤正弘: "スペクトルモ-メントによる韓国語の単母音のホルマント抽出" 日本音響学会平成2年度春季研究発表会講演論文集. 3ー4ー3. 273-274 (1990)

[Publications] 白井克彦,青木紀將,保坂尚樹: "MULTIーLEVEL CLUSTERING OF ACOUSTIC FEATURES FOR PHONEME RECOGNITION BASED ON MUTUAL INFORMATION" IEEE ICASSP. Vol.1. 604-607 (1989)

[Publications] 白井克彦,青木紀將: "音響特徴量の情報量に基づく階層的クラスタリングによる音韻認識" 電子情報通信学会論文誌DーII Vol.J72ーDーII. No.8. 1207-1214 (1989)

[Publications] 石田明,小畑秀文: "実環境下における音声分類" 日本音響学会春期講演論文集. 1-2 (1990)

[Publications] 榑松明他: "ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis" Proceeding of ESCA Workshop on Speech I/O Assessment. 2,3,1-2,3,4 (1989)

[Publications] 城風敏彦,(牧野正三): "発話全体の連続性を考慮した基本周波数の抽出" 電子情報通信学会論文誌. J73ーA. 1537-1539 (1990)

[Publications] 松尾広,(牧野正三): "音素の持続時間モデルに基づく検証法を用いた単語音声認識" 電子情報通信学会論文誌. J73ーDーII. 1936-1944 (1990)

[Publications] 城風敏彦,(牧野正三),城戸健一: "音声情報の自動獲得機能を持つ分散型大規模音声デ-タベ-ス「KーDB」" 情報処理学会論文誌. 32. 62-70 (1991)

[Publications] 榑松明: "ATR Japanese Speech Database as a tool of Speech Recognition and Synthesis" SPEECH COMMUNICATION. Volum9 No.4. 357-357 (1990)

[Publications] 榑松明: "A Perspective of telephone interpretation research" Proceeding of Pacific Rim International Conference on Artificial Intelligence. 11-16 (1990)

[Publications] 森元逞,(榑松明): "Spoken language translation toward realizing an automatic telephone interpretation system" Proceeding of InfoーJapan'90(情報処理学会30周年記念国際会議). 553-560 (1990)

[Publications] 板橋秀一: "騒音デ-タベ-スと日本語共通音声デ-タDAT版" 日本音響学会誌. 47. 951-953 (1991)

[Publications] 白井克彦: "「音声認識における特徴抽出」" 電子情報通信学会誌. 73. 1269-1275 (1991)

[Publications] 石田明,小畑秀文: "「実環境下での音声/非音声の判別」" 日本音響学会誌. 47. 911-917 (1991)

[Publications] 榑松明: "「自動翻訳電話のための音声情報処理」" 人工知能学会誌「ATR特集号」. (1991)

[Publications] 藤崎博也,広瀬啓吉,高橋登: "共通語および方言音声における基本周波数の統計的分布" 日本音響学会講演論文集. 251-252 (1991)