• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Develoment of Spoken Dialogue System for Japanese and Chinese

Research Project

Project/Area Number 08558028
Research Category

Grant-in-Aid for Scientific Research (A)

Allocation TypeSingle-year Grants
Section展開研究
Research Field Intelligent informatics
Research InstitutionThe University of Tokyo

Principal Investigator

HIROSE Keikichi  Univ.of Tokyo, Dept.of Inf.and Commu.Engg., Professor, 大学院・工学系研究科, 教授 (50111472)

Co-Investigator(Kenkyū-buntansha) KOSUGI Yasuhiro  Tokyo Electric Power Co., Telecom.Engg.Dept., Chief Researcher, システム研究所, 主任研究職員
MINEMATSU Nobuaki  Toyohashi Univ.of Tech., Dept.of Inf.& Computer Sciences, Assistant, 工学部, 助手 (90273333)
OHNO Sumio  Tokyo Science Univ., Dept.of Applied Electronics, Assistant, 基礎工学部, 助手 (80256677)
鈴木 敏克  東京電力株式会社, システム研究所, 主任(研究職)
Project Period (FY) 1996 – 1998
Project Status Completed (Fiscal Year 1998)
Budget Amount *help
¥8,000,000 (Direct Cost: ¥8,000,000)
Fiscal Year 1998: ¥1,700,000 (Direct Cost: ¥1,700,000)
Fiscal Year 1997: ¥1,700,000 (Direct Cost: ¥1,700,000)
Fiscal Year 1996: ¥4,600,000 (Direct Cost: ¥4,600,000)
KeywordsSpoken Dialogue System / Speech Recognition / Speech Synthesis / Multi-lingual System / Viterbi Bayesian Predictive Classification / Waveform Concatenation Synthesis / Prosodic Modelig / Tone Recognition / ビタビ探索 / TD-PSOLA / 対話処理 / 言語自動識別 / ベイズ予測分類 / HMM
Research Abstract

With the aim of developing a spoken dialogue system for both Japanese and Chinese in order to check the possibility of realizing practical systems of multilingual spoken dialogue, the following major results were obtained.
1. After selecting literature retrieval as the system task, we have arranged necessary databases and installed dictionary for speech synthesis. Also a speech corpus was cotstructed for training and evaluating speech recognition.
2. Phoneme HMM's and phoneme class HMM's were trained using the corpus. A method was developed to identify the input speech being Japanese or Chinese based on the phoneme/phoneme class sequences.
3. A robust speech recognition method was developed based on Bayesian predictive classification with Viterbi approximation. An adaptation method was further proposed, where improved posterior probability density function was estimated via sequential Bayesian learning using adaptation data. Another robust method, minimax, was also investigated to make it … More applicable to continuous speech.
4. An automatic Waveform concatenation speech synthesis method was developed. This method is based on segmenting speech waveform using speech recognition technique, and automatically placing pitch marks after LMA analysis. It was utilized for the Chinese speech synthesis.
5. Waveform concatenation synthesizer was combined with formant synthesizer to generate a new speech synthesis system. This system was shown to improve several low quality phonemes.
6. A speech synthesis oriented modeling of ; Chinese prosody was developed based on the newly defined function for unified representation of Chinese fundamental frequency contours.
7. A method was developed for precise tone recognition of Chinese continuous speech. This method is based on using features of tone nucleus of a syllable only.
8. A Japanese/Chinese spoken dialogue system was constructed (or literature retrieval. Chinese responses were pre-stored sentences, while Japanese responses were generated from semantic representations. The system was confirmed to operate both in Japanese and Chinese. Less

Report

(4 results)
  • 1998 Annual Research Report   Final Research Report Summary
  • 1997 Annual Research Report
  • 1996 Annual Research Report
  • Research Products

    (70 results)

All Other

All Publications (70 results)

  • [Publications] Keikichi HIROSE: "Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features" Proc.International Conference on Spoken Language Processing. 1. 378-381 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui JIANG: "Robust speech recognition based on Viterbi Bayesian predictive classification" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. 2. 1551-1554 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] HingKeung KWAN: "Use of recurrent neural network for unknown language rejection in language identification system" Proc.5th European Conference on Speech Communication and Technology. 1. 63-66 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jin-Fu NI: "Quantitative analysis and formulation of tone concatenation in Chinese F_0 contours" Proc.5th European Conference on Speech Communication and Technology. 1. 195-198 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui JIANG: "Sequential Bayesian learning of CDHMM based on finite mixture approximation of its prior/posterior density" Proc.IEEE Automatic Speech Recognition Workshop. 373-380 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Yoram MERON: "Waveform concatenation speech synthesis using phonetic clustering and sinusoidal modeling" 電子情報通信学会技術研究報告(音声研究会). 49-56 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 長谷川澄志: "柔軟な構成のターミナルアナログ音声合成システムとそれによる音声合成実験" 日本音響学会講演論文集. I. 195-196 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui JIANG: "Improving Viterbi Bayesian predictive classification via sequential Bayesian learning in robust speech recognition" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. 1. 77-80 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] HingKeung KWAN: "N-gram modeling based on recognized phonemes in automatic language identification" IEICE Trans.Information and Systems. E81-D・11. 1224-1231 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui JIANG: "A minimax search algorithm for CDHMM based robust continuous speech recognition" Proc.International Conference on Spoken Language Processing. 2. 389-392 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jin-song ZHANG: "A robust tone recognition method of Chinese based on sub-syllabic F_0 contours" Proc.International Conference on Spoken Language Processing. 3. 703-706 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Keikichi HIROSE: "On the relationship of speech rates with prosodic units in dialogue speech" Proc.International Conference on Spoken Language Processing. 5. 1979-1982 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jinfu NI: "A Synthesis-oriented model of phrasal pitch movements in standard Chinese" Proc.International Conference on Spoken Language Processing. 7. 3317-3320 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 桐山伸也: "文献検索をタスクとした音声対話システムの検討" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 村松茂樹: "波形編集型音声合成におけるエコー抑制の検討" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 広瀬啓吉: "対話音声の生成(「音声による人間と機会の対話」の第4章)" オーム社, 375(14) (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Keikichi Hirose, Mayumi Sakata and Hiromichi Kawanami: "Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features" Proc.International Conference on Spoken Language Processing. 1. 378-381 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jinsong Zhang, Beiqian Dai, Changfu Wang, Hingkeung Kwan, Keikichi Hirose: "Adaptive recognition method based on posterior use of distribution pattern of output probabilities" Proc.International Conference on Spoken Langueage Processing. 2. 1129-1132 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hingkeung Kwan and Keikichi Hirose: "Unknown language rejection in language identification system" Proc.International Conference on Spoken Language Processing. 3. 1776-1779 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui Jiang, Keikichi Hirose and Qiang Huo: "Robust speech recognitio based on Viterbi Bayesian predictive classification" Proc.IEEE International Conference on Acoustics, Speech, & Signal Processing. 2. 1551-1554 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] HingKeung Kwan and Keikichi Hirose: "Use of recurrent neural network for unknown language rejection in language identification system" Proc.European Conference on Speech Communication and Technology. 1. 63-66 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jin-Fu Ni, Ren-Hua Wang and Keikichi Hirose: "Quantitative analysis and formulation of tone concatenation in Chinese F_0 contours" Proc.European Conference on Speech Communication and Technology. 1. 195-198 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui Jiang, Keikichi Hirose and Qiang Huo: "Sequential Bayesian learning of CDHMM based on finite mixture approximation of its prior/posterior density" Proc.IEEE Automatic Speech Recognition Workshop, IEEE SP Society. 373-380 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Yoram Meron and Keikichi Hirose: "Waveform concatenation speech synthesis using phonetic clustering and sinusoidal modeling" IEICE Tech.Report (Speech Research). SP97-94. 49-56 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Kiyoshi Hasegawa, Keikichi Hirose: "Terminal analogue speech synthesis with highly-flexible configuration and speech synthesis experiments" Record of Spring Meeting of Acoustical Society of Japan. I. 195-196 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui Jiang, Keikichi Hirose and Qiang Huo: "Improving Viterbi Bayesian predictive classification via sequential Bayesian learning in robust speech recognition" Proc.IEEE International Conference on Acoustics, Speech, & Signal Processing. 1. 77-80 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] HingKeung Kwan and Keikichi Hirose: "N-gram modeling based on recognized phonemes in automatic language identification" IEICE Trans.Information and Systems. E81-D,11. 1224-1231 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Hui Jiang, Keikichi Hirose and Qiang Huo: "A minimax search algorithm for CDHMM based robust continuous speech recognition" Proc.International Conference on Spoken Language Processing. 2. 389-392 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jin-song Zhang and Keikichi Hirose: "A robust tone recognition method of Chinese based on sub-syllabic F0 contours" Proc.International Conference on Spoken Language Processing. 3. 703-706 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Keikichi Hirose and Hiromichi Kawanami: "On the relationship of speech rates with prosodic units in dialogue speech" Proc.International Conference on Spoken Language Processing. 5. 1979-1982 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jinfu Ni, Goh Kawai and Keikichi Hirose: "A Synthesis-oriented model of phrasal pitch movements in standard Chinese" Proc.International Conference on Spoken Language Processing. 7. 3317-3320 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Shinya Kiriyama and Keikichi Hirose: "Study on a speech dialogue system for literature retrieval" Record of Spring Meeting of Acoustical Society of Japan. I (to be published). (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Shigeki Muramatsu, Tetsuya Hasue and Keikichi Hirose: "On the echo reduction in concatenatve speech synthesis" Record of Spring Meeting of Acoustical Society of Japan. I (to be publiched). (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Keikichi Hirose: "Generation of dialogue speech" Spoken Dialogue between Man and Machine, Ohm, Chapter 4. 67-80 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Keikichi Hirose: "Accent type recognition and syntactic boundary detection of Japanese using statistical modeling of moraic transitions of fundamental frequency contours" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. 1. 25-28 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 岩野公司: "モーラ遷移確率モデルを用いたアクセント型の識別とによるアクセント句境界の検出" 電子情報通信学会技術研究報告(音声研究会). 1-8 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 広瀬啓吉: "韻律情報の処理" 信号処理. 2・6. 415-423 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Koji Iwano: "Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese" Proc.International Conference on Spoken Language Processing. 3. 599-602 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Jin-song Zhang: "A robust tone recognition method of Chinese based on sub-syllabic F0 contours" Proc.International Conference on Spoken Language Processing. 3. 703-706 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Keikichi Hirose: "On the relationship of speech rates with prosodic units in dialogue speech" Proc.International Conference on Spoken Language Processing. 5. 1979-1982 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Jinfu Ni: "A Synthesis-oriented model of phrasal pitch movements in standard Chinese" Proc.International Conference on Spoken Language Processing. 7. 3317-3320 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 岩野公司: "モーラ遷移確率モデルによるアクセント句境界検出と連続音声認識への応用" 情報処理学会研究報告(音声言語情報処理研究会). 73-78 (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] Jin-song Zhang: "Modeling contextual tone variations in F0 contour for Chinese tone recognition" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] Jin-song Zhang: "Lexical tone recognition based on tone-critical segment" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] Jinfu Ni: "Formulation of Chinese pitch phenomena using a tuning scheme" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 桐山伸也: "文献検索をタスクとした音声対話システムの検討" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 村松茂樹: "波形編集型音声合成におけるエコー抑制の検討" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 岩野公司: "句境界情報を利用した語彙制約のない姓名認識" 日本音響学会研究発表会講演論文集. I(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] Koji Iwano: "Prosodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. 1(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 岩野公司: "モーラを単位とした基本周波数パターンの確率モデル化とそれによるアクセント句境界の検出" 情報処理学会論文誌. 40・4(発表予定). (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 江 輝: "Use of less-informative Bayesian predictive classification for noisy speech recognition" Proc.1^<st> China-Japan Workshop on Spoken Language Processing. 169-174 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 倪 晋富: "A quantitative model for generating sentence F_0 contours of spoken Chinese" Proc.1^<st> China-Japan Workshop on Spoken Language Processing. 103-110 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 江 輝: "Robust speech recognition based on Viterbi Bayesian predictive classification" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. 2. 1551-1554 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] MERON Yoram: "Waveform concatenation speech synthesis using phonetic clustering and automatic unit selection" 日本音響学会平成9年度秋季研究発表会講演論文集. I. 263-264 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 関 慶強: "Use of recurrent neural network for unknown language rejection in language identification system" Proc.5th European Conference on Speech Communication and Technology. 1. 63-66 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 倪 晋富: "Quantitative analysis and formulation of tone concatenation in Chinese F_0 contours" proc.5th European Conference on Speech Communication and Technology. 1. 195-198 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 江 輝: "Sequential Bayesian learning of CDHMM based on finite mixture approximation of its prior/posterior density" Proc.IEEE Automatic Speech Recognition Workshop. 373-380 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] MERON Yoram: "Waveform concatenation speech synthesis using phonetic clustering and sinusoidal modeling" 電子情報通信学会技術研究報告(音声研究会). 49-56 (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 川波弘道: "対話音声における発話速度の分析と韻律規則の作成" 日本音響学会講演論文集. (発表予定). (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 長谷川澄志: "柔軟な構成のターミナルアナログ音声合成システムとそれによる音声合成実験" 日本音響学会講演論文集. (発表予定). (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 江 輝: "Improving Viterbi Bayesian predictive classifiation via sequential Bayesian learning in robust speech recognition" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. (発表予定). (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 広瀬啓吉: "対話音声の生成(「音声による人間と機械の対話」の第4章)" オーム社, 375(14) (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 広瀬啓吉: "音声対話システムの出力音声の韻律的特徴の合成" 人工知能学会全国大会論文集. 399-402 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 広瀬啓吉: "Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features" Proc.International Conference on Spoken Language Processing. 1. 378-381 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 張勁松: "Adaptive recognition method based on posterior use of distribution pattern of output probabilities" Proc.International Conference on Spoken Language Processing. 2. 1129-1132 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 関慶強: "Unknown language rejection in language identification system" Proc.International Conference on Spoken Language Processing. 3. 1776-1779 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 広瀬啓吉: "Use of prosodic features in speech recognition(Invited)" Proc.IEEE Invited Workshop on Pattern Recognition for Multimedia Techniques(IEEE Taegu Section). 99-108 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 江輝: "Robust speech recognition based on Bayesian predictive approach" 電子情報通信学会技術研究報告. SP96-93. 45-52 (1997)

    • Related Report
      1996 Annual Research Report
  • [Publications] 川波弘道: "対話音声の韻律的特徴の定量的分析とそれによる音声合成" 情報処理学会研究報告. 97-SLP-15-3. 15-20 (1997)

    • Related Report
      1996 Annual Research Report
  • [Publications] 江輝: "Robust speech recognition based on Viterbi Bayesian predictive classification" Proc.IEEE International Conference on Acoustics,Speech,& Signal Processing. (発表予定). (1997)

    • Related Report
      1996 Annual Research Report

URL: 

Published: 1996-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi