• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

韻律に着目した音声言語情報処理の高度化

Research Project

Project/Area Number 12132101
Research Category

Grant-in-Aid for Scientific Research on Priority Areas

Allocation TypeSingle-year Grants
Review Section Science and Engineering
Research InstitutionThe University of Tokyo

Principal Investigator

広瀬 啓吉  東京大学, 大学院・情報理工学研究科, 教授 (50111472)

Co-Investigator(Kenkyū-buntansha) 藤崎 博也  東京大学, 大学院・工学系研究科, 名誉教授 (80010776)
柳田 益造  同志社大学, 工学部, 教授 (00116120)
北澤 茂良  静岡大学, 情報学部, 教授 (00109018)
尾関 和彦  電気通信大学, 電気通信学部, 教授 (50214135)
小林 哲則  早稲田大学, 理工学部, 教授 (30162001)
Project Period (FY) 2000 – 2004
Project Status Completed (Fiscal Year 2004)
Budget Amount *help
¥54,200,000 (Direct Cost: ¥54,200,000)
Fiscal Year 2004: ¥1,500,000 (Direct Cost: ¥1,500,000)
Fiscal Year 2003: ¥18,900,000 (Direct Cost: ¥18,900,000)
Fiscal Year 2002: ¥17,100,000 (Direct Cost: ¥17,100,000)
Fiscal Year 2001: ¥16,700,000 (Direct Cost: ¥16,700,000)
Keywords韻律 / 音声言語情報処理 / 学術図書出版 / 成果報告書 / 最終報告 / ホームページ / 国際会議 / 総括班 / 音声言語処理 / 全体会議 / シンポジウム / プロシーディングス / 韻律コーパス / 成果報告
Research Abstract

領域の成果取りまとめのため、総括班が引き続き活動し、以下の成果を達成した。
1.領域の最終報告:平成16年7月に会合を行って各研究班の成果取りまとめの方針を策定した。それに従い、有機的な統合としての最終報告の立案を行い。10月に最終報告(ヒアリング)を行った。
2.和文学術図書の出版準備:領域全体を対象とした研究成果を整理した上で、和文学術図書の章立てを行った。それをもとに領域のメンバーに執筆依頼し、執筆された原稿の添削、調整を行った。出版社として丸善を選定し、平成16年11月に研究成果公開促進費「学術図書」の申請を行った。
3.英文学術図書の出版準備:モデル、多様性、コーパス、合成、認識、対話システム、医療応用の各項目について、その研究内容を概説した上で、領域の顕著な成果を選択して、関連研究における位置付けも含め概説する。このような観点から、執筆者を領域のメンバーから厳選するとともに、海外の著名研究者にも執筆依頼することを進めた。出版社については、欧米誌の主要出版元であるElsevierと交渉中である。
4.成果報告書の作成・配布:領域の研究者から研究期間に挙げた成果の研究論文を徴し、とりまとめて、最終成果報告書を作成した。関連研究機関に配布した。
5.成果の公開:http://www.gavo.t.u-tokyo.ac.jp/tokutei_pub/に、研究遂行によって得たソフトウエア、システム、データベース等の成果物を公開した。また、ヒアリング説明資料、最終成果報告書を、アップロードした。
6.国際会議での研究紹介:平成16年10月の音声言語処理国際会議(ICSLP2004)等に代表者が韻律関係の特別セッションを組み、分担者らとともに、研究調査と研究発表を行った。また、海外研究者によるサーベイを受けた。

Report

(5 results)
  • 2004 Annual Research Report
  • 2003 Annual Research Report
  • 2002 Annual Research Report
  • 2001 Annual Research Report
  • 2000 Annual Research Report
  • Research Products

    (77 results)

All 2005 2004 Other

All Journal Article (14 results) Book (1 results) Patent(Industrial Property Rights) (1 results) Publications (61 results)

  • [Journal Article] Synthesis of F_0 contours using generation process model parameters predicted from unlabeled corpora -Application to emotional speech synthesis-2005

    • Author(s)
      Keikichi Hirose
    • Journal Title

      Speech Communication (Special Issue on Quantitative prosody modeling for natural speech description and generation) (発表予定)

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Identification and synthesis of Cantonese tones based on the command-response model for F0 contour generation2005

    • Author(s)
      Wentao Gu
    • Journal Title

      Proceedings IEEE International Conference on Acoustics, Speech, & Signal Processing (発表予定)

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 肯定的/否定的発話態度の認識とその音声対話システムへの応用2005

    • Author(s)
      藤江真也
    • Journal Title

      電子情報通信学会論文誌 (発表予定)

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Use of prosodic features for speech recognition2004

    • Author(s)
      Keikichi Hirose
    • Journal Title

      Proceedings 8^<th> International Conference on Spoken Language Processing 2

      Pages: 1445-1448

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Improvement in corpus-based generation of F_0 contours using generation process model for emotional speech synthesis2004

    • Author(s)
      Keikichi Hirose
    • Journal Title

      Proceedings 8^<th> International Conference on Spoken Language Processing 2

      Pages: 1349-1352

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model2004

    • Author(s)
      Keikichi Hirose
    • Journal Title

      Proceedings of 5^<th> ISCA Speech Synthesis Workshop

      Pages: 162-166

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Robust acoustic modeling of contextual tonal F0 variations on the basis of tone nucleus framework2004

    • Author(s)
      JinSon Zhang
    • Journal Title

      Proceedings IEEE International Conference on Acoustics, Speech, & Signal Processing 1

      Pages: 913-916

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Automatic extraction of tone command parameters for the model of F0 contour generation for standard Chinese2004

    • Author(s)
      Wentao Gu
    • Journal Title

      IEICE Trans.Information and Systems (Special Issue on Speech Dynamics by Ear, Eye, Mouth and Machine) E87-D・5

      Pages: 1079-1085

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Tone nucleus modeling for Chinese lexical tone recognition2004

    • Author(s)
      JinSon Zhang
    • Journal Title

      Speech Communication 42・3-4

      Pages: 447-466

    • Related Report
      2004 Annual Research Report
  • [Journal Article] The command-response model for the generation of F_0 contours of Cantonese utterances2004

    • Author(s)
      Hiroya Fujisaki
    • Journal Title

      Proceedings 7^<th> International Conference on Signal Processing 1

      Pages: 655-658

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Acoustic and prosodic analysis of Japanese vowel-vowel hiatus with laryngeal effect2004

    • Author(s)
      Shigeyoshi Kitazawa
    • Journal Title

      Proceedings 8^<th> International Conference on Spoken Language Processing 2

      Pages: 1289-1292

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Dependency analysis of read Japanese sentences using pause and F0 information : A speaker independent case2004

    • Author(s)
      Kazuyuki Takagi
    • Journal Title

      Proceedings 8^<th> International Conference on Spoken Language Processing 4

      Pages: 3021-3024

    • Related Report
      2004 Annual Research Report
  • [Journal Article] A low-band spectrum envelope reconstruction method for PSOLA-based F0 modification2004

    • Author(s)
      Ryo Mochizuki
    • Journal Title

      IEICE Trans.Information and Systems E87-D・10

      Pages: 2426-2429

    • Related Report
      2004 Annual Research Report
  • [Journal Article] F0 and formant frequency distribution of dysarthric speech - A comparative study2004

    • Author(s)
      Hiroki Mori
    • Journal Title

      Proceedings 8^<th> International Conference on Spoken Language Processing 1

      Pages: 85-88

    • Related Report
      2004 Annual Research Report
  • [Book] 韻律に着目した音声言語情報処理の高度化 研究成果報告書2005

    • Author(s)
      広瀬啓吉(編)
    • Total Pages
      400
    • Publisher
      特定領域研究「韻律と音声処理」総括班
    • Related Report
      2004 Annual Research Report
  • [Patent(Industrial Property Rights)] 特許(出願中)2004

    • Inventor(s)
      張 勁松他
    • Industrial Property Rights Holder
      ATR
    • Industrial Property Number
      2004-074594
    • Filing Date
      2004-03-16
    • Related Report
      2004 Annual Research Report
  • [Publications] Atsuhiro Sakurai: "Data-driven generation of F0 contours using a superpositional model"Speech Communication. 40・4. 535-549 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Carlos Toshinori Ishi: "Mora F0 representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values"Speech Communication. 41・2-3. 441-453 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Jinsong Zhang: "Tone nucleus modeling for Chinese lexical tone recognition"Speech Communication. (発表予定). (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose: "Corpus-based synthesis of F0 contours for emotional speech using the generation process model"Proceedings 15th International Congress of Phonetic Sciences. III. 2945-2948 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose: "Use of linguistic information for automatic extraction of F0 contour generation process model parameters"Proceedings 8th European Conference on Speech Communication and Technology. 1. 141-144 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose: "Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model"Proceedings 8th European Conference on Speech Communication and Technology. 1. 333-336 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose: "Speech generation from concept for realizing conversation with an agent in a virtual room"Proceedings 8th European Conference on Speech Communication and Technology. 3. 1693-1696 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose: "Speech prosody in spoken language processing (invited)"Proceedings International Conference on Computer and Information Technology. 1. 20-27 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Shigeyoshi Kitazawa: "An evaluation of J-TOBI on the Japanese multext prosodic corpus"Proceedings Oriental COCOSDA. 201-208 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Kazuyuki Takagi: "A neural network approach to dependency analysis of Japanese sentences using prosodic information"Proceedings 8th European Conference on Speech Communication and Technology. 4. 3177-3180 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Lu Meirong: "The use of multiple pause information in dependency structure analysis of spoken Japanese sentences"Proceedings 8th European Conference on Speech Communication and Technology. 4. 3173-3176 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Yoshinori Sagisaka: "Generation and perception of F0 markedness in conversational speech with adverbs expressing degrees"Proceedings 8th European Conference on Speech Communication and Technology. 1. 173-176 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Yoshinori Sagisaka: "Modeling and perception of temporal characteristics in speech (invited)"Proceedings 15th International Congress of Phonetic Sciences. 1. 1-6 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] 広瀬啓吉(編): "韻律に着目した音声言語情報処理の高度化 2003年度研究成果報告書"特定領域研究「韻律と音声処理」総括班. 150 (2004)

    • Related Report
      2003 Annual Research Report
  • [Publications] Keikichi Hirose(編): "Proceedings of the International Conference on Speech Prosody"Speech Prosody 2004 実行委員会. 800 (2004)

    • Related Report
      2003 Annual Research Report
  • [Publications] Shinya Kiriyama: "Development and evaluation of a spoken dialogue system for academic document retrieval with a focus on reply generation"Systems and Computers in Japan. 33・4. 25-39 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] 成澤修一: "音声の基本周波数パターン生成過程モデルのパラメータ自動抽出法"情報処理学会論文誌. 43・7. 2155-2168 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Atsuhiro Sakurai: "Data-driven generation of FO contours using a superpositional model"Speech Communication. (発表予定). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Carlos Toshinori Ishi: "Mora FO representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values"Speeh Communication. (発表予定). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Keikichi Hirose: "Data-driven synthesis of fundamental frequency contours for TTS systems based on a generation process model"Proc. Speech Prosody 2002. 391-394 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Keikichi Hirose: "Prosody in speech technology -National project and some of our related works -(invited)"Proc. Acoustical Society of Korea. 15-18 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Keikichi Hirose: "Statistical language modeling with prosodic boundaries and its use for continuous speech recognition"Proc. International Conference on Spoken Language Processing. 2. 937-940 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Keikichi Hirose: "Improved corpus-based synthesis of fundamental frequency contours using generation process model"Proc. International Conference on Spoken Language Processing. 3. 2085-2088 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Keikichi Hirose: "Corpus-based synthesis of FO contours for emotional speech using the generation process model"Proceedings 15th International Congress of Phonetic Sciences. (発表予定). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] 柳田益造: "知的・感性情報処理の対象としての音楽"日本音響学会誌. 59・3. 159-164 (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tatsuya Kitamura: "Influence of prosody, context, and word order in the identification of focus in Japanese dialogue"Proc. International Conference on Speech Processing. 2. 1185-1188 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Kazuyuki Takagi: "Combination of pause and FO information in dependency analysis of Japanese sentences"Proc. International Conference on Speech Processing. 2. 1173-1176 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Makiko Muto: "Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech"Proc. International Conference on Speech Processing. 2. 762-765 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] 大塚 貴弘: "音源パルス列を考慮した頑健なARX音声分析法"日本音響学会誌. 58,7. 386-397 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] 広瀬啓吉(編): "Proceedings for 2002 2nd Plenary Meeting & Symposium on Prosody and Speech processing"特定領域研究「韻律と音声処理」総括班. 200 (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] 広瀬啓吉: "Temporal rate change of dialogue speech in prosodic units as compared to read speech"Speech Communication. 36・1-2. 97-111 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 桐山伸也: "Development and evaluation of a spoken dialogue system for academic document retrieval with a focus on reply generation"Systems and Computers in Japan. (掲載予定). (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "Prosody, an important feature for advanced spoken language processing technology(招待)"Proc. International Conference on Speech Processing. 1. 35-40 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "Corpus-based synthesis of fundamental frequency contours based on a generation process model"Proc. European Conference on Speech Communication and Technology. 3. 2255-2258 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "Continuous speech recognition of Japanese using prosodic word boundaries detected by mora transition modeling of fundamental frequency contours"Proceedings ISCA Tutorial and Research Workshop on : Prosody in Speech Recognition and Understanding. 61-66 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 桐山伸也: "Control of prosodic focuses for reply speech generation in a spoken dialogue system of information retrieval on academic documents"Proc. Speech Prosody 2002. (発表予定). (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "N-gram language modeling of Japanese using prosodic boundaries"Proc. Speech Prosody 2002. (発表予定). (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "Data-driven synthesis of fundamental frequency contours for TTS systems based on a generation process model"Proc. Speech Prosody 2002. (発表予定). (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 成澤修一: "A method for automatic extraction of model parameters from fundamental frequency contours of speech"Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing. (発表予定). (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 光本浩士: "終助詞「ね」の韻律による皮肉の識別"電子情報通信学会論文誌. J84-D-I・5. 851-853 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 北澤茂良: "Preliminary study of Japanese MULTEXT : a prosodic corpus"Proc. International Conference on Speech Processing. 2. 825-828 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 尾関和彦: "The use of prosody in Japanese dependency structure analysis"Proc. ISCA Tutorial and Research Workshop on: Prosody in Speech Recognition and Understanding. 123-126 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 松坂要佐: "グループ会話に参与する対話ロボットの構築"電子情報通信学会論文誌. J84-D-II・6. 898-908 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 大塚 貴弘: "Aperiodicity control in ARX-based speech analysis-synthesis method"Proc. 7th European Conference on Speech Communication and Technology. 1. 545-558 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉(編): "Proceedings for 2001 2nd Plenary Meeting & Symposium on Prosody and Speech processing"特定領域研究(B)「韻律と音声処理」総括班. 184 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 広瀬啓吉: "Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition"Proc.IEEE International Conf.on Acoustics, Speech, & Signal Processing. 3. 1763-1766 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 岩野公司: "語彙制約なし音声認識へのアクセント句境界検出の統合"電子情報通信学会論文誌. J83-D-II・10. 1977-1985 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 倪晋富: "Automatic prosodic labeling of multiple languages based on a functional modeling of fundamental frequency contours"Proc.Workshop on Multi-lingual Speech Communication. 65-68 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 広瀬啓吉: "Analytical and perceptual study on the role of acoustic features in realizing emotional speech"Proc.International Conf.on Spoken Language Processing. 2. 369-372 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 桜井淳宏: "Data-driven intonation modeling using a neural network and a command response model"Proc.International Conf.on Spoken Language Processing. 3. 223-226 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 桜井淳宏: "Modeling and generation of accentual phrase F0 contours based on discrete HMMs synchronized at mora-unit transitions"Proc.International Conf.on Spoken Language Processing. 3. 259-262 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 李時旭: "Efficient search strategy in large vocabulary continuous speech recognition using prosodic boundary information"Proc.International Conf.on Spoken Language Processing. 4. 274-277 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 広瀬啓吉: "Temporal rate change of dialogue speech in prosodic units as compared to read speech"Speech Communication. (掲載予定). (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 桜井淳宏: "Generation of F0 contours using model-constrained data-driven method"Proceedings IEEE International Conference on Acoustics, Speech, & Signal Processing, Proc.IEEE International Conf.on Acoustics, Speech, & Signal Processing. (掲載予定). (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 大野澄雄: "Analysis and modeling of the effect of paralinguistic information upon the local spech rate"Proc.International Conf.on Spoken Language Processing. 2. 353-356 (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 張燕: "中国語連続音声における声調パターンの変形現象とその規則性"電子情報通信学会技術報告(音声研究会). 1-8 (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 柳田益造: "Augmented Backward DP for Recognizing Stuttered Speech"Proc.COST254 Intelligent Terminal WS "Friendly Exchanging through the Net". 123-128 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 光本浩士: "終助詞「ね」の韻律による皮肉の識別"電子情報通信学会論文誌. J84-D-II(掲載予定). (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 廣瀬幸由: "Effectiveness of prosodic features in syntactic analysis of read Japanese sentences"Proc.International Conf.on Spoken Language Processing. 3. 215-218 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 尾本幸宏,村井則之,小林哲則: "複数対話音声における韻律パターンの利用に関する検討"日本音響学会研究発表会講演論文集. (掲載予定). (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 広瀬啓吉(編): "韻律に着目した音声言語情報処理の高度化 研究成果報告書"特定領域研究(B)「韻律と音声処理」総括班. 168 (2001)

    • Related Report
      2000 Annual Research Report

URL: 

Published: 2001-04-01   Modified: 2018-03-28  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi