2004 Fiscal Year Final Research Report Summary
Automatic voice building for flexible speech synthesis
Project/Area Number | 14380160 |
Research Category | Grant-in-Aid for Scientific Research (B) |
Allocation Type | Single-year Grants |
Section | General |
Research Field | Intelligent informatics |
Research Institution | Nagoya Institute of Technology |
Principal Investigator | TOKUDA Keiichi, Nagoya Institute of Technology, Graduate School of Engineering, Professor (20217483) |
Co-Investigator(Kenkyū-buntansha) |
KITAMURA Tadashi, Nagoya Institute of Technology, Graduate School of Engineering, Professor (60114865)
KOBAYASHI Takao, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (70153616)
MASUDA Takashi, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Research Associate (90272715)
|
Project Period (FY) | 2002 – 2004 |
Keywords | speech synthesis / voice quality / emotional speech / HMM-based speech synthesis / labeling / automatic voice building / PLEd |
Research Abstract |
The increasing availability of large speech databases makes it possible to construct speech synthesis systems by applying statistical learning algorithms, an approach referred to as data-driven or corpus-based. These systems, which can be trained automatically, not only generate natural, high-quality synthetic speech but can also reproduce the voice characteristics of the original speaker. However, to make the whole voice building process fully automatic, the speech databases themselves must be constructed automatically. In this research, we investigated automatic voice building techniques for an HMM-based speech synthesis system that can synthesize speech with various voice qualities. First, we implemented a GUI-based labeling tool called PLEd (Prosody and Linguistic Label Editor). Then, in order to construct an automatic voice building system, we developed an automatic accent labeling technique. It was shown that the developed system can successfully label accent information.
|
Research Products
(111 results)
[Journal Article] Activities of Interactive Speech Technology Consortium (ISTC) Targeting Open Software Development for MMI Systems (2004)
Author(s)
T.Nitta, S.Sagayama, Y.Yamashita, T.Kawahara, S.Morishima, S.Nakamura, A.Yamada, K.Ito, M.Kai, A.Li, M.Mimura, K.Hirose, T.Kobayashi, K.Tokuda, N.Minematsu, Y.Den, T.Utsuro, T.Yotsukura, H.Shimodaira, M.Araki, T.Nishimoto, N.Kawaguchi, H.Banno, K.Katsurada
Journal Title
13th IEEE International Workshop on Robot and Human Interactive Communication (RO-MAN 2004) (CD-ROM proceedings)
Description
From the Final Research Report Summary (Japanese version)
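[Journal Article] Report on the Project to Develop Basic Software for Anthropomorphic Spoken Dialogue Agents (2003)
Author(s)
Shigeki Sagayama, Katsunobu Ito, Takehito Utsuro, Atsuhiko Kai, Takao Kobayashi, Hiroshi Shimodaira, Yasuharu Den, Keiichi Tokuda, Satoshi Nakamura, Takuya Nishimoto, Tsuneo Nitta, Keikichi Hirose, Nobuaki Minematsu, Shigeo Morishima, Yoichi Yamashita, Atsushi Yamada, Akinobu Lee
Journal Title
IPSJ SIG Technical Report, Spoken Language Processing, vol. 2003, no. 049
Description
From the Final Research Report Summary (Japanese version)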