A speech reproduction system using flexible time-axis and speech database for researchers
Project/Area Number |
23500147
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Multi-year Fund |
Section | General |
Research Field |
Media informatics/Database
|
Research Institution | The University of Electro-Communications |
Principal Investigator |
TAKAHASHI Kota The University of Electro-Communications, Graduate School of Informatics and Engineering, Associate Professor (10188005)
|
Project Period (FY) |
2011 – 2013
|
Project Status |
Completed (Fiscal Year 2013)
|
Budget Amount |
¥5,070,000 (Direct Cost: ¥3,900,000, Indirect Cost: ¥1,170,000)
Fiscal Year 2013: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000)
Fiscal Year 2012: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
Fiscal Year 2011: ¥2,340,000 (Direct Cost: ¥1,800,000, Indirect Cost: ¥540,000)
|
Keywords | speaking rate conversion / speech database / speaking rate estimation / user model |
Research Abstract |
This project studied speaking rate estimation techniques for a speech reproduction system with a flexible time axis, together with signal processing techniques for speaking rate conversion. During the study period, the software research environment was expanded. In the expanded environment, signal processing is decomposed into individual processing elements, each described separately, and a new signal processing method can be built by connecting elements. For the FPGA-based system, the proposed reproduction method was implemented on a Xilinx evaluation board. In addition, a special speech database was constructed so that researchers all over the country can use it free of charge. At the end of the study period, the database contained 2216 sentences and had already been opened to the public.
|
Report
(4 results)
Research Products
(22 results)