2017 Fiscal Year Final Research Report
Research assisting environment of speech communication based on interactive visualization and sonification
Project/Area Number |
16K12464
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Perceptual information processing
|
Research Institution | Wakayama University |
Principal Investigator |
Kawahara Hideki 和歌山大学, 学内共同利用施設等, 名誉教授 (40294300)
|
Co-Investigator(Kenkyū-buntansha) |
入野 俊夫 和歌山大学, システム工学部, 教授 (20346331)
森勢 将雅 山梨大学, 大学院総合研究部, 准教授 (60510013)
|
Co-Investigator(Renkei-kenkyūsha) |
TODA Tomoki 名古屋大学, 情報基盤センター, 教授 (90403328)
SAKAKIBARA Ken-Ichi 北海道医療大学, リハビリテーション科学部, 准教授 (80396168)
HANEISHI Eri 昭和音楽大学, 音楽学部, 教授 (70350684)
BANNO Hideki 名城大学, 理工学部, 准教授 (20335003)
|
Research Collaborator |
Patterson Roy D. Cambridge大学
Schweinberger Stefan Jena大学
Ellis Dan Columbia大学
McDermott Josh MIT
|
Project Period (FY) |
2016-04-01 – 2018-03-31
|
Keywords | 音声分析 / 音声合成 / 聴覚 / 感情音声 / 音声コミュニケーション / 対話的研究環境 / オープンソース |
Outline of Final Research Achievements |
We developed infrastructures of speech analysis, modification, and synthesis based on interference-free representations of speech parametric representations. In addition to our STRAIGHT-based infrastructure, which is a defacto standard in speech research, we developed a set of new independent algorithms. We made these algorithms as open-source. We elaborated on building supporting tools for promoting academic research using STRAIGHT systems. In addition to these planned accomplishments, we also established application infrastructure based on WaveNet, which revolutionalized speech applications based on deep learning.
|
Free Research Field |
聴覚メディア処理
|