2017 Fiscal Year Final Research Report

Research assisting environment of speech communication based on interactive visualization and sonification

Research Project

Project/Area Number	16K12464
Research Category	Grant-in-Aid for Challenging Exploratory Research
Allocation Type	Multi-year Fund
Research Field	Perceptual information processing
Research Institution	Wakayama University
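Principal Investigator	Kawahara Hideki, Wakayama University, Intra-university Joint-use Facilities, etc., Professor Emeritus (40294300)
Co-Investigator(Kenkyū-buntansha)	Irino Toshio, Wakayama University, Faculty of Systems Engineering, Professor (20346331); Morise Masanori, University of Yamanashi, Graduate Faculty of Interdisciplinary Research, Associate Professor (60510013)
Co-Investigator(Renkei-kenkyūsha)	TODA Tomoki, Nagoya University, Information Technology Center, Professor (90403328); SAKAKIBARA Ken-Ichi, Health Sciences University of Hokkaido, School of Rehabilitation Sciences, Associate Professor (80396168); HANEISHI Eri, Showa University of Music, Faculty of Music, Professor (70350684); BANNO Hideki, Meijo University, Faculty of Science and Technology, Associate Professor (20335003)
Research Collaborator	Patterson Roy D., University of Cambridge; Schweinberger Stefan, University of Jena; Ellis Dan, Columbia University; McDermott Josh, MIT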
Project Period (FY)	2016-04-01 – 2018-03-31
Keywords	speech analysis / speech synthesis / hearing / emotional speech / speech communication / interactive research environment / open source
Outline of Final Research Achievements	We developed infrastructure for speech analysis, modification, and synthesis based on interference-free parametric representations of speech. In addition to our STRAIGHT-based infrastructure, which is a de facto standard in speech research, we developed a set of new, independent algorithms and released them as open source. We also worked on building support tools to promote academic research using STRAIGHT systems. Beyond these planned accomplishments, we established an application infrastructure based on WaveNet, which revolutionized deep-learning-based speech applications.
Free Research Field	Auditory media processing