Development of fundamental technology for speech and sound event processing based on complementary use of air- and body-conducted sound signals

Research Project

Project/Area Number	17H01763
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Perceptual information processing
Research Institution	Nagoya University
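Principal Investigator	Toda Tomoki, Nagoya University, Information Technology Center, Professor (90403328)
Co-Investigator(Kenkyū-buntansha)	Kitaoka Norihide, Toyohashi University of Technology, Graduate School of Engineering, Professor (10333501); Kameoka Hirokazu, Nippon Telegraph and Telephone Corporation, NTT Communication Science Laboratories, Media Information Laboratory, Distinguished Researcher (20466402)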
Project Period (FY)	2017-04-01 – 2020-03-31
Project Status	Completed (Fiscal Year 2019)
Budget Amount	¥17,810,000 (Direct Cost: ¥13,700,000, Indirect Cost: ¥4,110,000)
	Fiscal Year 2019: ¥5,460,000 (Direct Cost: ¥4,200,000, Indirect Cost: ¥1,260,000)
	Fiscal Year 2018: ¥5,980,000 (Direct Cost: ¥4,600,000, Indirect Cost: ¥1,380,000)
	Fiscal Year 2017: ¥6,370,000 (Direct Cost: ¥4,900,000, Indirect Cost: ¥1,470,000)
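Keywords	Speech information processing / Acoustic signal processing / Voice conversion / Speech enhancement / Speech recognition / Sound event detection / Speech recognition, etc. / Recognition of speech, etc.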
Outline of Final Research Achievements	In this research, we developed fundamental technology for speech and sound event processing based on the complementary use of air- and body-conducted sound signals, with the aim of handling the various kinds of information contained in sound signals beyond physical constraints. We developed technology for recording air- and body-conducted sound signals simultaneously, together with signal processing technology that effectively exploits the complementary properties of the two types of signal. We also developed fundamental technology for speech and sound source enhancement and for speech and sound event recognition, and investigated its potential for applications that augment human physical functions.
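Academic Significance and Societal Importance of the Research Achievements	While speech and acoustic-environment information processing for air-conducted sound signals is being actively studied, this research took a different perspective, the use of body-conducted sound signals, and worked on building a new foundation for speech and acoustic-environment information processing. By combining the complementary use of air- and body-conducted sound signals with state-of-the-art machine learning, typified by deep learning, we showed academically that fundamental problems such as the loss of information caused by the superposition of sounds can be mitigated. We also found that applying this fundamental technology may enable applications of high societal value, such as the augmentation of physical functions.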

Report

(4 results)

2019 Annual Research Report
Final Research Report (PDF)
2018 Annual Research Report
2017 Annual Research Report

Research Products
(97 results)

Journal Article (9 results) (of which Peer Reviewed: 8 results, Open Access: 6 results), Presentation (88 results) (of which Int'l Joint Research: 40 results, Invited: 12 results)

[Journal Article] Statistical approaches to sound event detection (2019)
- Author(s)
  林知樹, 戸田智基
- Journal Title
  
  THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN
  
  Volume: 75 Issue: 9 Pages: 532-537
- DOI
  10.20697/jasj.75.9_532
- NAID
  130007804098
- ISSN
  0369-4232, 2432-2040
- Year and Date
  2019-09-01
- Related Report
  2019 Annual Research Report
- Open Access
[Journal Article] Supervised determined source separation with multichannel variational autoencoder (2019)
- Author(s)
  Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino
- Journal Title
  
  Neural Computation
  
  Volume: 31 Issue: 9 Pages: 1891-1914
- DOI
  10.1162/neco_a_01217
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] ACVAE-VC: non-parallel voice conversion with auxiliary classifier variational autoencoder (2019)
- Author(s)
  Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 27 Issue: 9 Pages: 1432-1443
- DOI
  10.1109/taslp.2019.2917232
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Underdetermined source separation based on generalized multichannel variational autoencoder (2019)
- Author(s)
  Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
- Journal Title
  
  IEEE Access
  
  Volume: 7 Issue: 1 Pages: 168104-168115
- DOI
  10.1109/access.2019.2954120
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Environmental sound processing and its applications (2019)
- Author(s)
  Koichi Miyazaki, Tomoki Toda, Tomoki Hayashi, Kazuya Takeda
- Journal Title
  
  IEEJ Transactions on Electrical and Electronic Engineering
  
  Volume: 14 Issue: 3 Pages: 340-351
- DOI
  10.1002/tee.22868
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Mapping Acoustic Vector Space and Document Vector Space by RNN-LSTM (2018)
- Author(s)
  西村良太, 檜垣美帆, 北岡教英
- Journal Title
  
  Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
  
  Volume: 30 Issue: 4 Pages: 628-633
- DOI
  10.3156/jsoft.30.4_628
- NAID
  130007435095
- ISSN
  1347-7986, 1881-7203
- Year and Date
  2018-08-15
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization (2018)
- Author(s)
  Shogo Seki, Tomoki Toda, Kazuya Takeda
- Journal Title
  
  IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
  
  Volume: E101.A Issue: 7 Pages: 1057-1064
- DOI
  10.1587/transfun.E101.A.1057
- NAID
  130007386619
- ISSN
  0916-8508, 1745-1337
- Year and Date
  2018-07-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed
[Journal Article] A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction (2017)
- Author(s)
  Kou Tanaka, Tomoki Toda, Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E100.D Issue: 9 Pages: 2165-2173
- DOI
  10.1587/transinf.2016EDP7485
- NAID
  130006038484
- ISSN
  0916-8532, 1745-1361
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Articulatory controllable speech modification based on statistical inversion and production mappings (2017)
- Author(s)
  Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 25 Issue: 12 Pages: 2337-2350
- DOI
  10.1109/taslp.2017.2753583
- NAID
  120006473530
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Presentation] E2E Streaming Speech Recognition Using CTC and Local Attention (2020)
- Author(s)
  Jiahao Chen, Ryota Nishimura, Norihide Kitaoka
- Organizer
  The 2020 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'20)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Efficient shallow WaveNet vocoder using multiple samples output based on Laplacian distribution and linear prediction (2020)
- Author(s)
  Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
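[Presentation] Can we make phone calls without those around us noticing? (2020)
- Author(s)
  戸田智基
- Organizer
  Nagoya University Symposium on Excellent, Frontier, and Next-Generation Research (名古屋大学高等教育院卓越・先端・次世代シンポジウム)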
- Related Report
  2019 Annual Research Report
- Invited
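[Presentation] Semi-supervised self-produced speech enhancement and suppression based on joint source modeling of air- and body-conducted sounds using a variational autoencoder (2020)
- Author(s)
  関翔悟, 高田萌絵, 武田一哉, 戸田智基
- Organizer
  IEICE Technical Committee on Speech (SP)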
- Related Report
  2019 Annual Research Report
[Presentation] Reducing phonetic and speaker information in speech emotion recognition (2020)
- Author(s)
  岡田慎太郎, 安藤厚志, 戸田智基
- Organizer
  Acoustical Society of Japan 2020 Spring Meeting
- Related Report
  2019 Annual Research Report
[Presentation] Streaming speech recognition using uni-directional LSTM and local attention (2020)
- Author(s)
  陳家浩, 西村良太, 北岡教英
- Organizer
  Acoustical Society of Japan 2020 Spring Meeting
- Related Report
  2019 Annual Research Report
[Presentation] Joint separation and dereverberation of reverberant mixtures with multichannel variational autoencoder (2019)
- Author(s)
  Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino
- Organizer
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] CycleGAN-VC2: Improved CycleGAN-based non-parallel voice conversion (2019)
- Author(s)
  Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo
- Organizer
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] AttS2S-VC: Sequence-to-sequence voice conversion with attention and context preservation mechanisms (2019)
- Author(s)
  Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo
- Organizer
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier (2019)
- Author(s)
  Li Li, Hirokazu Kameoka, Shoji Makino
- Organizer
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Advanced Voice Conversion (2019)
- Author(s)
  Tomoki Toda
- Organizer
  Speech Processing Courses in Crete (SPCC)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Hands on Voice Conversion (2019)
- Author(s)
  Tomoki Toda
- Organizer
  Speech Processing Courses in Crete (SPCC)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Joint separation, dereverberation and classification of multiple sources using multichannel variational autoencoder with auxiliary classifier (2019)
- Author(s)
  Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino
- Organizer
  23rd International Congress on Acoustics (ICA 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Generalized multichannel variational autoencoder for underdetermined source separation (2019)
- Author(s)
  Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
- Organizer
  2019 27th European Signal Processing Conference (EUSIPCO 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Tutorial: Statistical voice conversion with direct waveform modeling (2019)
- Author(s)
  Tomoki Toda, Kazuhiro Kobayashi, Tomoki Hayashi
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Quasi-periodic WaveNet vocoder: a pitch dependent dilated convolution model for parametric speech generation (2019)
- Author(s)
  YiChiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Robustness of statistical voice conversion based on direct waveform modification against background sounds (2019)
- Author(s)
  Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Investigation of F0 conditioning and fully convolutional networks in variational autoencoder based voice conversion (2019)
- Author(s)
  Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] StarGAN-VC2: Rethinking conditional methods for StarGAN-based voice conversion (2019)
- Author(s)
  Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Small-footprint magic word detection method using convolutional LSTM neural network (2019)
- Author(s)
  Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, Norihide Kitaoka
- Organizer
  The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Generalization of spectrum differential based direct waveform modification for voice conversion (2019)
- Author(s)
  Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda
- Organizer
  10th ISCA Speech Synthesis Workshop (SSW10)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Statistical voice conversion with quasi-periodic WaveNet vocoder (2019)
- Author(s)
  YiChiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  10th ISCA Speech Synthesis Workshop (SSW10)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] An investigation of features for fundamental frequency pattern prediction in electrolaryngeal speech enhancement (2019)
- Author(s)
  Mohammad Eshghi, Kou Tanaka, Kazuhiro Kobayashi, Hirokazu Kameoka, Tomoki Toda
- Organizer
  10th ISCA Speech Synthesis Workshop (SSW10)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Voice conversion with image-to-image translation and sequence-to-sequence learning approaches (2019)
- Author(s)
  Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
- Organizer
  SANE 2019 - Speech and Audio in the Northeast
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Voice conversion by sequence-to-sequence learning with a context preservation mechanism (2019)
- Author(s)
  田中宏, 亀岡弘和, 金子卓弘, 北条伸克
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2019 Annual Research Report
[Presentation] WaveCycleGAN2: A neural waveform post-filter for high-quality speech generation (2019)
- Author(s)
  田中宏, 亀岡弘和, 金子卓弘, 北条伸克
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2019 Annual Research Report
[Presentation] A method for integrating individual source mask estimation networks for music source separation (2019)
- Author(s)
  大竹徹郎, 関翔悟, 戸田智基
- Organizer
  Acoustical Society of Japan 2019 Autumn Meeting
- Related Report
  2019 Annual Research Report
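[Presentation] Self-produced speech enhancement and suppression exploiting the correspondence between air- and body-conducted sounds (2019)
- Author(s)
  高田萌絵, 関翔悟, Patrick Lumban Tobing, 戸田智基
- Organizer
  Acoustical Society of Japan 2019 Autumn Meeting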
- Related Report
  2019 Annual Research Report
[Presentation] An investigation of fundamental frequency pattern prediction in electrolaryngeal speech enhancement (2019)
- Author(s)
  Mohammad Eshghi, Kou Tanaka, Kazuhiro Kobayashi, Hirokazu Kameoka, Tomoki Toda
- Organizer
  Acoustical Society of Japan 2019 Autumn Meeting
- Related Report
  2019 Annual Research Report
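[Presentation] Voice conversion based on a sequence-to-sequence model with attention and context preservation mechanisms (2019)
- Author(s)
  田中宏, 亀岡弘和, 金子卓弘, 北条伸克
- Organizer
  Acoustical Society of Japan 2019 Autumn Meeting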
- Related Report
  2019 Annual Research Report
[Presentation] WaveCycleGAN2: A time-domain neural post-filter for high-quality speech synthesis (2019)
- Author(s)
  田中宏, 亀岡弘和, 金子卓弘, 北条伸克
- Organizer
  Acoustical Society of Japan 2019 Autumn Meeting
- Related Report
  2019 Annual Research Report
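[Presentation] Source separation of arbitrary speakers with the multichannel variational autoencoder method (2019)
- Author(s)
  李莉, 亀岡弘和, 井上翔太, 牧野昭二
- Organizer
  IEICE Technical Committee on Engineering Acoustics (EA)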
- Related Report
  2019 Annual Research Report
[Presentation] Voice conversion with image-to-image translation and sequence-to-sequence approaches (2019)
- Author(s)
  亀岡弘和, 金子卓弘, 田中宏, 北条伸克
- Organizer
  The 21st Spoken Language Symposium
- Related Report
  2019 Annual Research Report
- Invited
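[Presentation] Evaluation of representation learning and data augmentation using phoneme posterior probabilities in speech emotion recognition (2019)
- Author(s)
  岡田慎太郎, 安藤厚志, 戸田智基
- Organizer
  The 21st Spoken Language Symposium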
- Related Report
  2019 Annual Research Report
[Presentation] Voice conversion technology and its application to functional augmentation (2019)
- Author(s)
  戸田智基
- Organizer
  Toyota Technological Institute Research Colloquium
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] Advances in speech synthesis technology (2019)
- Author(s)
  戸田智基
- Organizer
  3rd Working Group on the Next Global Communication Plan (第3回次期グローバルコミュニケーション計画検討WG)
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] Augmented vocal production towards new singing style development (2019)
- Author(s)
  Tomoki Toda
- Organizer
  Dagstuhl Seminar, Stimulus Talk at Seminar 19052: computational methods for melody and voice processing in music recordings
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
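[Presentation] Fast semi-blind source separation using a multichannel variational autoencoder with a source class classifier (2019)
- Author(s)
  李莉, 亀岡弘和, 牧野昭二
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting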
- Related Report
  2018 Annual Research Report
[Presentation] Underdetermined source separation using a multichannel variational autoencoder (2019)
- Author(s)
  関翔悟, 亀岡弘和, 李莉, 戸田智基, 武田一哉
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting
- Related Report
  2018 Annual Research Report
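[Presentation] An integrated approach to source separation and dereverberation using a multichannel variational autoencoder (2019)
- Author(s)
  井上翔太, 亀岡弘和, 李莉, 関翔悟, 牧野昭二
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting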
- Related Report
  2018 Annual Research Report
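[Presentation] Speech emotion recognition based on representation learning using phoneme posterior probabilities (2019)
- Author(s)
  岡田慎太郎, 安藤厚志, 戸田智基
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting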
- Related Report
  2018 Annual Research Report
[Presentation] An investigation of the robustness of statistical voice conversion in noisy environments (2019)
- Author(s)
  栗田優佑, 小林和弘, 武田一哉, 戸田智基
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting
- Related Report
  2018 Annual Research Report
[Presentation] Robustness of statistical voice conversion based on waveform modification against external noise (2019)
- Author(s)
  栗田優佑, 小林和弘, 武田一哉, 戸田智基
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2018 Annual Research Report
[Presentation] Evaluation of underdetermined source separation based on a multichannel variational autoencoder (2019)
- Author(s)
  関翔悟, 亀岡弘和, 李莉, 戸田智基, 武田一哉
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2018 Annual Research Report
[Presentation] Deep clustering with gated convolutional networks (2018)
- Author(s)
  Li Li, Hirokazu Kameoka
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Joint separation and dereverberation of reverberant mixtures with determined multichannel non-negative matrix factorization (2018)
- Author(s)
  Hideaki Kagami, Hirokazu Kameoka, Masahiro Yukawa
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] VAE-SPACE: Deep generative model of voice fundamental frequency contours (2018)
- Author(s)
  Kou Tanaka, Hirokazu Kameoka, Kazuho Morikawa
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Connectionist temporal classification-based sound event encoder for converting sound events into onomatopoeia representations (2018)
- Author(s)
  Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda
- Organizer
  The 2018 European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Anomalous sound event detection based on WaveNet (2018)
- Author(s)
  Tomoki Hayashi, Tatsuya Komatsu, Reishi Kondo, Tomoki Toda, Kazuya Takeda
- Organizer
  The 2018 European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram (2018)
- Author(s)
  Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando
- Organizer
  The 2018 European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Non-parallel voice conversion using cycle-consistent adversarial networks (2018)
- Author(s)
  Takuhiro Kaneko, Hirokazu Kameoka
- Organizer
  The 2018 European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Automatic speech pronunciation correction with dynamic frequency warping-based spectral conversion (2018)
- Author(s)
  Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
- Organizer
  The 2018 European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Multi-Head Decoder for end-to-end speech recognition (2018)
- Author(s)
  Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda
- Organizer
  INTERSPEECH 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Mapping acoustic vector space and document vector space by RNN-LSTM (2018)
- Author(s)
  Ryota Nishimura, Miho Higaki, Norihide Kitaoka
- Organizer
  2018 IEEE 7th Global Conference on Consumer Electronics (GCCE 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Self-produced speech enhancement and suppression method using air- and body-conductive microphones (2018)
- Author(s)
  Moe Takada, Shogo Seki, Tomoki Toda
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] StarGAN-VC: Non-parallel many-to-many voice conversion using star generative adversarial networks (2018)
- Author(s)
  Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
- Organizer
  2018 IEEE Workshop on Spoken Language Technology (SLT 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks (2018)
- Author(s)
  Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka
- Organizer
  2018 IEEE Workshop on Spoken Language Technology (SLT 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Augmenting speech production functions through voice conversion (2018)
- Author(s)
  戸田智基
- Organizer
  The University of Tokyo, 4th Human Augmentation Seminar
- Related Report
  2018 Annual Research Report
- Invited
[Presentation] RNN-based mapping of acoustic vector sequences to document vectors (2018)
- Author(s)
  西村良太, 檜垣美帆, 北岡教英
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2018 Annual Research Report
[Presentation] Self-produced speech enhancement and suppression using wearable air- and body-conductive microphones (2018)
- Author(s)
  高田萌絵, 関翔悟, 戸田智基
- Organizer
  IEICE Technical Committee on Electroacoustics
- Related Report
  2018 Annual Research Report
[Presentation] Detecting pharyngeal residue from swallowing sounds for dysphagia diagnosis (2018)
- Author(s)
  内野達貴, 橋詰淳, 勝野雅央, 戸田智基
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2018 Annual Research Report
[Presentation] Encoding sound events into onomatopoeic representations based on an end-to-end approach (2018)
- Author(s)
  宮崎晃一, 林知樹, 戸田智基, 武田一哉
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2018 Annual Research Report
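[Presentation] Self-produced speech enhancement and suppression in noisy environments using air- and body-conductive microphones (2018)
- Author(s)
  高田萌絵, 関翔悟, 戸田智基
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting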
- Related Report
  2018 Annual Research Report
[Presentation] WaveNet-based waveform generation from magnitude spectrograms (2018)
- Author(s)
  関翔悟, 林知樹, 武田一哉, 戸田智基
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting
- Related Report
  2018 Annual Research Report
[Presentation] Multi-head decoder network for end-to-end speech recognition (2018)
- Author(s)
  林知樹, 渡部晋治, 戸田智基, 武田一哉
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting
- Related Report
  2018 Annual Research Report
[Presentation] A pharyngeal residue estimation method using swallowing sounds for dysphagia diagnosis (2018)
- Author(s)
  内野達貴, 橋詰淳, 勝野雅央, 戸田智基
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting
- Related Report
  2018 Annual Research Report
[Presentation] The impact of WaveNet on speech synthesis research (2018)
- Author(s)
  戸田智基
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2017 Annual Research Report
- Invited
[Presentation] Synthetic-to-natural speech waveform conversion using CycleGAN (2018)
- Author(s)
  田中宏, 金子卓弘, 北条伸克, 亀岡弘和
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] Source separation by deep clustering with gated CNNs (2018)
- Author(s)
  李莉, 亀岡弘和
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] VAE-SPACE: A deep generative model of speech F0 patterns (2018)
- Author(s)
  田中宏, 亀岡弘和, 森川一穂
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] Electrolaryngeal speech enhancement based on vocoder-free statistical voice conversion and noise suppression (2018)
- Author(s)
  Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] Parallel-data-free voice conversion using CycleGAN (2018)
- Author(s)
  金子卓弘, 亀岡弘和
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] Phase reconstruction from magnitude spectrograms using generative adversarial networks (2018)
- Author(s)
  小山田圭佑, 亀岡弘和, 金子卓弘, 田中宏, 北条伸克, 安東弘泰
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
[Presentation] A hybrid approach to electrolaryngeal speech enhancement based on log-spectral differential conversion and noise suppression (2018)
- Author(s)
  Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2017 Annual Research Report
[Presentation] A study of singing voice separation methods for statistical singing voice modification in music (2018)
- Author(s)
  山田智也, 関翔悟, 小林和弘, 戸田智基
- Organizer
  IEICE Technical Committee on Speech (SP)
- Related Report
  2017 Annual Research Report
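[Presentation] Voice quality conversion technology and its applications (2017)
- Author(s)
  戸田智基
- Organizer
  The 2017 Annual Conference of the Japanese Society for Artificial Intelligence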
- Related Report
  2017 Annual Research Report
- Invited
[Presentation] Physically constrained statistical F0 prediction for electrolaryngeal speech enhancement (2017)
- Author(s)
  Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura
- Organizer
  INTERSPEECH 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Speech enhancement using non-negative spectrogram models with mel-generalized cepstral regularization (2017)
- Author(s)
  Li Li, Hirokazu Kameoka, Tomoki Toda, Shoji Makino
- Organizer
  INTERSPEECH 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Missing component restoration for masked speech signals based on time-domain spectrogram factorization (2017)
- Author(s)
  Shogo Seki, Hirokazu Kameoka, Tomoki Toda, Kazuya Takeda
- Organizer
  The 27th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Mel-generalized cepstral regularization for discriminative non-negative matrix factorization (2017)
- Author(s)
  Li Li, Hirokazu Kameoka, Shoji Makino
- Organizer
  The 27th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling (2017)
- Author(s)
  Patrick Lumban Tobing, Hirokazu Kameoka, Tomoki Toda
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2017 (APSIPA ASC 2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] An investigation of how to design control parameters for statistical voice timbre control (2017)
- Author(s)
  Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2017 (APSIPA ASC 2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Semi-supervised stereo-channel music source separation using cepstral distance regularization (2017)
- Author(s)
  関翔悟, 戸田智基, 武田一哉
- Organizer
  IPSJ Ongaku Symposium 2017 (情報処理学会音学シンポジウム2017)
- Related Report
  2017 Annual Research Report
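[Presentation] Singing voice modification in music based on singing voice separation and statistical singing voice conversion (2017)
- Author(s)
  山田智也, 関翔悟, 小林和弘, 戸田智基
- Organizer
  IPSJ Ongaku Symposium 2017 (情報処理学会音学シンポジウム2017)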
- Related Report
  2017 Annual Research Report
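[Presentation] An external noise suppression method for statistical non-audible murmur enhancement toward silent speech communication in real environments (2017)
- Author(s)
  田尻祐介, 亀岡弘和, 戸田智基
- Organizer
  The 4th Silent Speech Recognition Workshop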
- Related Report
  2017 Annual Research Report
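[Presentation] Deep learning-based acoustic modeling for non-audible murmur recognition (2017)
- Author(s)
  野田聖太, 林知樹, 戸田智基, 武田一哉
- Organizer
  2017 Tokai-Section Joint Conference on Electrical, Electronics, Information, and Related Engineering (平成29年度電気・電子・情報関係学会東海支部連合大会)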
- Related Report
  2017 Annual Research Report
[Presentation] CTC-based conversion of sound events into onomatopoeic representations (2017)
- Author(s)
  宮崎晃一, 林知樹, 戸田智基, 武田一哉
- Organizer
  Acoustical Society of Japan 2017 Autumn Meeting
- Related Report
  2017 Annual Research Report
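[Presentation] Building speaker- and environment-dependent acoustic models for non-audible murmur recognition based on DNN adaptation (2017)
- Author(s)
  野田聖太, 林知樹, 戸田智基, 武田一哉
- Organizer
  IEICE Technical Committee on Speech (SP)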
- Related Report
  2017 Annual Research Report
