• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Development of fundamental technology for speech and sound event processing based on complementary use of air- and body-conducted sound signals

Research Project

Project/Area Number 17H01763
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Perceptual information processing
Research InstitutionNagoya University

Principal Investigator

Toda Tomoki  名古屋大学, 情報基盤センター, 教授 (90403328)

Co-Investigator(Kenkyū-buntansha) 北岡 教英  豊橋技術科学大学, 工学(系)研究科(研究院), 教授 (10333501)
亀岡 弘和  日本電信電話株式会社NTTコミュニケーション科学基礎研究所, メディア情報研究部, 特別研究員 (20466402)
Project Period (FY) 2017-04-01 – 2020-03-31
Project Status Completed (Fiscal Year 2019)
Budget Amount *help
¥17,810,000 (Direct Cost: ¥13,700,000、Indirect Cost: ¥4,110,000)
Fiscal Year 2019: ¥5,460,000 (Direct Cost: ¥4,200,000、Indirect Cost: ¥1,260,000)
Fiscal Year 2018: ¥5,980,000 (Direct Cost: ¥4,600,000、Indirect Cost: ¥1,380,000)
Fiscal Year 2017: ¥6,370,000 (Direct Cost: ¥4,900,000、Indirect Cost: ¥1,470,000)
Keywords音声情報処理 / 音響信号処理 / 音声変換 / 音声強調 / 音声認識 / 音響イベント検出 / 音声認識等 / 音声等認識
Outline of Final Research Achievements

In this research, we developed fundamental technology for speech and sound event processing based on complementary use of air- and body-conducted sound signals to make it possible to handle various information included in sound signals beyond physical constraints. We developed fundamental technology to simultaneously record air- and body-conducted sound signals and air- and body-conducted sound signal processing technology capable of effectively using complementary properties of these two types of sound signals. Furthermore, we developed fundamental technology for speech and sound source enhancement processing and speech and sound event recognition processing, further investigating their potential to develop applications for augmenting our physical functions.

Academic Significance and Societal Importance of the Research Achievements

空気伝導音信号を対象とした音声/音環境情報処理技術が盛んに研究されている状況の中、本研究では、体内伝導音信号の利活用という別の視点から、新たな音声/音環境情報処理基盤の構築に取り組んだ。空気/体内伝導音信号の相補的活用と深層学習に代表される最先端の機械学習を組み合わせることで、音の重ね合わせによる情報消失といった本質的な問題を緩和できることを学術的に示した。また、本基盤技術を応用することで、身体的機能拡張といった社会的意義の高い応用技術が実現できる可能性を見出した。

Report

(4 results)
  • 2019 Annual Research Report   Final Research Report ( PDF )
  • 2018 Annual Research Report
  • 2017 Annual Research Report
  • Research Products

    (97 results)

All 2020 2019 2018 2017

All Journal Article (9 results) (of which Peer Reviewed: 8 results,  Open Access: 6 results) Presentation (88 results) (of which Int'l Joint Research: 40 results,  Invited: 12 results)

  • [Journal Article] Statistical approaches to sound event detection2019

    • Author(s)
      林 知樹, 戸田 智基
    • Journal Title

      THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN

      Volume: 75 Issue: 9 Pages: 532-537

    • DOI

      10.20697/jasj.75.9_532

    • NAID

      130007804098

    • ISSN
      0369-4232, 2432-2040
    • Year and Date
      2019-09-01
    • Related Report
      2019 Annual Research Report
    • Open Access
  • [Journal Article] Supervised determined source separation with multichannel variational autoencoder2019

    • Author(s)
      Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino
    • Journal Title

      Neural Computation

      Volume: Vol. 31, No. 9 Issue: 9 Pages: 1891-1914

    • DOI

      10.1162/neco_a_01217

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] ACVAE-VC: non-parallel voice conversion with auxiliary classifier variational autoencoder2019

    • Author(s)
      Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: Vol. 27, No. 9 Issue: 9 Pages: 1432-1443

    • DOI

      10.1109/taslp.2019.2917232

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Underdetermined source separation based on generalized multichannel variational autoencoder2019

    • Author(s)
      Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
    • Journal Title

      IEEE Access

      Volume: Vol. 7, No. 1 Pages: 168104-168115

    • DOI

      10.1109/access.2019.2954120

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Environmental sound processing and its applications2019

    • Author(s)
      Koichi Miyazaki, Tomoki Toda, Tomoki Hayashi, Kazuya Takeda
    • Journal Title

      IEEJ Transactions on Electronics, Information and Systems

      Volume: Vol. 14, No. 3 Issue: 3 Pages: 340-351

    • DOI

      10.1002/tee.22868

    • Related Report
      2018 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Mapping Acoustic Vector Space and Document Vector Space by RNN-LSTM2018

    • Author(s)
      西村良太, 檜垣美帆, 北岡教英
    • Journal Title

      Journal of Japan Society for Fuzzy Theory and Intelligent Informatics

      Volume: 30 Issue: 4 Pages: 628-633

    • DOI

      10.3156/jsoft.30.4_628

    • NAID

      130007435095

    • ISSN
      1347-7986, 1881-7203
    • Year and Date
      2018-08-15
    • Related Report
      2018 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization2018

    • Author(s)
      Shogo Seki, Tomoki Toda, Kazuya Takeda
    • Journal Title

      IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

      Volume: E101.A Issue: 7 Pages: 1057-1064

    • DOI

      10.1587/transfun.E101.A.1057

    • NAID

      130007386619

    • ISSN
      0916-8508, 1745-1337
    • Year and Date
      2018-07-01
    • Related Report
      2018 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Vibration Control Method of an Electrolarynx Based on Statistical <i>F</i><sub>0</sub> Pattern Prediction2017

    • Author(s)
      Kou Tanaka, Tomoki Toda, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E100.D Issue: 9 Pages: 2165-2173

    • DOI

      10.1587/transinf.2016EDP7485

    • NAID

      130006038484

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Articulatory controllable speech modification based on statistical inversion and production mappings2017

    • Author(s)
      Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: Vol. 25, No. 12 Issue: 12 Pages: 2337-2350

    • DOI

      10.1109/taslp.2017.2753583

    • NAID

      120006473530

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Presentation] E2E Streaming Speech Recognition Using CTC and Local Attention2020

    • Author(s)
      Jiahao Chen, Ryota Nishimura, Norihide Kitaoka
    • Organizer
      The 2020 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'20)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Efficient shallow WaveNet vocoder using multiple samples output based on Laplacian distribution and linear prediction2020

    • Author(s)
      Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
    • Organizer
      2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 周りに内緒で通話できるか2020

    • Author(s)
      戸田 智基
    • Organizer
      名古屋大学高等教育院 卓越・先端・次世代シンポジウム
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] 変分自己符号化器を用いた空気・体内伝導音の結合音源モデリングに基づく半教師あり自己発声音強調・抑圧2020

    • Author(s)
      関 翔悟, 高田 萌絵, 武田 一哉, 戸田 智基
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 発話感情認識における音韻・話者情報の低減2020

    • Author(s)
      岡田 慎太郎, 安藤 厚志, 戸田 智基
    • Organizer
      日本音響学会2020年春季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] Uni-directional LSTM と Local Attention を用いたストリーミング音声認識2020

    • Author(s)
      陳 家浩,西村 良太,北岡 教英
    • Organizer
      日本音響学会2020年春季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] Joint separation and dereverberation of reverberant mixtures with multichannel variational autoencoder2019

    • Author(s)
      Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino
    • Organizer
      2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] CycleGAN-VC2: Improved CycleGAN-based non-parallel voice conversion2019

    • Author(s)
      Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo
    • Organizer
      2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] AttS2S-VC: Sequence-to-sequence voice conversion with attention and context preservation mechanisms2019

    • Author(s)
      Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo
    • Organizer
      2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier2019

    • Author(s)
      Li Li, Hirokazu Kameoka, Shoji Makino
    • Organizer
      2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Advanced Voice Conversion2019

    • Author(s)
      Tomoki Toda
    • Organizer
      Speech Processing Courses in Crete (SPCC)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Hands on Voice Conversion2019

    • Author(s)
      Tomoki Toda
    • Organizer
      Speech Processing Courses in Crete (SPCC)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Joint separation, dereverberation and classification of multiple sources using multichannel variational autoencoder with auxiliary classifier2019

    • Author(s)
      Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino
    • Organizer
      23rd International Congress on Acoustics (ICA 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Generalized multichannel variational autoencoder for underdetermined source separation2019

    • Author(s)
      Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda
    • Organizer
      2019 27th European Signal Processing Conference (EUSIPCO 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Tutorial: Statistical voice conversion with direct waveform modeling2019

    • Author(s)
      Tomoki Toda, Kazuhiro Kobayashi, Tomoki Hayashi
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Quasi-periodic WaveNet vocoder: a pitch dependent dilated convolution model for parametric speech generation2019

    • Author(s)
      YiChiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Robustness of statistical voice conversion based on direct waveform modification against background sounds2019

    • Author(s)
      Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Investigation of F0 conditioning and fully convolutional networks in variational autoencoder based voice conversion2019

    • Author(s)
      Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] StarGAN-VC2: Rethinking conditional methods for StarGAN-based voice conversion2019

    • Author(s)
      Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Small-footprint magic word detection method using convolutional LSTM neural network2019

    • Author(s)
      Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, Norihide Kitaoka
    • Organizer
      The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Generalization of spectrum differential based direct waveform modification for voice conversion2019

    • Author(s)
      Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda
    • Organizer
      10th ISCA Speech Synthesis Workshop (SSW10)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Statistical voice conversion with quasi-periodic WaveNet vocoder2019

    • Author(s)
      YiChiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
    • Organizer
      10th ISCA Speech Synthesis Workshop (SSW10)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] An investigation of features for fundamental frequency pattern prediction in electrolaryngeal speech enhancement2019

    • Author(s)
      Mohammad Eshghi, Kou Tanaka, Kazuhiro Kobayashi, Hirokazu Kameoka, Tomoki Toda
    • Organizer
      10th ISCA Speech Synthesis Workshop (SSW10)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Voice conversion with image-to-image translation and sequence-to-sequence learning approaches2019

    • Author(s)
      Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
    • Organizer
      SANE 2019 - Speech and Audio in the Northeast
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] 文脈保持機構を用いた系列変換学習による音声変換2019

    • Author(s)
      田中宏, 亀岡弘和, 金子卓弘, 北条伸克
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] WaveCycleGAN2: 高品質音声生成のためのニューラル波形ポストフィルタ2019

    • Author(s)
      田中宏, 亀岡弘和, 金子卓弘, 北条伸克
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 楽曲音源分離のための個別音源マスク推定ネットワークの統合法2019

    • Author(s)
      大竹 徹郎, 関 翔悟, 戸田 智基
    • Organizer
      日本音響学会2019年秋季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 空気/体内伝導音の対応関係を活用した自己発声音強調/抑圧法2019

    • Author(s)
      高田 萌絵, 関 翔悟, Patrick Lumban Tobing, 戸田 智基
    • Organizer
      日本音響学会2019年秋季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] An investigation of fundamental frequency pattern prediction in electrolaryngeal speech enhancement2019

    • Author(s)
      Mohammad Eshghi, Kou Tanaka, Kazuhiro Kobayashi, Hirokazu Kameoka, Tomoki Toda
    • Organizer
      日本音響学会2019年秋季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 注意機構および文脈保持機構を用いた系列変換モデルに基づく音声変換2019

    • Author(s)
      田中 宏, 亀岡 弘和, 金子 卓弘, 北条 伸克
    • Organizer
      日本音響学会2019年秋季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] WaveCycleGAN2: 高品質音声合成のための時間領域ニューラルポストフィルタ2019

    • Author(s)
      田中 宏, 亀岡 弘和, 金子 卓弘, 北条 伸克
    • Organizer
      日本音響学会2019年秋季研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 多チャンネル変分自己符号化器法による任意話者の音源分離2019

    • Author(s)
      李 莉, 亀岡 弘和, 井上 翔太, 牧野 昭二
    • Organizer
      電子情報通信学会応用音響研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 画像変換/系列変換アプローチを用いた音声変換2019

    • Author(s)
      亀岡 弘和, 金子 卓弘, 田中 宏, 北条 伸克
    • Organizer
      第21回音声言語シンポジウム
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] 発話感情認識における音素事後確率を利用した表現学習とデータ拡張の評価2019

    • Author(s)
      岡田 慎太郎, 安藤 厚志, 戸田 智基
    • Organizer
      第21回音声言語シンポジウム
    • Related Report
      2019 Annual Research Report
  • [Presentation] 音声を変換する技術と機能拡張への応用2019

    • Author(s)
      戸田 智基
    • Organizer
      豊田工業大学 研究談話会
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] 音声合成技術の進展2019

    • Author(s)
      戸田 智基
    • Organizer
      第3回次期グローバルコミュニケーション計画検討WG
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] Augmented vocal production towards new singing style development2019

    • Author(s)
      Tomoki Toda
    • Organizer
      Dagstuhl Seminar, Stimulus Talk at Seminar 19052: computational methods for melody and voice processing in music recordings
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] 音源クラス識別器つき多チャンネル変分自己符号化器を用いた高速セミブラインド音源分離2019

    • Author(s)
      李莉, 亀岡弘和, 牧野昭二
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 多チャンネル変分自己符号化器を用いた劣決定音源分離2019

    • Author(s)
      関 翔悟, 亀岡 弘和, 李 莉, 戸田 智基, 武田 一哉
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 多チャンネル変分自己符号化器を用いた音源分離と残響除去の統合的アプローチ2019

    • Author(s)
      井上翔太, 亀岡弘和, 李莉, 関翔悟, 牧野昭二
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 音素事後確率を利用した表現学習に基づく発話感情認識2019

    • Author(s)
      岡田 慎太郎, 安藤 厚志, 戸田 智基
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 雑音環境下における統計的声質変換の頑健性に関する調査2019

    • Author(s)
      栗田 優佑, 小林 和弘, 武田 一哉, 戸田 智基
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 波形加工に基づく統計的声質変換の外部雑音に対する頑健性2019

    • Author(s)
      栗田 優佑, 小林 和弘, 武田 一哉, 戸田 智基
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 多チャンネル変分自己符号化器に基づく劣決定音源分離の評価2019

    • Author(s)
      関 翔悟, 亀岡 弘和, 李 莉, 戸田 智基, 武田 一哉
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] Deep clustering with gated convolutional networks2018

    • Author(s)
      Li Li, Hirokazu Kameoka
    • Organizer
      2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Joint separation and dereverberation of reverberant mixtures with determined multichannel non-negative matrix factorization2018

    • Author(s)
      Hideaki Kagami, Hirokazu Kameoka, Masahiro Yukawa
    • Organizer
      2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] VAE-SPACE: Deep generative model of voice fundamental frequency contours2018

    • Author(s)
      Kou Tanaka, Hirokazu Kameoka, Kazuho Morikawa
    • Organizer
      2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Connectionist temporal classification-based sound event encoder for converting sound events into onomatopoeia representations2018

    • Author(s)
      Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda
    • Organizer
      The 2018 European Signal Processing Conference (EUSIPCO 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Anomalous sound event detection based on WaveNet2018

    • Author(s)
      Tomoki Hayashi, Tatsuya Komatsu, Reishi Kondo, Tomoki Toda, Kazuya Takeda
    • Organizer
      The 2018 European Signal Processing Conference (EUSIPCO 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram2018

    • Author(s)
      Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando
    • Organizer
      The 2018 European Signal Processing Conference (EUSIPCO 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Non-parallel voice conversion using cycle-consistent adversarial networks2018

    • Author(s)
      Takuhiro Kaneko, Hirokazu Kameoka
    • Organizer
      The 2018 European Signal Processing Conference (EUSIPCO 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Automatic speech pronunciation correction with dynamic frequency warping-based spectral conversion2018

    • Author(s)
      Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
    • Organizer
      The 2018 European Signal Processing Conference (EUSIPCO 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Multi-Head Decoder for end-to-end speech recognition2018

    • Author(s)
      Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda
    • Organizer
      INTERSPEECH 2018
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Mapping acoustic vector space and document vector space by RNN-LSTM2018

    • Author(s)
      Ryota Nishimura, Miho Higaki, Norihide Kitaoka
    • Organizer
      2018 IEEE 7th Global Conference on Consumer Electronics (GCCE 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Self-produced speech enhancement and suppression method using air- and body-conductive microphones2018

    • Author(s)
      Moe Takada, Shogo Seki, Tomoki Toda
    • Organizer
      Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] StarGAN-VC: Non-parallel many-to-many voice conversion using star generative adversarial networks2018

    • Author(s)
      Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo
    • Organizer
      2018 IEEE Workshop on Spoken Language Technology (SLT 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks2018

    • Author(s)
      Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka
    • Organizer
      2018 IEEE Workshop on Spoken Language Technology (SLT 2018)
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 音声変換による発声機能の拡張2018

    • Author(s)
      戸田 智基
    • Organizer
      東京大学ヒューマンオーグメンテーション学第4回セミナー
    • Related Report
      2018 Annual Research Report
    • Invited
  • [Presentation] RNNに基づく音響ベクトル時系列の文書ベクトルへのマッピング2018

    • Author(s)
      西村良太, 檜垣美帆, 北岡教英
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] ウェアラブルな空気/体内伝導マイクロフォンを用いた自己発声音強調/抑圧法2018

    • Author(s)
      高田 萌絵, 関 翔悟, 戸田 智基
    • Organizer
      電子情報通信学会電気音響研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 嚥下障害診断における嚥下音からの咽頭残留判定2018

    • Author(s)
      内野 達貴, 橋詰 淳, 勝野 雅央, 戸田 智基
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] End-to-Endアプローチに基づく音イベントの擬音語表現への記号化2018

    • Author(s)
      宮崎 晃一, 林 知樹, 戸田 智基, 武田 一哉
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 空気/体内伝導マイクロフォンを用いた雑音環境下における自己発声音強調/抑圧法2018

    • Author(s)
      高田 萌絵, 関 翔悟, 戸田 智基
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] WaveNetに基づく振幅スペクトログラムからの波形生成2018

    • Author(s)
      関 翔悟, 林 知樹, 武田 一哉, 戸田 智基
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] End-to-End音声認識ためのMulti-Head Decoderネットワーク2018

    • Author(s)
      林 知樹, 渡部 晋治, 戸田 智基, 武田 一哉
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 嚥下音を利用した嚥下障害診断のための咽頭残留推定法2018

    • Author(s)
      内野 達貴, 橋詰 淳, 勝野 雅央, 戸田 智基
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] WaveNetが音声合成研究に与える影響2018

    • Author(s)
      戸田 智基
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2017 Annual Research Report
    • Invited
  • [Presentation] CycleGANを用いた合成音声から自然音声への波形変換2018

    • Author(s)
      田中 宏, 金子 卓弘, 北条 伸克, 亀岡 弘和
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] ゲート付きCNNを用いた深層クラスタリングによる音源分離2018

    • Author(s)
      李 莉, 亀岡 弘和
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] VAE-SPACE: 音声F0パターンの深層生成モデル2018

    • Author(s)
      田中 宏, 亀岡 弘和, 森川 一穂
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] Electrolaryngeal speech enhancement based on vocoder-free statistical voice conversion and noise suppression2018

    • Author(s)
      Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] CycleGANを用いたパラレルデータフリー声質変換2018

    • Author(s)
      金子 卓弘, 亀岡 弘和
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 敵対的生成ネットワークによる振幅スペクトログラムの位相復元2018

    • Author(s)
      小山田 圭佑, 亀岡 弘和, 金子 卓弘, 田中 宏, 北条 伸克, 安東 弘泰
    • Organizer
      日本音響学会2018年春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] A hybrid approach to electrolaryngeal speech enhancement based on log-spectral differential conversion and noise suppression2018

    • Author(s)
      Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 統計的手法に基づく楽曲中の歌声加工のための歌声分離法の検討2018

    • Author(s)
      山田 智也, 関 翔悟, 小林 和弘, 戸田 智基
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 音声の声質を変換する技術とその応用2017

    • Author(s)
      戸田 智基
    • Organizer
      2017年度人工知能学会全国大会
    • Related Report
      2017 Annual Research Report
    • Invited
  • [Presentation] Physically constrained statistical F0 prediction for electrolaryngeal speech enhancement2017

    • Author(s)
      Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura
    • Organizer
      INTERSPEECH 2017
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Speech enhancement using non-negative spectrogram models with mel-generalized cepstral regularization2017

    • Author(s)
      Li Li, Hirokazu Kameoka, Tomoki Toda, Shoji Makino
    • Organizer
      INTERSPEECH 2017
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Missing component restoration for masked speech signals based on time-domain spectrogram factorization2017

    • Author(s)
      Shogo Seki, Hirokazu Kameoka, Tomoki Toda, Kazuya Takeda
    • Organizer
      The 27th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Mel-generalized cepstral regularization for discriminative non-negative matrix factorization2017

    • Author(s)
      Li Li, Hirokazu Kameoka, Shoji Makino
    • Organizer
      The 27th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling2017

    • Author(s)
      Patrick Lumban Tobing, Hirokazu Kameoka, Tomoki Toda
    • Organizer
      Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2017 (APSIPA ASC 2017)
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] An Investigation of how to design control parameters for statistical voice timbre control2017

    • Author(s)
      Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2017 (APSIPA ASC 2017)
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] ケプストラム距離正則化を用いた半教師ありステレオチャネル楽曲音源分離2017

    • Author(s)
      関 翔悟, 戸田 智基, 武田 一哉
    • Organizer
      情報処理学会音学シンポジウム2017
    • Related Report
      2017 Annual Research Report
  • [Presentation] 歌声分離ならびに統計的歌声声質変換に基づく楽曲中の歌声加工2017

    • Author(s)
      山田 智也, 関 翔悟, 小林 和弘, 戸田 智基
    • Organizer
      情報処理学会音学シンポジウム2017
    • Related Report
      2017 Annual Research Report
  • [Presentation] 実環境下サイレント音声通話に向けた統計的非可聴つぶやき強調のための外部雑音抑圧法2017

    • Author(s)
      田尻 祐介, 亀岡 弘和, 戸田 智基
    • Organizer
      第4回サイレント音声認識ワークショップ
    • Related Report
      2017 Annual Research Report
  • [Presentation] 非可聴つぶやき認識のための深層学習に基づく音響モデリング2017

    • Author(s)
      野田 聖太, 林 知樹, 戸田 智基, 武田 一哉
    • Organizer
      平成29年度電気・電子・情報関係学会東海支部連合大会
    • Related Report
      2017 Annual Research Report
  • [Presentation] CTCに基づく音響イベントから擬音語表現への変換2017

    • Author(s)
      宮崎 晃一, 林 知樹, 戸田 智基, 武田 一哉
    • Organizer
      日本音響学会2017年秋季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] DNN適応に基づく非可聴つぶやき認識用話者・環境依存音響モデルの構築2017

    • Author(s)
      野田 聖太, 林 知樹, 戸田 智基, 武田 一哉
    • Organizer
      電子情報通信学会音声研究会
    • Related Report
      2017 Annual Research Report

URL: 

Published: 2017-04-28   Modified: 2021-02-19  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi