Research on Innovative Microphone Array Technology for Recognition and Understanding of Acoustic Environments

Research Project

Project/Area Number	19H04131
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Review Section	Basic Section 61010:Perceptual information processing-related
Research Institution	Waseda University (2020-2022) University of Tsukuba (2019)
Principal Investigator	Makino Shoji 早稲田大学, 理工学術院(情報生産システム研究科・センター), 特任教授 (60396190)
Co-Investigator(Kenkyū-buntansha)	猿渡洋東京大学, 大学院情報理工学系研究科, 教授 (30324974) 山田武志筑波大学, システム情報系, 准教授 (20312829)
Project Period (FY)	2019-04-01 – 2022-03-31
Project Status	Completed (Fiscal Year 2022)
Budget Amount *help	¥17,160,000 (Direct Cost: ¥13,200,000、Indirect Cost: ¥3,960,000) Fiscal Year 2021: ¥5,590,000 (Direct Cost: ¥4,300,000、Indirect Cost: ¥1,290,000) Fiscal Year 2020: ¥5,590,000 (Direct Cost: ¥4,300,000、Indirect Cost: ¥1,290,000) Fiscal Year 2019: ¥5,980,000 (Direct Cost: ¥4,600,000、Indirect Cost: ¥1,380,000)
Keywords	ブラインド音源分離 / 音響イベント検出 / 音情景解析 / 音響情報処理
Outline of Research at the Start	1) バーチャルマイクロホン技術を発展させ、世界初の時間周波数ビームフォーマ技術および時間周波数S-VAD(Sophisticated Voice Activity Detector)技術を開発する。 2) 分散型マイクロホンアレーとバーチャルマイクロホンを融合させ、劣決定/優決定条件の全体を最適化した理論を構築する。 3) 音響イベント検出における弱ラベルによる学習法を開発し、ビッグデータのラベル付けコストの大幅削減を達成する。
Outline of Final Research Achievements	In recent years, various efforts have been made under government leadership towards the realization of a highly smart society. Developing fundamental technologies for multimodal communication is also an urgent issue that needs to be addressed to solve these problems. Establishing fundamental technologies for statistical mathematics and fast signal processing to analyze and understand the sound environment, which is the core of multimodal communication, is necessary for applying it to smart security, elderly monitoring, robot audition, and other areas. This research focused on developing fundamental technologies for multimodal communication centered on the sound environment for social implementation. Specifically, we conducted research on the fundamental technologies of acoustic measurement using virtual microphones and distributed microphone arrays, as well as understanding the sound environment through deep learning.
Academic Significance and Societal Importance of the Research Achievements	本研究では、我々がこれまで提案してきたバーチャルマイクロホンという新概念に、音声や音響信号のパワフルな統計モデルや先進的な最適化手法を取り入れ、新しい分散型マイクロホンアレーシステムや信号処理アルゴリズムを開発したことに学術的意義がある。さらに、本研究では、マイクロホンアレー信号処理の積年の問題に立ち向かい、新しいロバストな分散型マイクロホンアレー信号処理アルゴリズムを考案し、包括的で安定な解法を開発したことに社会的意義がある。

Report

(4 results)

Research Products
(71 results)

All 2022 2021 2020 2019

All Journal Article (9 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 9 results, Open Access: 9 results) Presentation (62 results) (of which Int'l Joint Research: 33 results, Invited: 8 results)

[Journal Article] VMInNet: Interpolation of Virtual Microphones in Optimal Latent Space Explored by Autoencoder2021
- Author(s)
  R. Takahashi, L. Li, S. Makino, and T. Yamada
- Journal Title
  
  Journal of Signal Processing
  
  Volume: 25 Issue: 6 Pages: 245-250
- DOI
  10.2299/jsp.25.245
- NAID
  130008110096
- ISSN
  1342-6230, 1880-1013
- Year and Date
  2021-11-01
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Monitoring of Domestic Activities Using Multiple Beamformers and Attention Mechanism2021
- Author(s)
  Kaneko Yuki、Yamada Takeshi、Makino Shoji
- Journal Title
  
  Journal of Signal Processing
  
  Volume: 25 Issue: 6 Pages: 239-243
- DOI
  10.2299/jsp.25.239
- NAID
  130008110097
- ISSN
  1342-6230, 1880-1013
- Year and Date
  2021-11-01
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Single-Channel Multispeaker Separation with Variational Autoencoder Spectrogram Model2021
- Author(s)
  N. Murashima, H. Kameoka, L. Li, S. Seki, and S. Makino
- Journal Title
  
  Journal of Signal Processing
  
  Volume: 25 Issue: 4 Pages: 145-149
- DOI
  10.2299/jsp.25.145
- NAID
  130008060222
- ISSN
  1342-6230, 1880-1013
- Year and Date
  2021-07-01
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement2021
- Author(s)
  Kouei Yamaoka, Nobutaka Ono, and Shoji Makino
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 3461-3475
- DOI
  10.1109/taslp.2021.3126950
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Multichannel Blind Source Separation Based on Evanescent-Region-Aware Non-Negative Tensor Factorization in Spherical Harmonic Domain2021
- Author(s)
  Mitsufuji Yuki、Takamune Norihiro、Koyama Shoichi、Saruwatari Hiroshi
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 607-617
- DOI
  10.1109/taslp.2020.3045528
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Majorization-Minimization Algorithm for Discriminative Non-Negative Matrix Factorization2020
- Author(s)
  Li Li, Hirokazu Kameoka, Shoji Makino
- Journal Title
  
  IEEE Access
  
  Volume: 8 Pages: 227399-227408
- DOI
  10.1109/access.2020.3045791
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method2020
- Author(s)
  Li Li, Hirokazu Kameoka, Shota Inoue, Shoji Makino
- Journal Title
  
  IEEE Access
  
  Volume: 8 Pages: 228740-228753
- DOI
  10.1109/access.2020.3045704
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution2020
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, and Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 1948-1968
- DOI
  10.1109/taslp.2020.3003165
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Supervised determined source separation with multichannel variational autoencoder2019
- Author(s)
  Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino
- Journal Title
  
  Neural Computation
  
  Volume: Vol. 31, No. 9 Issue: 9 Pages: 1891-1914
- DOI
  10.1162/neco_a_01217
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] Blind Source Separation of Moving Sound Sources in Reverberant Indoor Environments, '' in Proc2022
- Author(s)
  T. Yu, T. Ueda, and S. Makino
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Semi-Supervised Learning Using Weakly Labeled Data Generated by GAN in Sound Event Detection, '' in Proc2022
- Author(s)
  K. Ouma, T. Yamada, and S. Makino
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Neutral/Emotional Speech Classification Using Autoencoder and Output of Intermediate Layer in Emotion Recognizer2022
- Author(s)
  J. Santoso, T. Yamada, K. Ishizuka, T. Hashimoto, and S. Makino
- Organizer
  日本音響学会 2022年春　季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] Wave-U-Netと識別器のエンドツーエンド学習による音響シーン識別の検討2022
- Author(s)
  山田友紀, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2022年春季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] Reducing algorithmic delay using low-overlap window for online Wave-U-Net2021
- Author(s)
  S. Nakaoka, L. Li, S. Makino, and T. Yamada
- Organizer
  Invited in Proc. APSIPA
- Related Report
  2021 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Extension of virtual microphone technique to multiple real microphones and investigation of the impact of phase and amplitude interpolation on speech enhancement2021
- Author(s)
  H. Segawa, L. Li, S. Makino, and T. Yamada
- Organizer
  in Proc. APSIPA
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Speech enhancement by noise self-supervised rank-constrained spatial covariance matrix estimation via independent deeply learned matrix analysis2021
- Author(s)
  S. Misawa, N. Takamune, T. Nakamura, D. Kitamura, H. Saruwatari, M. Une, and S. Makino
- Organizer
  in Proc. APSIPA
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Speech emotion recognition based on attention weight correction using word-level confidence measure2021
- Author(s)
  J. Santoso, T. Yamada, S. Makino, K. Ishizuka, and T. Hiramura
- Organizer
  in Proc. INTERSPEECH
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] 'Low latency online source separation and noise reduction based on joint optimization with dereverberation2021
- Author(s)
  T. Ueda, T. Nakatani, R. Ikeshita, K. Kinoshita, S. Araki, and S. Makino
- Organizer
  Invited in Proc. EUSIPCO
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] SepNet: A deep separation matrix prediction network for multichannel audio source separation2021
- Author(s)
  S. Inoue, H. Kameoka, L. Li, and S. Makino
- Organizer
  in Proc. ICASSP2021
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Low latency online blind source separation based on joint optimization with blind dereverberation2021
- Author(s)
  T. Ueda, T. Nakatani, R. Ikeshita, K. Kinoshita, S. Araki, and S. Makino
- Organizer
  in Proc. ICASSP2021
- Related Report
  2021 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Teacher-student learning for low-latency online speech enhancement using wave-U-net2021
- Author(s)
  S. Nakaoka, L. Li, S. Inoue, and S. Makino
- Organizer
  in Proc. ICASSP2021
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures2021
- Author(s)
  L. Li, H. Kameoka, and S. Makino
- Organizer
  arXiv:2109.13496
- Related Report
  2021 Annual Research Report
[Presentation] ChimeraACVAEによる高速多チャンネル変分自己符号化器法2021
- Author(s)
  李莉, 亀岡弘和, 牧野昭二
- Organizer
  日本音響学会 2021年秋季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] Low-overlap window を用いたオンラインWave-U-Net のアルゴリズム遅延の削減2021
- Author(s)
  中岡想太郎, 李莉, 牧野昭二, 山田武志
- Organizer
  日本音響学会 2021年秋季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] ヴァーチャルマイクロフォンの内挿における位相及び振幅補間の音声強調性能への影響の評価2021
- Author(s)
  瀬川華子, 李莉, 牧野昭二, 山田武志
- Organizer
  日本音響学会 2021年秋季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] 音響イベント検出におけるGANを用いた弱ラベルデータ生成による半教師あり学習2021
- Author(s)
  合馬一弥, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2021年秋季研究発表会講演論文集
- Related Report
  2021 Annual Research Report
[Presentation] VMInNet: Interpolation of virtual microphones in optimal latent space explored by autoencoder2021
- Author(s)
  R. Takahashi, L. Li, S. Makino, and T. Yamada
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Single-channel multi-speaker separation via discriminative training of variational autoencoder spectrogram model2021
- Author(s)
  N. Murashima, H. Kameoka, L. Li, S. Seki, and S. Makino
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Monitoring of domestic activities using multiple beamformers and attention mechanism2021
- Author(s)
  Y. Kaneko, T. Yamada, and S. Makino
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Joint-Diagonalizability-Constrained Multichannel Nonnegative Matrix Factorization Based on Multivariate Complex Sub-Gaussian Distribution2021
- Author(s)
  Keigo Kamo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi and Kazunobu Kondo
- Organizer
  European Signal Processing Conference (EUSIPCO 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] SepNet: 高速多チャンネル音源分離のための分離行列予測ネットワーク2021
- Author(s)
  井上翔太, 亀岡弘和, 李莉, 牧野昭二
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 識別的変分自己符号化器学習による特定話者モノラル音声分離2021
- Author(s)
  村島允也, 亀岡弘和, 李莉, 関翔悟, 牧野昭二
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 低遅延でオンライン動作する残響除去と音源分離の同時最適化2021
- Author(s)
  上田哲也, 中谷智広, 池下林太郎, 木下慶介, 荒木章子, 牧野昭二
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] Teacher-Student 学習を用いたWave-U-Net による低遅延リアルタイム音声強調2021
- Author(s)
  中岡想太郎, 李莉, 井上翔太, 牧野昭二
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 車室内の三角マイクロフォンアレイへのヴァーチャルマイクロフォン技術の適用2021
- Author(s)
  瀬川華子, 髙橋理希, 李莉, 陣在遼河, 牧野昭二, 山田武志
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 補助関数法に基づく幾何学的制約付き独立ベクトル分析の車室内音声強調への適用2021
- Author(s)
  後藤加奈, 髙橋理希, 李莉, 牧野昭二, 山田武志
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 音響イベント検出と位置推定における転移学習の効果の検証2021
- Author(s)
  陳軼夫, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 車室内環境でのWave-U-Netによる雑音除去の検討2021
- Author(s)
  樋口隼太, 李莉, 井上翔太, 牧野昭二, 山田武志
- Organizer
  電子情報通信学会 2021年総合大会
- Related Report
  2020 Annual Research Report
[Presentation] 多変量複素Sub-Gauss分布に基づく同時対角化制約付き多チャネル非負値行列因子分解におけるmajorization-equalizationアルゴリズムを用いた更新則2021
- Author(s)
  加茂佳吾，久保優騎，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] Online directional speech enhancement using geometrically constrained independent vector analysis2020
- Author(s)
  L. Li, H. Kameoka, S. Makino
- Organizer
  IEEE International Workshop on Machine Learning for Signal Processing (MLSP2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Online directional speech enhancement using geometrically constrained independent vector analysis2020
- Author(s)
  L. Li, K. Koishida, and S. Makino
- Organizer
  Interspeech2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Multichannel hearing-aid system based on basis-shared semi-supervised independent low-rank matrix analysis2020
- Author(s)
  M. Une, Y. Kubo, N. Takamune, D. Kitamura, H. Saruwatari, and S. Makino
- Organizer
  Forum Acusticum2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Study on geometrically constrained IVA with auxiliary function approach and VCD for in-car communication2020
- Author(s)
  K. Goto, L. Li, R. Takahashi, S. Makino, and T. Yamada
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Applying virtual microphones to triangular microphone array in in-car communication2020
- Author(s)
  H. Segawa, R. Takahashi, R. Jinzai, S. Makino, and T. Yamada
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Joint-Diagonalizability-Constrained Multichannel Nonnegative Matrix Factorization Based on Multivariate Complex Student’s t-distribution2020
- Author(s)
  Keigo Kamo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi and Kazunobu Kondo
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] ブラインド音声抽出のためのランク制約付き空間共分散行列推定法における雑音欠落ランク空間基底推定2020
- Author(s)
  近藤祐斗, 久保優騎, 高宗典玄(東大), 北村大地(香川高専), 猿渡洋(東大)
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 多変量複素Sub-Gauss分布に基づく同時対角化制約付き多チャネル非負値行列因子分解の様々な残響条件下における実験的評価2020
- Author(s)
  加茂佳吾，久保優騎，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会 2020年秋季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] Blind source separation with low latency for in-car communication2020
- Author(s)
  T. Ueda, S. Inoue, S. Makino, M. Matsumoto, and T. Yamada
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2020)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Underdetermined multichannel speech enhancement using time-frequency-bin-wise switching beamformer and gated CNN-based time-frequency mask for reverberant environments2020
- Author(s)
  R. Takahashi, K. Yamaoka, L. Li, S. Makino, T. Yamada, and M. Matsumoto
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2020)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Spatial feature extraction based on convolutional neural network with multiple microphone inputs for monitoring of domestic activities2020
- Author(s)
  Y. Kaneko, R. Kurosawa, T. Yamada, and Shoji Makino
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2020)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] 車室内コミュニケーション用低遅延音源分離手法の検討2020
- Author(s)
  上田哲也, 井上翔太, 牧野昭二, 松本光雄, 山田武志
- Organizer
  日本音響学会 2020年春季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] DNNマスク推定に基づく畳み込みビームフォーマによる音源分離・残響除去・雑音除去の同時実現2020
- Author(s)
  髙橋理希, 中谷智広, 落合翼, 木下慶介, 池下林太郎, Marc Delcroix, 荒木章子, 牧野昭二
- Organizer
  日本音響学会 2020年春季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] 基底共有型半教師あり独立低ランク行列分析に基づく多チャネル補聴器システム2020
- Author(s)
  宇根昌和, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 牧野昭二
- Organizer
  日本音響学会 2020年春季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] 発話の時間変動に着目した音声認識誤り区間推定の検討2020
- Author(s)
  舒禹清, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2020年春季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] 空間特徴と音響特徴を併用する音響イベント検出の検討2020
- Author(s)
  陳軼夫, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2020年春季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] 空間フィルタの自動推定による音響シーン識別の検討2020
- Author(s)
  大野泰己, 山田武志, 牧野昭二
- Organizer
  電子情報通信学会 2020年総合大会
- Related Report
  2019 Annual Research Report
[Presentation] Generative Adversarial Networks を用いた半教師あり学習の音響イベント検出への適用2020
- Author(s)
  合馬一弥, 山田武志, 牧野昭二
- Organizer
  電子情報通信学会 2020年総合大会
- Related Report
  2019 Annual Research Report
[Presentation] Time-frequency-bin-wise switching of minimum variance distortionless response beamformer for underdetermined situations2019
- Author(s)
  K. Yamaoka, N. Ono, S. Makino, and T. Yamada
- Organizer
  International Conference on Acoustics, Speech, and Signal Processing (ICASSP2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier2019
- Author(s)
  L. Li, H. Kameoka, and S. Makino
- Organizer
  International Conference on Acoustics, Speech, and Signal Processing (ICASSP2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Joint separation and dereverberation of reverberant mixtures with multichannel variational autoencoder2019
- Author(s)
  S. Inoue, H. Kameoka, L. Li, S. Seki, and S. Makino
- Organizer
  International Conference on Acoustics, Speech, and Signal Processing (ICASSP2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] CNN-based virtual microphone signal estimation for MPDR beamforming in underdetermined situations2019
- Author(s)
  K. Yamaoka, L. Li, N. Ono, S. Makino, and T. Yamada
- Organizer
  European Signal Processing Conference (EUSIPCO 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Wavelength proportional arrangement of virtual microphones based on interpolation/extrapolation for underdetermined speech enhancement2019
- Author(s)
  R. Jinzai, K. Yamaoka, M. Matsumoto, S. Makino, and T. Yamada
- Organizer
  European Signal Processing Conference (EUSIPCO 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Gated convolutional neural network-based voice activity detection under high-level noise environments2019
- Author(s)
  L. Li, K. Yamaoka, Y. Koshino, M. Matsumoto, and S. Makino
- Organizer
  International Congress on Acoustics (ICA2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Joint separation, dereverberation and classification of multiple sources using multichannel variational autoencoder with auxiliary classifier2019
- Author(s)
  S. Inoue, H. Kameoka, L. Li, and S. Makino
- Organizer
  International Congress on Acoustics (ICA2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Improving singing aid system for laryngectomees with statistical voice conversion and VAE-SPACE2019
- Author(s)
  L. Li, T. Toda, K. Morikawa, K. Kobayashi, and S. Makino
- Organizer
  Annual Conference of the International Society for Music Information Retrieval (ISMIR2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Evaluation of multichannel hearing aid system by rank-constrained spatial covariance matrix estimation2019
- Author(s)
  M. Une, Y. Kubo, N. Takamune, D. Kitamura, H. Saruwatari, and S. Makino
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Classifcation of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum2019
- Author(s)
  J. Santoso, T. Yamada, and S. Makino
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] ランク制約付き空間共分散モデル推定を用いた多チャネル補聴器システムの評価2019
- Author(s)
  宇根昌和, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 牧野昭二
- Organizer
  日本音響学会 2019年秋季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] BLSTMと変調スペクトルを用いた発話特徴識別の検討2019
- Author(s)
  サントソジェニファー, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2019年秋季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] BLSTMを用いた音声認識誤り区間推定の検討2019
- Author(s)
  舒禹清, 山田武志, 牧野昭二
- Organizer
  日本音響学会 2019年秋季研究発表会講演論文集
- Related Report
  2019 Annual Research Report
[Presentation] 多チャンネル変分自己符号化器法による任意話者の音源分離2019
- Author(s)
  李莉, 亀岡弘和, 井上翔太, 牧野昭二
- Organizer
  電子情報通信学会 2019年応用音響研究会
- Related Report
  2019 Annual Research Report

Research on Innovative Microphone Array Technology for Recognition and Understanding of Acoustic Environments

Principal Investigator

Makino Shoji 早稲田大学, 理工学術院(情報生産システム研究科・センター), 特任教授 (60396190)

¥17,160,000 (Direct Cost: ¥13,200,000、Indirect Cost: ¥3,960,000)

Report

Research Products

[Journal Article] VMInNet: Interpolation of Virtual Microphones in Optimal Latent Space Explored by Autoencoder2021

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Monitoring of Domestic Activities Using Multiple Beamformers and Attention Mechanism2021

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Single-Channel Multispeaker Separation with Variational Autoencoder Spectrogram Model2021

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Multichannel Blind Source Separation Based on Evanescent-Region-Aware Non-Negative Tensor Factorization in Spherical Harmonic Domain2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Majorization-Minimization Algorithm for Discriminative Non-Negative Matrix Factorization2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Supervised determined source separation with multichannel variational autoencoder2019

Author(s)

Journal Title

DOI

Related Report

[Presentation] Blind Source Separation of Moving Sound Sources in Reverberant Indoor Environments, '' in Proc2022

Author(s)

Organizer

Related Report

[Presentation] Semi-Supervised Learning Using Weakly Labeled Data Generated by GAN in Sound Event Detection, '' in Proc2022

Author(s)

Organizer

Related Report

[Presentation] Neutral/Emotional Speech Classification Using Autoencoder and Output of Intermediate Layer in Emotion Recognizer2022

Author(s)

Organizer

Related Report

[Presentation] Wave-U-Netと識別器のエンドツーエンド学習による音響シーン識別の検討2022

Author(s)

Organizer

Related Report

[Presentation] Reducing algorithmic delay using low-overlap window for online Wave-U-Net2021

Author(s)

Organizer

Related Report