Acoustic Augmented Reality and Auditory Communication Ability Expansion Based on Small-Data Machine Learning Theory

Research Project

Project/Area Number	19H01116
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	一般
Review Section	Medium-sized Section 61:Human informatics and related fields
Research Institution	The University of Tokyo
Principal Investigator	Saruwatari Hiroshi 東京大学, 大学院情報理工学系研究科, 教授 (30324974)
Co-Investigator(Kenkyū-buntansha)	北村大地香川高等専門学校, 電気情報工学科, 講師 (40804745) 中村友彦東京大学, 大学院情報理工学系研究科, 特任助教 (50866308) 牧野昭二早稲田大学, 理工学術院(情報生産システム研究科・センター), 特任教授 (60396190) 小山翔一東京大学, 大学院情報理工学系研究科, 講師 (80734459) 高道慎之介東京大学, 大学院情報理工学系研究科, 助教 (90784330)
Project Period (FY)	2019-04-01 – 2023-03-31
Project Status	Completed (Fiscal Year 2022)
Budget Amount *help	¥44,850,000 (Direct Cost: ¥34,500,000、Indirect Cost: ¥10,350,000) Fiscal Year 2022: ¥10,920,000 (Direct Cost: ¥8,400,000、Indirect Cost: ¥2,520,000) Fiscal Year 2021: ¥10,920,000 (Direct Cost: ¥8,400,000、Indirect Cost: ¥2,520,000) Fiscal Year 2020: ¥10,920,000 (Direct Cost: ¥8,400,000、Indirect Cost: ¥2,520,000) Fiscal Year 2019: ¥12,090,000 (Direct Cost: ¥9,300,000、Indirect Cost: ¥2,790,000)
Keywords	スモールデータ / 機械学習 / 音響拡張現実感 / 音源分離 / 信号処理
Outline of Research at the Start	本申請では、スモールデータ機械学習理論に基づく新しい音響情報処理の確立、及びその柔軟かつ高品質な音メディアバーチャルリアリティ（VR）・拡張現実感（AR）システムへの応用に関して研究を行う。具体的には、「なるべく少ない事前情報から複雑な音情景を統計的な独立成分に分解し、加工・拡張再現する」という総合的音メディアコンテンツ入出力システムの構築を主目的とする。また、このシステムの実証的アプリケーションとして、「音メディアVR・AR」を想定し、不特定多数の音波動センサが一致団結してユーザの受聴を助ける音コミュニケーション能力拡張システムの実現を通じて、ライフイノベーションへ貢献する。
Outline of Final Research Achievements	In this research, we address small-data-aware sound information processing and its application. Our goal is to expand the unsupervised machine learning theory without a priori big data, and to apply the new theory to sound VR/AR system with efficient statistical modeling and control. In particular, we can develop our technologies for the sound VR/AR system, including flexible statistical model-based unsupervised/semi-supervised sound separation, and efficient voice conversion utilizing the generative DNN model, GAN, and DNN-based phase spectrum estimation.
Academic Significance and Societal Importance of the Research Achievements	本基盤研究で提案されたアルゴリズムにおいては、数理工学的に世界初の発見が複数存在している（例えば多変量におけるMajorizatrion-Equalizationアルゴリズムや劣ガウス生成モデルに関する音源分離アルゴリズムの導出、方向統計分布に基づく位相推定DNN、音場のカーネルリッジ回帰、等）。よって、当該学術分野に大きな貢献が出来たと考えられる。また、本貢献が認められ、多くの学術賞や奨励賞を受賞するに至った。

Report

(6 results)

2022 Annual Research Report Final Research Report ( PDF )
2021 Annual Research Report
2020 Annual Research Report
2019 Comments on the Screening Results Annual Research Report

Research Products
(104 results)

All 2023 2022 2021 2020 2019

All Journal Article (17 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 16 results, Open Access: 16 results) Presentation (86 results) (of which Int'l Joint Research: 31 results, Invited: 4 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] Noise Suppression Using Beamformer and Transfer-Function-Gain Nonnegative Matrix Factorization with Distributed Stereo Microphones2023
- Author(s)
  Yutaro Matsui, Shoji Makino, Nobutaka Ono, Takeshi Yamada
- Journal Title
  
  Journal of Signal Processing
  
  Volume: 27 Issue: 1 Pages: 1-6
- DOI
  10.2299/jsp.27.1
- ISSN
  1342-6230, 1880-1013
- Year and Date
  2023-01-01
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Deficient-basis-complementary rank-constrained spatial covariance matrix estimation based on multivariate generalized Gaussian distribution for blind speech extraction2022
- Author(s)
  Yuto Kondo, Yuki Kubo, Norihiro Takamune , Daichi Kitamura, and Hiroshi Saruwatari
- Journal Title
  
  EURASIP Journal on Advances in Signal Processing
  
  Volume: 88(2022) Issue: 1
- DOI
  10.1186/s13634-022-00905-z
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Region-to-Region Kernel Interpolation of Acoustic Transfer Functions Constrained by Physical Properties2022
- Author(s)
  Juliano G. C. Ribeiro , Natsuki Ueno , Shoichi Koyama , Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: vol. 30 Pages: 2944-2954
- DOI
  10.1109/taslp.2022.3201368
- Related Report
  2022 Annual Research Report
[Journal Article] DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching2021
- Author(s)
  Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E104.D Issue: 11 Pages: 1971-1980
- DOI
  10.1587/transinf.2021EDP7041
- NAID
  130008109996
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2021-11-01
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Noise Robust Acoustic Anomaly Detection System with Nonnegative Matrix Factorization Based on Generalized Gaussian Distribution2021
- Author(s)
  AIBA Akihito、YOSHIDA Minoru、KITAMURA Daichi、TAKAMICHI Shinnosuke、SARUWATARI Hiroshi
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E104.D Issue: 3 Pages: 441-449
- DOI
  10.1587/transinf.2020EDK0002
- NAID
  130007993183
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2021-03-01
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Joint-diagonalizability-constrained multichannel nonnegative matrix factorization based on time-variant multivariate complex sub-Gaussian distribution2021
- Author(s)
  Kamo Keigo、Mitsui Yoshiki、Kubo Yuki、Takamune Norihiro、Kitamura Daichi、Saruwatari Hiroshi、Takahashi Yu、Kondo Kazunobu
- Journal Title
  
  Signal Processing
  
  Volume: 188 Pages: 108183-108183
- DOI
  10.1016/j.sigpro.2021.108183
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis2021
- Author(s)
  Nakamura Tomohiko、Kozuka Shihori、Saruwatari Hiroshi
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 1687-1701
- DOI
  10.1109/taslp.2021.3072496
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement2021
- Author(s)
  Kouei Yamaoka, Nobutaka Ono, and Shoji Makino
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 3461-3475
- DOI
  10.1109/taslp.2021.3126950
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling2021
- Author(s)
  Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 1033-1048
- DOI
  10.1109/taslp.2021.3059114
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Multichannel Blind Source Separation Based on Evanescent-Region-Aware Non-Negative Tensor Factorization in Spherical Harmonic Domain2021
- Author(s)
  Mitsufuji Yuki、Takamune Norihiro、Koyama Shoichi、Saruwatari Hiroshi
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 607-617
- DOI
  10.1109/taslp.2020.3045528
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] DNN-Based Full-Band Speech Synthesis Using GMM Approximation of Spectral Envelope2020
- Author(s)
  KOGUCHI Junya、TAKAMICHI Shinnosuke、MORISE Masanori、SARUWATARI Hiroshi、SAGAYAMA Shigeki
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E103.D Issue: 12 Pages: 2673-2681
- DOI
  10.1587/transinf.2020EDP7075
- NAID
  130007948509
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2020-12-01
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Perception analysis of inter-singer similarity in Japanese song2020
- Author(s)
  Tamaru Hiroki、Takamichi Shinnosuke、Saruwatari Hiroshi
- Journal Title
  
  Acoustical Science and Technology
  
  Volume: 41 Issue: 5 Pages: 804-807
- DOI
  10.1250/ast.41.804
- NAID
  130007895100
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution2020
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, and Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 1948-1968
- DOI
  10.1109/taslp.2020.3003165
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Consistent independent low-rank matrix analysis for determined blind source separation2020
- Author(s)
  Kitamura Daichi、Yatabe Kohei
- Journal Title
  
  EURASIP Journal on Advances in Signal Processing
  
  Volume: 2020 Issue: 1 Pages: 1-35
- DOI
  10.1186/s13634-020-00704-4
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Multichannel Non-Negative Matrix Factorization Using Banded Spatial Covariance Matrices in Wavenumber Domain2020
- Author(s)
  Mitsufuji Yuki、Uhlich Stefan、Takamune Norihiro、Kitamura Daichi、Koyama Shoichi、Saruwatari Hiroshi
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 49-60
- DOI
  10.1109/taslp.2019.2948770
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Independent Low-Rank Matrix Analysis Based on Time-Variant Sub-Gaussian Source Model for Determined Blind Source Separation2020
- Author(s)
  Mogami Shinichi、Takamune Norihiro、Kitamura Daichi、Saruwatari Hiroshi、Takahashi Yu、Kondo Kazunobu、Ono Nobutaka
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 503-518
- DOI
  10.1109/taslp.2019.2959257
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation2019
- Author(s)
  Makishima Naoki、Mogami Shinichi、Takamune Norihiro、Kitamura Daichi、Sumino Hayato、Takamichi Shinnosuke、Saruwatari Hiroshi、Ono Nobutaka
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 27 Issue: 10 Pages: 1601-1615
- DOI
  10.1109/taslp.2019.2925450
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] 日本語音声合成におけるアクセント句韻律特徴量の表現と予測2023
- Author(s)
  佐藤匡紀，高道慎之介，猿渡洋
- Organizer
  第9回音声・音響・信号処理ワークショップ（SPEASIP）
- Related Report
  2022 Annual Research Report
[Presentation] 多チャネル音源分離のための独立低ランク行列分析に対するスペクトログラム無矛盾性に基づく正則化項の設計2023
- Author(s)
  三澤颯大，高宗典玄，矢田部浩平，北村大地，猿渡洋
- Organizer
  第9回音声・音響・信号処理ワークショップ（SPEASIP）
- Related Report
  2022 Annual Research Report
[Presentation] vTTS: visual-text to speech2023
- Author(s)
  Yoshifumi Nakano, Takaaki Saeki, Shinnosuke Takamichi, Katsuhito Sudoh, Hiroshi Saruwatari
- Organizer
  the 2022 IEEE Spoken Language Technology Workshop (IEEE SLT 2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] REGION-TO-REGION KERNEL INTERPOLATION OF ACOUSTIC TRANSFER FUNCTION WITH DIRECTIONAL WEIGHTING2022
- Author(s)
  Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari
- Organizer
  The 47th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] SPATIAL ACTIVE NOISE CONTROL BASED ON INDIVIDUAL KERNEL INTERPOLATION OF PRIMARY AND SECONDARY SOUND FIELDS2022
- Author(s)
  Kazuyuki Arikawa, Shoichi Koyama, and Hiroshi Saruwatari
- Organizer
  The 47th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders2022
- Author(s)
  Futa Nakashima, Tomohiko Nakamura, Norihiro Takamune, Satoru Fukayama, and Hiroshi Saruwatari
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2022 (APSIPA ASC 2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Accelerating online algorithm using geometrically constrained independent vector analysis with iterative source steering2022
- Author(s)
  Kana Goto, Tetsuya Ueda, Li Li, Takeshi Yamada, Shoji Makino
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2022 (APSIPA ASC 2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] 時間チャネル非負値行列因子分解を用いた被り音抑圧における初期値頑健性の比較2022
- Author(s)
  溝渕悠朔, 北村大地, 中村友彦, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会第148回(2022年秋季)研究発表会
- Related Report
  2022 Annual Research Report
[Presentation] 楽音合成のための Gauss 混合変分自己符号化器への定曲率非 Euclid 空間の導入と実験的比較2022
- Author(s)
  中島風太，中村友彦，高宗典玄，深山覚，猿渡洋
- Organizer
  日本音響学会第148回(2022年秋季)研究発表会
- Related Report
  2022 Annual Research Report
[Presentation] 拡散性雑音をモデル化した独立低ランク行列分析における一般化固有値問題の解法に基づく高速化2022
- Author(s)
  西田光輝，高宗典玄，北村大地，猿渡洋，池下林太郎，中谷智広
- Organizer
  日本音響学会第148回(2022年秋季)研究発表会
- Related Report
  2022 Annual Research Report
[Presentation] J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis2022
- Author(s)
  Shinnosuke Takamichi, Wataru Nakata, Naoko Tanji, and Hiroshi Saruwatari
- Organizer
  INTERSPEECH 2022
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] ブラインド音声抽出のためのランク制約付き空間共分散行列推定法における雑音欠落ランク空間基底選択に関する一考察2022
- Author(s)
  西田光輝，高宗典玄，北村大地，猿渡洋
- Organizer
  音学シンポジウム2022
- Related Report
  2022 Annual Research Report
[Presentation] 双曲空間への音色埋め込みを用いたガウス混合変分自己符号化器による楽音合成の検討2022
- Author(s)
  中島風太，中村友彦，高宗典玄，深山覚，猿渡洋
- Organizer
  第134回音楽情報科学・第142回音声言語情報処理合同研究発表会
- Related Report
  2022 Annual Research Report
[Presentation] Geometrically constrained independent vector analysis with auxiliary function approach and iterative source steering2022
- Author(s)
  Kana Goto, Tetsuya Ueda, Li Li, Takeshi Yamada, Shoji Makino
- Organizer
  European Signal Processing Conference (EUSIPCO 2022)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] 幾何学的制約付き独立ベクトル分析を用いたオンライン指向性音声強調のIterative Source Steering による高速化2022
- Author(s)
  後藤加奈, 上田哲也, 李莉, 山田武志, 牧野昭二
- Organizer
  日本音響学会第148回(2022年秋季)研究発表会
- Related Report
  2022 Annual Research Report
[Presentation] 解像度の異なる複数の時間周波数表現を用いた独立低ランク行列分析2022
- Author(s)
  細谷泰稚, 北村大地, 矢田部浩平
- Organizer
  日本音響学会 2022年春季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] Speaking Rate Control by HiFi-GAN using Feature Interpolation2022
- Author(s)
  Detai Xin, Shinnosuke Takamichi, Takuma Okamoto, Hisashi Kawai, Hiroshi Saruwatari
- Organizer
  音声言語情報処理研究会（IPSJ-SLP）
- Related Report
  2021 Annual Research Report
[Presentation] 画像文字からの音声合成2022
- Author(s)
  中野嘉文，佐伯高明，高道慎之介，須藤克仁，猿渡洋
- Organizer
  言語処理学会2022年年次大会
- Related Report
  2021 Annual Research Report
[Presentation] 深層学習に基づく周波数帯域予測による高速音源分離法の実験的評価2021
- Author(s)
  渡辺瑠伊, 北村大地, 中村友彦, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  第24回日本音響学会関西支部若手研究者交流研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] 深層学習に基づく間引きインジケータ付き周波数帯域補間手法による音源分離処理の高速化2021
- Author(s)
  渡辺瑠伊, 北村大地, 中村友彦, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会 2021年秋期研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] Teacher-student learning for low-latency online speech enhancement using wave-U-net2021
- Author(s)
  Sotaro Nakaoka, Li Li, Shota Inoue, Shoji Makino
- Organizer
  The 46th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation2021
- Author(s)
  Naoki Narisawa, Rintaro Ikeshita, Norihiro Takamune, Daichi Kitamura, Tomohiko Nakamura, Hiroshi Saruwatari, Tomohiro Nakatani
- Organizer
  European Signal Processing Conference (EUSIPCO 2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation2021
- Author(s)
  Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo
- Organizer
  European Signal Processing Conference (EUSIPCO 2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Prior distribution design for music bleeding-sound reduction based on nonnegative matrix factorization2021
- Author(s)
  Yusaku Mizobuchi, Daichi Kitamura, Tomohiko Nakamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2021 (APSIPA ASC 2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Speech enhancement by noise self-supervised rank-constrained spatial covariance matrix estimation via independent deeply learned matrix analysis2021
- Author(s)
  Sota Misawa, Norihiro Takamune, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Masakazu Une, and Shoji Makino
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2021 (APSIPA ASC 2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models2021
- Author(s)
  Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2021 (APSIPA ASC 2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Mean-Square-Error-Based Secondary Source Placement in Sound Field Synthesis With Prior Information on Desired Field2021
- Author(s)
  Keisuke Kimura, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari
- Organizer
  IEEE Workshop on Applications of Signal Processing to Audio and Acoustics(WASPAA)
- Related Report
  2021 Annual Research Report
[Presentation] 多変量一般化Gauss分布に基づくランク制約付き空間共分散行列推定法における雑音欠落ランク空間基底推定2021
- Author(s)
  近藤祐斗，久保優騎，高宗典玄，北村大地，猿渡洋
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] Product of Priors型確率分布を導入した音源モデルに基づく独立深層学習行列分析による多チャネル音源分離2021
- Author(s)
  蓮実拓也，中村友彦，高宗典玄，猿渡洋，北村大地，高橋祐，近藤多伸
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] ヘビーテイル生成モデルに基づく独立深層学習テンソル分析2021
- Author(s)
  成澤直輝，池下林太郎，高宗典玄，北村大地，中村友彦，猿渡洋，中谷智広
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] 独立深層学習行列分析を用いたランク制約付き空間共分散行列推定による音声強調2021
- Author(s)
  三澤颯大，中村友彦，高宗典玄，北村大地，猿渡洋
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] ドメイン適応と話者一致損失を用いた話者適応によるクロスリンガル音声合成2021
- Author(s)
  辛徳泰，齋藤佑樹，高道慎之介，郡山知樹，猿渡洋
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] Low-overlap window を用いたオンラインWave-U-Net のアルゴリズム遅延の削減2021
- Author(s)
  中岡想太郎, 李莉, 牧野昭二, 山田武志
- Organizer
  日本音響学会2021秋季研究発表会
- Related Report
  2021 Annual Research Report
[Presentation] 非負値行列因子分解を導入したproduct of experts型音源モデルに基づく独立深層学習行列分析による多チャネル音源分離2021
- Author(s)
  蓮実拓也，中村友彦，高宗典玄，猿渡洋，北村大地，高橋祐，近藤多伸
- Organizer
  第131回音楽情報科学研究会
- Related Report
  2021 Annual Research Report
[Presentation] 多重解像度深層分析を用いた楽音分離の実験的評価2021
- Author(s)
  中村友彦，猿渡洋
- Organizer
  音学シンポジウム2021
- Related Report
  2021 Annual Research Report
[Presentation] 非負値行列因子分解を用いた被り音の抑圧2021
- Author(s)
  溝渕悠朔, 北村大地, 中村友彦, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  第132回音楽情報科学研究会
- Related Report
  2021 Annual Research Report
[Presentation] 多変量複素Sub-Gauss分布に基づく同時対角化制約付き多チャネル非負値行列因子分解におけるmajorization-equalizationアルゴリズムを用いた更新則2021
- Author(s)
  加茂佳吾，久保優騎，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会 2021年春季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] ランク制約付き空間共分散行列推定法における補助関数法に基づく雑音欠落ランク空間基底に対する新しい更新則2021
- Author(s)
  近藤祐斗, 久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] スタガードモデル化三重対角型共分散行列を用いた独立半正定値テンソル分析によるブラインド音源分離2021
- Author(s)
  近藤樹、高宗典玄、北村大地、猿渡洋、池下林太郎、中谷智広
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 経験ベイズ独立深層学習行列分析による多チャネル音源分離2021
- Author(s)
  蓮実拓也，中村友彦，高宗典玄，猿渡洋，北村大地，高橋祐，近藤多伸
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 独立深層学習テンソル分析に基づく多チャネル?源分離2021
- Author(s)
  成澤直輝，池下林太郎，高宗典玄，北村大地，中村友彦，猿渡洋，中谷智広
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 音場合成のための所望音場の事前情報を用いた二乗誤差期待値最小化規準スピーカ配置最適化法2021
- Author(s)
  木村圭佑，小山翔一，植野夏樹，猿渡洋
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] スペクトログラム無矛盾性を用いた独立低ランク行列分析の実験的評価2021
- Author(s)
  北村大地，矢田部浩平
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 深層学習に基づく周波数帯域補間手法による音源分離処理の高速化2021
- Author(s)
  渡辺瑠伊，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Joint-Diagonalizability-Constrained Multichannel Nonnegative Matrix Factorization Based on Multivariate Complex Sub-Gaussian Distribution2021
- Author(s)
  Keigo Kamo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi and Kazunobu Kondo
- Organizer
  European Signal Processing Conference (EUSIPCO 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Sensor Placement in Arbitrarily Restricted Region for Field Estimation Based on Gaussian Process2021
- Author(s)
  Tomoya Nishida, Natsuki Ueno, Shoichi Koyama and Hiroshi Saruwatari
- Organizer
  European Signal Processing Conference (EUSIPCO 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] DNN-Based Frequency Component Prediction for Frequency-Domain Audio Source Separation2021
- Author(s)
  Rui Watanabe, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  European Signal Processing Conference (EUSIPCO 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Joint-Diagonalizability-Constrained Multichannel Nonnegative Matrix Factorization Based on Multivariate Complex Student’s t-distribution2020
- Author(s)
  Keigo Kamo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi and Kazunobu Kondo
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] DNN-Based Permutation Solver for Frequency-Domain Independent Component Analysis in Two-Source Mixture Case2020
- Author(s)
  Shuhei Yamaji and Daichi Kitamura
- Organizer
  Asia-Pacific Signal and Information Processing Association (APSIPA 2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Mutual-Information-Based Sensor Placement for Spatial Sound Field Recording2020
- Author(s)
  Kentaro Ariga, Tomoya Nishida, Shoichi Koyama, Natsuki Ueno, and Hiroshi Saruwatari
- Organizer
  The 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Convergence-Guaranteed Independent Positive Semidefinite Tensor Analysis Based on Student’s T Distribution2020
- Author(s)
  Tatsuki Kondo, Kanta Fukushige, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Rintaro Ikeshita, Tomohiro Nakatani
- Organizer
  The 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Regularized Fast Multichannel Nonnegative Matrix Factorization with ILRMA-based Prior Distribution of Joint-Diagonalization Process2020
- Author(s)
  Keigo Kamo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi and Kazunobu Kondo
- Organizer
  The 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] SMASH corpus: a spontaneous speech corpus recording third-person audio commentaries on gameplay2020
- Author(s)
  Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari
- Organizer
  The International Conference on Language Resources and Evaluation（LREC 2020）
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Time-domain Audio Source Separation based on Wave-U-Net Combined with Discrete Wavelet Transform2020
- Author(s)
  Tomohiko Nakamura and Hiroshi Saruwatari
- Organizer
  The 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Investigation on Wavelet Basis Function of DNN-based Time Domain Audio Source Separation Inspired by Multiresolution Analysis2020
- Author(s)
  Shihori Kozuka, Tomohiko Nakamura and Hiroshi Saruwatari
- Organizer
  The 49th International Congress and Exposition on Noise Control Engineering (INTERNOISE2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 多変量複素Sub-Gauss分布に基づく同時対角化制約付き多チャネル非負値行列因子分解の様々な残響条件下における実験的評価2020
- Author(s)
  加茂佳吾，久保優騎，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会 2020年秋季研究発表会講演論文集
- Related Report
  2020 Annual Research Report
[Presentation] 音源分離のための周波数間相関を考慮した多変量複素Gauss分布に基づく深層学習による分散共分散行列推定の検討2020
- Author(s)
  成澤直輝，高宗典玄，北村大地，中村友彦，猿渡洋
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] ブラインド音声抽出のためのランク制約付き空間共分散行列推定法における雑音欠落ランク空間基底推定2020
- Author(s)
  近藤祐斗, 久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] ユーザーからの補助情報を用いる独立低ランク行列分析2020
- Author(s)
  大島風雅，中野将生，北村大地
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 深層学習に基づく音響帯域拡張による音源分離処理の高速化2020
- Author(s)
  渡辺瑠伊，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 局所時間周波数構造に基づく深層パーミュテーション解決法の実験的評価2020
- Author(s)
  山地修平，北村大地
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Multichannel Hearing-aid System Based on Basis-Shared Semi-Supervised Independent Low-Rank Matrix Analysis2020
- Author(s)
  Masakazu Une, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, and Shoji Makino
- Organizer
  Forum Acusticum 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Kernel Interpolation of Acoustic Transfer Function Between Regions Considering Reciprocity2020
- Author(s)
  J. G. C. Ribeiro, N. Ueno,S. Koyama, and H. Saruwatari
- Organizer
  IEEE Sensor Array and Multichannel Signal Processing Workshop (SAM)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Region-to-region acoustic transfer function estimation with distributed sources and receivers based on kernel interpolation2020
- Author(s)
  J. G. C. Ribeiro, N. Ueno, S. Koyama, and H. Saruwatari
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 基底共有型半教師あり独立低ランク行列分析に基づく多チャネル補聴器システム2020
- Author(s)
  宇根昌和, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 牧野昭二
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 独立深層学習行列分析におけるマイクロホン毎及び音源毎の座標降下法に基づく分離行列更新法の周波数別自動選択法2020
- Author(s)
  牧島直輝, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] ランク制約付き空間共分散行列推定法に基づく拡散性雑音存在下でのブラインド複数方向性音源分離2020
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] リフティングスキームによる離散ウェーブレット変換を導入した深層ニューラルネットに基づく時間領域音源分離2020
- Author(s)
  小塚詩穂里, 中村友彦, 猿渡洋
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 三重対角型周波数共分散行列を用いた独立半正定値テンソル分析によるブラインド音源分離2020
- Author(s)
  近藤樹, 高宗典玄, 北村大地, 猿渡洋, 池下林太郎, 中谷智広
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 同時対角化行列の事前分布を用いた高速多チャネル非負値行列因子分解によるブラインド音源分離2020
- Author(s)
  加茂佳吾, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] SMASHコーパス：ゲーム動画の後付け実況解説音声収録に基づく自発発話音声コーパス2020
- Author(s)
  齋藤佑樹, 高道慎之介, 猿渡洋
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 広帯域 DNN 音声合成のためのスペクトル包絡の GMM 近似2020
- Author(s)
  小口純矢, 高道慎之介, 猿渡洋, 嵯峨山茂樹
- Organizer
  日本音響学会 2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] ガウス過程に基づく場の計測のための推定・候補領域を独立に設定可能なセンサ配置法2020
- Author(s)
  西田智哉, 植野夏樹, 小山翔一, 猿渡洋
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Presentation] ニューラルネットワークとウェーブレット基底関数の同時学習に基づく多重解像度深層分析を用いた時間領域音源分離2020
- Author(s)
  小塚詩穂里, 中村友彦, 猿渡洋
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 一般化Gauss 分布に基づく同時対角化制約付き多チャネルNMFを用いたブラインド音源分離2020
- Author(s)
  加茂佳吾, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Presentation] Efficient Full-Rank Spatial Covariance Estimation Using Independent Low-Rank Matrix Analysis for Blind Source Separation2019
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
- Organizer
  European Signal Processing Conference (EUSIPCO 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Column-wise update algorithm for independent deeply learned matrix analysis2019
- Author(s)
  Naoki Makishima, Norihiro Takamune,Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  International Congress on Acoustics (ICA 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Evaluation of multichannel hearing aid system using rank-constrained spatial covariance matrix estimation2019
- Author(s)
  Masakazu Une, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, and Shoji Makino
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Robust Demixing Filter Update Algorithm Based on Microphone-wise Coordinate Descent for Independent Deeply Learned Matrix Analysis2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Acceleration of rank-constrained spatial covariance matrix estimation for blind speech extraction2019
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, and Hiroshi Saruwatari
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] ランク制約付き空間共分散モデル推定を用いた多チャネル補聴器システムの評価2019
- Author(s)
  宇根昌和, 久保優騎, 高宗典玄, 北村大地, 猿渡洋, 牧野昭二
- Organizer
  日本音響学会 2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 独立深層学習行列分析におけるマイクロホン毎の座標降下法に基づく分離行列更新2019
- Author(s)
  牧島直輝, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会 2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] ランク制約付き空間共分散モデル推定法の逆行列展開による高速化2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会 2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 多変量複素Student's t 分布に基づく独立半正定値テンソル分析によるブラインド音源分離2019
- Author(s)
  近藤樹, 高宗典玄, 北村大地, 猿渡洋, 池下林太郎, 中谷智広
- Organizer
  日本音響学会 2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] Haar 変換を導入した時間領域深層ニューラルネットに基づく音源分離2019
- Author(s)
  中村友彦, 猿渡洋
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Presentation] ブラインド音声抽出のための多変量複素一般化Gauss 分布に基づくランク制約付き空間共分散行列推定法及びその高速化2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  電子情報通信学会技術研究報告
- Related Report
  2019 Annual Research Report
[Patent(Industrial Property Rights)] 音響解析装置、音響解析方法及び音響解析プログラム2019
- Inventor(s)
  猿渡洋、久保優騎、高宗典玄、北村大地
- Industrial Property Rights Holder
  猿渡洋、久保優騎、高宗典玄、北村大地
- Industrial Property Rights Type
  特許
- Industrial Property Number
  2019-220584
- Filing Date
  2019
- Related Report
  2019 Annual Research Report

Acoustic Augmented Reality and Auditory Communication Ability Expansion Based on Small-Data Machine Learning Theory

Principal Investigator

Saruwatari Hiroshi 東京大学, 大学院情報理工学系研究科, 教授 (30324974)

¥44,850,000 (Direct Cost: ¥34,500,000、Indirect Cost: ¥10,350,000)

Report

Research Products

[Journal Article] Noise Suppression Using Beamformer and Transfer-Function-Gain Nonnegative Matrix Factorization with Distributed Stereo Microphones2023

Author(s)

Journal Title

DOI

ISSN

Year and Date

Related Report

[Journal Article] Deficient-basis-complementary rank-constrained spatial covariance matrix estimation based on multivariate generalized Gaussian distribution for blind speech extraction2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Region-to-Region Kernel Interpolation of Acoustic Transfer Functions Constrained by Physical Properties2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching2021

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Noise Robust Acoustic Anomaly Detection System with Nonnegative Matrix Factorization Based on Generalized Gaussian Distribution2021

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Joint-diagonalizability-constrained multichannel nonnegative matrix factorization based on time-variant multivariate complex sub-Gaussian distribution2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Multichannel Blind Source Separation Based on Evanescent-Region-Aware Non-Negative Tensor Factorization in Spherical Harmonic Domain2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] DNN-Based Full-Band Speech Synthesis Using GMM Approximation of Spectral Envelope2020

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Perception analysis of inter-singer similarity in Japanese song2020

Author(s)

Journal Title

DOI

NAID

Related Report

[Journal Article] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution2020

Author(s)