Extended theories of audio source separation based on statistical independence and various mathematical structures

Research Project

Project/Area Number	17H06572
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Single-year Grants
Research Field	Perceptual information processing
Research Institution	Kagawa National College of Technology (2018) The University of Tokyo (2017)
Principal Investigator	Kitamura Daichi 香川高等専門学校, 電気情報工学科, 助教 (40804745)
Project Period (FY)	2017-08-25 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount *help	¥2,990,000 (Direct Cost: ¥2,300,000、Indirect Cost: ¥690,000) Fiscal Year 2018: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000) Fiscal Year 2017: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Keywords	音響信号処理 / 統計的信号処理 / 音源分離 / 深層学習 / 最適化
Outline of Final Research Achievements	This research project aims to improve the performance of conventional audio source separation techniques by extending their theories from mathematical and practical aspects. Audio source separation is a technique for extracting specific audio sources from the observed mixture signal. This technique can be applied for many devices and systems including hearing-aid system, smart speaker, speech recognition, and so on. In this project, the generalization of probabilistic model assumed in "independent low-rank matrix analysis (ILRMA)" (state-of-the-art audio source separation method) was carried out, and its validity was confirmed by practical experiments. Also, various types of mathematical model were introduced into ILRMA to enhance its separation quality. Furthermore, data-driven approach was newly employed to ILRMA, which was named as independent deeply learned matrix analysis. The efficacy of the proposed methods was confirmed.
Academic Significance and Societal Importance of the Research Achievements	音源分離技術の精度が向上すれば，補聴器等の人支援デバイスへと直接的に応用できる他，音楽の新しい楽しみ方やVR技術への援用など，これまでの芸術・文化の振興につながることが期待されている．また，近年は音声認識やスマートスピーカ等が身近な技術となったが，これらのデバイスが雑音の多い環境下でも頑健に動作するためにも，音源分離技術の応用が必須となる．このように，音源分離技術はあらゆる音響機器のフロントエンドとして必要な最も基本的な信号処理である．また，「混合信号から潜在的な因子を推定する」という観点では，音響信号のみならず，画像や電波などあらゆるメディアへの活用も期待される．

Report

(3 results)

2018 Annual Research Report Final Research Report ( PDF )
2017 Annual Research Report

Research Products
(43 results)

All 2019 2018 2017 Other

All Journal Article (3 results) (of which Peer Reviewed: 3 results, Open Access: 3 results) Presentation (36 results) (of which Int'l Joint Research: 13 results, Invited: 2 results) Remarks (4 results)

[Journal Article] Independent Low-Rank Matrix Analysis Based on Generalized Kullback-Leibler Divergence2019
- Author(s)
  MOGAMI Shinichi、MITSUI Yoshiki、TAKAMUNE Norihiro、KITAMURA Daichi、SARUWATARI Hiroshi、TAKAHASHI Yu、KONDO Kazunobu、NAKAJIMA Hiroaki、KAMEOKA Hirokazu
- Journal Title
  
  IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
  
  Volume: E102.A Issue: 2 Pages: 458-463
- DOI
  10.1587/transfun.E102.A.458
- NAID
  130007585927
- ISSN
  0916-8508, 1745-1337
- Year and Date
  2019-02-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Bilevel Optimization Using Stationary Point of Lower-level Objective Function for Discriminative Basis Learning in Nonnegative Matrix Factorization2019
- Author(s)
  Nakajima Hiroaki、Kitamura Daichi、Takamune Norihiro、Saruwatari Hiroshi、Ono Nobutaka
- Journal Title
  
  IEEE Signal Processing Letters
  
  Volume: 印刷中 Issue: 6 Pages: 818-822
- DOI
  10.1109/lsp.2019.2909079
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation2018
- Author(s)
  Daichi Kitamura, Shinichi Mogami, Yoshiki Mitsui, Norihiro Takamune, Hiroshi Saruwatari, Nobutaka Ono, Yu Takahashi, and Kazunobu Kondo
- Journal Title
  
  EURASIP Journal on Advances in Signal Processing
  
  Volume: - Issue: 1 Pages: 1-28
- DOI
  10.1186/s13634-018-0549-5
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] Generalized-Gaussian-distribution-based independent deeply learned matrix analysis for multichannel audio source separation2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo, and Hiroaki Nakajima
- Organizer
  Proceedings of International Congress and Exhibition on Noise Control Engineering (INTERNOISE 2019)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Column-wise update algorithm for independent deeply learned matrix analysis2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo,
- Organizer
  Proceedings of International Congress on Acoustics (ICA 2019)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Time-frequency-masking-based determined BSS with application to sparse IVA2019
- Author(s)
  Kohei Yatabe and Daichi Kitamura
- Organizer
  Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 収束保証型独立半正定値テンソル分析に基づくブラインド音源分離2019
- Author(s)
  福重敢太, 高宗典玄, 北村大地, 猿渡洋, 池下林太郎, 中谷智広
- Organizer
  IEICE Technical Report, EA2018-127
- Related Report
  2018 Annual Research Report
[Presentation] ブラインド音源分離における多変量複素Student's t分布に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  IEICE Technical Report, EA2018-128
- Related Report
  2018 Annual Research Report
[Presentation] 時変複素一般化ガウス分布に基づく独立深層学習行列分析2019
- Author(s)
  牧島直輝, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  日本音響学会 2019年春季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 乗算型更新式に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会 2019年春季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 独立低ランク行列分析におけるmajorization-equalizationアルゴリズムを用いた空間パラメータの高速更新2019
- Author(s)
  最上伸一, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  日本音響学会 2019年春季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 白色化の影響を考慮したスパース独立ベクトル分析2019
- Author(s)
  矢田部浩平, 北村大地
- Organizer
  日本音響学会 2019年春季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 教師あり及び半教師あり条件下における独立深層学習行列分析の実験的評価2019
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 高道慎之介, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  日本音響学会 2019年春季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] Independent low-rank matrix analysis based on time-variant sub-Gaussian source model2018
- Author(s)
  Shinichi Mogami, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo, Hiroaki Nakajima, and Nobutaka Ono
- Organizer
  Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Generative approach using the noise generation models for DNN-based speech synthesis trained from noisy speech2018
- Author(s)
  Masakazu Une, Yuki Saito, Shinnosuke Takamichi, Daichi Kitamura, Ryoichi Miyazaki, and Hiroshi Saruwatari
- Organizer
  Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network2018
- Author(s)
  Shinnosuke Takamichi, Yuki Saito, Norihiro Takamune, Daichi Kitamura, and Hiroshi Saruwatari
- Organizer
  Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Independent deeply learned matrix analysis for multichannel audio source separation2018
- Author(s)
  Shinichi Mogami, Hayato Sumino, Daichi Kitamura, Norihiro Takamune, Shinnosuke Takamichi, Hiroshi Saruwatari, and Nobutaka Ono
- Organizer
  Proceedings of European Signal Processing Conference (EUSIPCO 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Determined blind source separation via proximal splitting algorithm2018
- Author(s)
  Kohei Yatabe and Daichi Kitamura
- Organizer
  Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Vectorwise coordinate descent algorithm for spatially regularized independent low-rank matrix analysis2018
- Author(s)
  Yoshiki Mitsui, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] ヘビーテイル生成モデルに基づく独立深層学習行列分析による多チャネル音源分離2018
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  Proceedings of 33th Signal Processing Symposium (SIP Symposium)
- Related Report
  2018 Annual Research Report
[Presentation] 方向統計DNNに基づく振幅スペクトログラムからの位相復元2018
- Author(s)
  高道慎之介, 齋藤佑樹, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会 2018年秋季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 一般化反復射影法に基づく時変劣ガウス独立低ランク行列分析2018
- Author(s)
  最上伸一, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明, 小野順貴
- Organizer
  日本音響学会 2018年秋季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 独立低ランク行列分析を用いたフルランク空間共分散モデルに基づくブラインド音源分離2018
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会 2018年秋季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 半教師あり独立深層学習行列分析におけるデータ拡張に基づく音源モデルの適応2018
- Author(s)
  牧島直輝, 高宗典玄, 高道慎之介, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  日本音響学会 2018年秋季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 一般の時間周波数マスキングに基づく独立ベクトル分析2018
- Author(s)
  矢田部浩平, 北村大地
- Organizer
  日本音響学会 2018年秋季研究発表会講演論文集
- Related Report
  2018 Annual Research Report
[Presentation] 雑音下異常検知における前処理としてのNMF音源抽出手法の検討2018
- Author(s)
  相場亮人, 吉田実, 後藤理, 北村大地, 高道慎之介, 猿渡洋
- Organizer
  Proceedings of 119th IPSJ Special Interest Group on Music and Computer (IPSJ-SIGMUS)
- Related Report
  2018 Annual Research Report
[Presentation] von Mises分布DNNに基づく振幅スペクトログラムからの位相復元2018
- Author(s)
  高道慎之介, 齋藤佑樹, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  Proceedings of 119th IPSJ Special Interest Group on Music and Computer (IPSJ-SIGMUS)
- Related Report
  2018 Annual Research Report
[Presentation] Determined blind source separation via proximal splitting algorithm2018
- Author(s)
  Kohei Yatabe and Daichi Kitamura
- Organizer
  IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Vectorwise coordinate descent algorithm for spatially regularized independent low-rank matrix analysis2018
- Author(s)
  Yoshiki Mitsui, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Ego-noise reduction for hose-shaped rescue robot using basis-shared semi-supervised independent low-rank matrix analysis2018
- Author(s)
  Moe Takakusaki, Daichi Kitamura, Nobutaka Ono, Shoji Makino, Takeshi Yamada, and Hiroshi Saruwatari
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP 2018)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] 独立深層学習行列分析に基づく多チャネル音源分離の実験的評価2018
- Author(s)
  北村大地, 角野隼斗, 高宗典玄, 高道慎之介, 猿渡洋, 猿渡洋, 小野順貴
- Organizer
  電子情報通信学会応用音響研究会（EA）2018年3月
- Related Report
  2017 Annual Research Report
[Presentation] ヘビーテイルな分布に基づく非負値行列因子分解を用いたスパース雑音除去2018
- Author(s)
  北村大地, 高宗典玄, 最上伸一, 三井祥幹, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  日本音響学会 2018年春季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] 独立深層学習行列分析に基づく多チャネル音源分離2018
- Author(s)
  角野隼斗, 北村大地, 高宗典玄, 高道慎之介, 猿渡洋, 小野順貴
- Organizer
  日本音響学会 2018年春季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] 空間モデル正則化を用いた独立低ランク行列分析に基づくブラインド音源分離2018
- Author(s)
  三井祥幹, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  日本音響学会 2018年春季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Iダイバージェンスに基づく独立低ランク行列分析の実験的評価2018
- Author(s)
  最上伸一, 三井祥幹, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明, 亀岡弘和
- Organizer
  日本音響学会 2018年春季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Independent low-rank matrix analysis based on parametric majorization-equalization algorithm2017
- Author(s)
  Yoshiki Mitsui, Daichi Kitamura, Norihiro Takamune, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP 2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] 空間事前情報を用いた独立低ランク行列分析2017
- Author(s)
  三井祥幹, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  第32回信号処理シンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] Iダイバージェンスを用いた独立低ランク行列分析2017
- Author(s)
  最上伸一, 三井祥幹, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸
- Organizer
  第32回信号処理シンポジウム
- Related Report
  2017 Annual Research Report
[Presentation] 独立低ランク行列分析に基づくブラインド音源分離2017
- Author(s)
  北村大地, 小野順貴, 澤田宏, 亀岡弘和, 猿渡洋
- Organizer
  電子情報通信学会応用音響研究会（EA）2017年10月
- Related Report
  2017 Annual Research Report
- Invited
[Remarks] 独立深層学習行列分析に基づく多チャネル音源分離
- URL
  http://d-kitamura.net/demo_idlma.htm
- Related Report
  2018 Annual Research Report 2017 Annual Research Report
[Remarks] Audio Source Separation Based on IDLMA
- URL
  http://d-kitamura.net/en/demo_idlma_en.htm
- Related Report
  2018 Annual Research Report
[Remarks] Audio Source Separation Based on IDLMA
- URL
  http://d-kitamura.net/en/demo_idlma_em.htm
- Related Report
  2017 Annual Research Report
[Remarks] ILRMA
- URL
  https://github.com/d-kitamura/ILRMA
- Related Report
  2017 Annual Research Report

Extended theories of audio source separation based on statistical independence and various mathematical structures

Principal Investigator

Kitamura Daichi 香川高等専門学校, 電気情報工学科, 助教 (40804745)

¥2,990,000 (Direct Cost: ¥2,300,000、Indirect Cost: ¥690,000)

Report

Research Products

[Journal Article] Independent Low-Rank Matrix Analysis Based on Generalized Kullback-Leibler Divergence2019

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Bilevel Optimization Using Stationary Point of Lower-level Objective Function for Discriminative Basis Learning in Nonnegative Matrix Factorization2019

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation2018

Author(s)

Journal Title

DOI

Related Report

[Presentation] Generalized-Gaussian-distribution-based independent deeply learned matrix analysis for multichannel audio source separation2019

Author(s)

Organizer

Related Report

[Presentation] Column-wise update algorithm for independent deeply learned matrix analysis2019

Author(s)

Organizer

Related Report

[Presentation] Time-frequency-masking-based determined BSS with application to sparse IVA2019

Author(s)

Organizer

Related Report

[Presentation] 収束保証型独立半正定値テンソル分析に基づくブラインド音源分離2019

Author(s)

Organizer

Related Report

[Presentation] ブラインド音源分離における多変量複素Student's t分布に基づくランク制約付き空間共分散モデルの推定2019

Author(s)

Organizer

Related Report

[Presentation] 時変複素一般化ガウス分布に基づく独立深層学習行列分析2019

Author(s)

Organizer

Related Report

[Presentation] 乗算型更新式に基づくランク制約付き空間共分散モデルの推定2019

Author(s)

Organizer

Related Report

[Presentation] 独立低ランク行列分析におけるmajorization-equalizationアルゴリズムを用いた空間パラメータの高速更新2019

Author(s)

Organizer

Related Report

[Presentation] 白色化の影響を考慮したスパース独立ベクトル分析2019

Author(s)

Organizer

Related Report

[Presentation] 教師あり及び半教師あり条件下における独立深層学習行列分析の実験的評価2019

Author(s)

Organizer

Related Report

[Presentation] Independent low-rank matrix analysis based on time-variant sub-Gaussian source model2018

Author(s)

Organizer

Related Report

[Presentation] Generative approach using the noise generation models for DNN-based speech synthesis trained from noisy speech2018

Author(s)

Organizer

Related Report

[Presentation] Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network2018

Author(s)

Organizer

Related Report

[Presentation] Independent deeply learned matrix analysis for multichannel audio source separation2018

Author(s)

Organizer

Related Report