2011 Fiscal Year Annual Research Report

共演者音楽ロボット実現のための音響信号に基づく音楽インタラクション手法の開発

Research Project

Project/Area Number	11J06577
Research Institution	Kyoto University
Principal Investigator	大塚琢馬京都大学, 情報学研究科, 特別研究員(DC1)
Keywords	音環境理解 / ロボット聴覚 / 楽譜追従 / 音源定位・分離 / ベイズトピックモデル
Research Abstract	本研究では,人と共演出来る音楽ロボットへの応用を目的とした音環境理解技術を,機械学習の理論の応用展開を目指す.平成23年度では,次の2つの方面で研究が進捗した。(1)音色の多様性に柔軟に対応する楽譜追従手法の開発,(2)ベイズ確率統計に基づく統一的マイクロホンアレイ処理手法の開発. (1)音色の多様性に柔軟に対応する楽譜追従手法の開発:従来の楽譜追従法は,楽譜で指定された音高に対応する楽器音のテンプレートを生成し,そのテンプレートと観測音響信号の相関を計算するという枠組みであった.テンプレートの生成には,和音の各構成音は等しいパワーで生成される,高調波の振幅が指数的に減衰する音色であるという仮定が置かれていた.これらの仮定は実楽曲における追従精度をしばしば劣化させるという課題があった. この課題を対処するため,音階楽器による演奏音に含まれる調波構造の生成モデルである潜在的調波配分法b)を楽譜追従法の音響信号観測モデルとして統合した.本手法の有効性は,実演奏楽曲20曲を用いて確認された.本研究の成果は査読付き国際会議ISMIR 2011に採録・発表された. (2)ベイズ確率統計に基づくマイクロホンアレイ処理:音環境理解システムにおける重要な要素技術として,マイクロホンアレイを用いた複数音源の定位・分離が挙げられる.音源定位・分離の同時推定問題に対する従来の手法は,定位・分離のいずれかの結果を用いてもう片方の推定を行うというcascaded frameworkであった.この場合,前処理にあたる処理に失敗すると,その誤差がそのまま次の処理に悪影響を及ぼすという課題があった.本研究では,両問題を同時に解くベイズ確率モデルを構築した.残響環境における混合音を用いた評価実験では,本手法による同時推定が,従来の各推定を別個に行う手法と比較し安定した性能を示すことを確認した.特に,分離精度においては,最新の音源分離手法d)を上回る性能を示した.本研究の成果は7月に開催される人工知能分野における査読付きトップカンファレンスであるAAAI 2012にて口頭発表される予定である.
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 今年度までに,共演者音楽ロボット構築に必要な次の2つの要素技術の開発が進んだ.(1)ロボットが聞いた音楽に合わせるために,楽曲の位置を推定する楽譜追従技術,(2)ロボットが特定の音を聞き分けるために必要となる,音源分離技術.いずれの成果も査読付き国際会議での発表を行なっており,研究はおおむね順調に進展しているといえる.
Strategy for Future Research Activity	今後は,本年度に開発した要素技術の音楽ロボットへの実装の他,ロボットが音を聞き分ける際の事前知識をより小さくし,環境の多様性に対して頑健に動作する聴覚機能の開発を行う.特に,従来の音源定位・分離技術は音源数が事前に与えられているという強い仮定のもとで処理が実現されていた.今後の本研究では,音源数に関する事前知識のない手法の開発に注力する.

Research Products
(9 results)

All 2012 2011

All Journal Article (2 results) (of which Peer Reviewed: 2 results) Presentation (6 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] A multi-modal tempo and beat tracking system based on audio-visual information from live guitar performances2012
- Author(s)
  T.Itohara, T.Otsuka, et al
- Journal Title
  
  EURASIP Journal on Audio, Speech, and Music Processing
  
  Volume: 2,012
- DOI
  10.1186/1687-4722-2012-6
- Peer Reviewed
[Journal Article] 音楽共演ロボット:開始・終了キューの画像認識による人間のフルート奏者との実時間同期2011
- Author(s)
  リムアンジェリカ, 水本武志, 大塚琢馬, ら
- Journal Title
  
  情報処理学会論文誌
  
  Volume: Vol.52, No.12 Pages: 3599-3610
- Peer Reviewed
[Presentation] 音源定位手法MUSICのベイズ拡張2011
- Author(s)
  大塚琢馬, ら
- Organizer
  第34回AIチャレンジ研究会
- Place of Presentation
  慶應義塾大学(東京都)
- Year and Date
  2011-12-15
[Presentation] An interactive musical ensemble with the NAO Thereminist2011
- Author(s)
  Angelica Lim, 水本武志, 大塚琢馬, ら
- Organizer
  第34回AIチャレンジ研究会
- Place of Presentation
  慶應義塾大学(東京都)
- Year and Date
  2011-12-15
[Presentation] Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models2011
- Author(s)
  T.Otsuka, et al
- Organizer
  12th International Society for Musical Information Retrieval Conference
- Place of Presentation
  マイアミ(アメリカ)
- Year and Date
  2011-10-26
[Presentation] Particle-filter Based Audio-visual Beat-tracking for Music Robot Ensemble with Human Guitarist2011
- Author(s)
  T.Itohara, T.Mizumoto, T.Otsuka, et al
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems
- Place of Presentation
  サンフランシスコ(アメリカ)
- Year and Date
  2011-09-26
[Presentation] MUSIC法を用いた音源定位のベイズ拡張2011
- Author(s)
  大塚琢馬, ら
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-09
[Presentation] Bayesian Extension of MUSIC for Sound Source Iocalization and Tracking2011
- Author(s)
  T.Otsuka, et al
- Organizer
  International Conference on Spoken Language Processing (INTERSPEECH)
- Place of Presentation
  フィレンツェ(イタリア)
- Year and Date
  2011-08-30
[Patent(Industrial Property Rights)] ロボット、ロボット制御方法およびプログラム2011
- Inventor(s)
  中臺一博, 大塚琢馬, ら
- Industrial Property Rights Holder
  本田技研工業(株)
- Patent Publication Number
  特開2011-180590号
- Filing Date
  2011-09-15

2011 Fiscal Year Annual Research Report

共演者音楽ロボット実現のための音響信号に基づく音楽インタラクション手法の開発

Principal Investigator

大塚 琢馬 京都大学, 情報学研究科, 特別研究員(DC1)

Current Status of Research Progress

Reason

Research Products

[Journal Article] A multi-modal tempo and beat tracking system based on audio-visual information from live guitar performances2012

Author(s)

Journal Title

DOI

[Journal Article] 音楽共演ロボット:開始・終了キューの画像認識による人間のフルート奏者との実時間同期2011

Author(s)

Journal Title

[Presentation] 音源定位手法MUSICのベイズ拡張2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] An interactive musical ensemble with the NAO Thereminist2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Particle-filter Based Audio-visual Beat-tracking for Music Robot Ensemble with Human Guitarist2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] MUSIC法を用いた音源定位のベイズ拡張2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Bayesian Extension of MUSIC for Sound Source Iocalization and Tracking2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Patent(Industrial Property Rights)] ロボット、ロボット制御方法およびプログラム2011

Inventor(s)

Industrial Property Rights Holder

Patent Publication Number

Filing Date

大塚琢馬京都大学, 情報学研究科, 特別研究員(DC1)