音と映像の複合分析による動画コンテンツ理解の研究

Research Project

Project/Area Number	13J05483
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Research Field	Media informatics/Database
Research Institution	Waseda University
Principal Investigator	平井辰典早稲田大学, 理工学術院, 特別研究員(PD)
Project Period (FY)	2013-04-01 – 2016-03-31
Project Status	Completed (Fiscal Year 2015)
Budget Amount *help	¥3,300,000 (Direct Cost: ¥3,300,000) Fiscal Year 2015: ¥1,100,000 (Direct Cost: ¥1,100,000) Fiscal Year 2014: ¥1,100,000 (Direct Cost: ¥1,100,000) Fiscal Year 2013: ¥1,100,000 (Direct Cost: ¥1,100,000)
Keywords	Audio-visual interaction / Audio-visual modeling / Video indexing / 音楽動画コンテンツ / 鑑賞支援 / Audio-Visual Integration
Outline of Annual Research Achievements	平成27年度には、音楽動画の内容理解技術に関連した研究として、主に以下の二点に取り組んだ。①従来の特徴表現を用いない音楽のモデリング手法とそれを応用したアプリケーションの開発。②動画の音声と映像の内容に依存した動画伸縮手法を提案。具体的には、従来の特徴表現を用いないモデリング手法では、トピックモデルという自然言語処理分野において活発に研究されている機械学習のモデリング手法を用いて、音楽集合から、意味を持った特徴（トピック）を学習し、その特徴を用いて音楽の内容を記述する新たな手法を提案した（①）。特にユニークな点として、音楽における記号と自然言語における単語（Word）との間を繋ぐ特徴の表現手法であり、この手法は音楽動画における音楽の特徴表現として利用可能なものである。また、研究計画の立案当初は、大量の動画の分析に際して、スケーリング手法を考慮しておらず、マシンパワーに頼って時間をかけるアプローチを考えていたが、本年度の成果の一つとして、動画の音と映像のエッセンスを保持したままで動画を圧縮する手法を提案した（②）。これにより、大量の動画の分析を元のまま分析するよりも高速に行うことを可能とした。それぞれの成果は国内学会における発表の後に国際会議での発表も行った。具田的には、①の成果については国際会議ACE2015及びMMM2016にて、②の成果についてはMMM2016にて発表した。上記の研究成果に加えて、本年度は、本研究課題の最終年度としてこれまでの研究成果の中で、未発表であった研究成果を積極的に対外発表していった。特に、これまでに機会が少なかった国際会議での発表を増やし、実際に５本の主著国際会議論文を発表することができた。また、本年度は申請者の博士論文の審査があり、本研究課題で取り組んだ内容のほとんどがその論文の中核として重要な役割を果たした。
Research Progress Status	27年度が最終年度であるため、記入しない。
Strategy for Future Research Activity	27年度が最終年度であるため、記入しない。

Report

(3 results)

Research Products

(21 results)

All 2016 2015 2014 2013 Other

All Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (19 results) (of which Int'l Joint Research: 6 results) Remarks (1 results)

[Journal Article] ラリーシーンに着目した映像自動要約によるラケットスポーツ動画鑑賞システム2015
- Author(s)
  河村俊哉、福里司、平井辰典、森島繁生
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 56 Pages: 1028-1038
- NAID
  110009884097
- Related Report
  2014 Annual Research Report
- Peer Reviewed
[Presentation] RSViewer: An Efficient Video Viewing System for Racquet Sports focusing on Rally Scenes2016
- Author(s)
  Shunya Kawamura, Tsukasa Fukusato, Tatsunori Hirai
- Organizer
  The 7th International Conference on Information Visualization Theory and Application
- Place of Presentation
  Rome, Italy
- Year and Date
  2016-02-27
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] MusicMixer: Automatic DJ System Considering Beat and Latent Topic Similarity2016
- Author(s)
  Tatsunori Hirai, Hironori Doi, Shigeo Morishima
- Organizer
  The 22nd International Conference on MultiMedia Modeling (MMM2016)
- Place of Presentation
  Kovens Conference Center, Florida International University, FL, USA
- Year and Date
  2016-01-04
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] Frame-Wise Continuity-Based Video Summarization and Stretching2016
- Author(s)
  Tatsunori Hirai, Shigeo Morishima
- Organizer
  The 22nd International Conference on MultiMedia Modeling (MMM2016)
- Place of Presentation
  Kovens Conference Center, Florida International University, FL, USA
- Year and Date
  2016-01-04
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] ラリーシーンに着目したラケットスポーツ動画鑑賞システム2015
- Author(s)
  河村俊哉，福里司，平井辰典，森島繁生
- Organizer
  画像電子学会ビジュアルコンピューティングワークショップ2015
- Place of Presentation
  石川県金沢市
- Year and Date
  2015-11-27
- Related Report
  2015 Annual Research Report
[Presentation] MusicMixer: Computer-Aided DJ System based on an Automatic Song Mixing2015
- Author(s)
  Tatsunori Hirai, Hironori Doi, Shigeo Morishima
- Organizer
  The 12th Advances in Computer Entertainment Technology Conference (ACE 2015)
- Place of Presentation
  Iskandar, Malaysia
- Year and Date
  2015-11-16
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] 楽曲のビート類似度及び潜在トピックの類似度に基づくDJプレイの自動化2015
- Author(s)
  平井辰典，土井啓成，森島繁生
- Organizer
  情報処理学会第108回音楽情報科学研究会
- Place of Presentation
  名古屋大学東山キャンパス（愛知県名古屋市）
- Year and Date
  2015-08-31
- Related Report
  2015 Annual Research Report
[Presentation] デモンストレーション：音楽情報処理の研究紹介XIV2015
- Author(s)
  阪上大地，竹川佳成，浜中雅俊，大野涼平，菅野沙也，清川隼矢，栗原拓也，黒田元気，小池宏幸，鈴木潤一，土橋彩香，長村佳祐，橋田光代，平井辰典
- Organizer
  情報処理学会第108回音楽情報科学研究会
- Place of Presentation
  名古屋大学東山キャンパス（愛知県名古屋市）
- Year and Date
  2015-08-31
- Related Report
  2015 Annual Research Report
[Presentation] Automatic Singing Voice to Music Video Generation via Mashup of Singing Video Clips2015
- Author(s)
  Tatsunori Hirai, Yukara Ikemiya, Kazuyoshi Yoshii, Tomoyasu Nakano, Masataka Goto, Shigeo Morishima
- Organizer
  The 12th Sound and Music Computing Conference (SMC 2015)
- Place of Presentation
  Maynooth University, Ireland
- Year and Date
  2015-07-29
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] MusicMean: Fusion-Based Music Generation2015
- Author(s)
  Tatsunori Hirai, Shoto Sasaki, Shigeo Morishima
- Organizer
  The 12th Sound and Music Computing Conference (SMC 2015)
- Place of Presentation
  Maynooth University, Ireland
- Year and Date
  2015-07-29
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] 音声と映像の変化に注目したフレーム間引きによる動画要約手法2015
- Author(s)
  平井辰典，森島繁生
- Organizer
  音学シンポジウム2015
- Place of Presentation
  電気通信大学（東京都調布市）
- Year and Date
  2015-05-23
- Related Report
  2015 Annual Research Report
[Presentation] VRMixer: 動画セグメンテーションによる動画コンテンツと現実世界の融合2015
- Author(s)
  平井辰典，中村聡史，湯村翼，森島繁生
- Organizer
  情報処理学会シンポジウム　インタラクション2015
- Place of Presentation
  日本科学未来館/東京国際交流館（東京都江東区）
- Year and Date
  2015-03-05 – 2015-03-07
- Related Report
  2014 Annual Research Report
[Presentation] Affective Music Recommendation System Based on the Mood of Input Video2015
- Author(s)
  Shoto Sasaki，Tatsunori Hirai，Hayato Ohya，Shigeo Morishima
- Organizer
  The 21st International Conference on Multimedia Modeling (MMM 2015)
- Place of Presentation
  University of Technology Sydney, Sydney, Australia
- Year and Date
  2015-01-05 – 2015-01-07
- Related Report
  2014 Annual Research Report
[Presentation] VRMixer: Mixing Video and Real World with Video Segmentation2014
- Author(s)
  Tatsunori Hirai，Satoshi Nakamura，Tsubasa Yumura，Shigeo Morishima
- Organizer
  11th Advances in Computer Entertainment Technology Conference (ACE 2014)
- Place of Presentation
  Funchal, Madeira
- Year and Date
  2014-11-11 – 2014-11-14
- Related Report
  2014 Annual Research Report
[Presentation] 歌手映像と歌声の解析に基づく音楽動画中の歌唱シーン検出2014
- Author(s)
  平井辰典，中野倫靖，後藤真孝，森島繁生
- Organizer
  OngaCRESTシンポジウム2014
- Place of Presentation
  明治大学中野キャンパス（東京都中野区）
- Year and Date
  2014-08-23
- Related Report
  2014 Annual Research Report
[Presentation] VRMixer: 動画と現実の融合による新たなコンテンツの生成2014
- Author(s)
  平井辰典，中村聡史，森島繁生，湯村翼
- Organizer
  OngaCRESTシンポジウム2014
- Place of Presentation
  明治大学中野キャンパス（東京都中野区）
- Year and Date
  2014-08-23
- Related Report
  2014 Annual Research Report
[Presentation] Efficient Video Viewing System for Racquet Sports with Automatic Summarization Focusing on Rally Scenes2014
- Author(s)
  Shunya Kawamura，Tsukasa Fukusato，Tatsunori Hirai，Shigeo Morishima
- Organizer
  ACM SIGGRAPH 2014
- Place of Presentation
  Vancouver Convenion Center, Vancouver, Canada
- Year and Date
  2014-08-10 – 2014-08-14
- Related Report
  2014 Annual Research Report
[Presentation] 歌手映像と歌声の解析に基づく音楽動画中の歌唱シーン検出手法の検討2014
- Author(s)
  平井辰典，中野倫靖，後藤真孝，森島繁生
- Organizer
  音学シンポジウム2014
- Place of Presentation
  日本大学文理学部百周年記念館（東京都世田谷区）
- Year and Date
  2014-05-24 – 2014-05-25
- Related Report
  2014 Annual Research Report
[Presentation] ラケットスポーツ動画の構造解析に基づく映像要約と鑑賞インタフェースの提案2014
- Author(s)
  河村俊哉、福里司、平井辰典、森島繁生
- Organizer
  情報処理学会第76回全国大会
- Place of Presentation
  東京電機大学東京千住キャンパス
- Year and Date
  2014-03-11
- Related Report
  2013 Annual Research Report
[Presentation] ラケットスポーツ動画の構造解析による映像要約手法の提案2013
- Author(s)
  河村俊哉、福里司、平井辰典、森島繁生
- Organizer
  情報処理学会第153回GCAD第189回CVIM合同研究会
- Place of Presentation
  九州大学西新プラザ
- Year and Date
  2013-11-29
- Related Report
  2013 Annual Research Report
[Remarks] 研究プロジェクト紹介ページ
- URL
  https://www.komazawa-u.ac.jp/~thirai/member/thirai/projects-j.html
- Related Report
  2015 Annual Research Report

音と映像の複合分析による動画コンテンツ理解の研究

Principal Investigator

平井 辰典 早稲田大学, 理工学術院, 特別研究員(PD)

¥3,300,000 (Direct Cost: ¥3,300,000)

Report

Research Products

[Journal Article] ラリーシーンに着目した映像自動要約によるラケットスポーツ動画鑑賞システム2015

Author(s)

Journal Title

NAID

Related Report

[Presentation] RSViewer: An Efficient Video Viewing System for Racquet Sports focusing on Rally Scenes2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] MusicMixer: Automatic DJ System Considering Beat and Latent Topic Similarity2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Frame-Wise Continuity-Based Video Summarization and Stretching2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] ラリーシーンに着目したラケットスポーツ動画鑑賞システム2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] MusicMixer: Computer-Aided DJ System based on an Automatic Song Mixing2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 楽曲のビート類似度及び潜在トピックの類似度に基づくDJプレイの自動化2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] デモンストレーション：音楽情報処理の研究紹介XIV2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Automatic Singing Voice to Music Video Generation via Mashup of Singing Video Clips2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] MusicMean: Fusion-Based Music Generation2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 音声と映像の変化に注目したフレーム間引きによる動画要約手法2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] VRMixer: 動画セグメンテーションによる動画コンテンツと現実世界の融合2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Affective Music Recommendation System Based on the Mood of Input Video2015

Author(s)

Organizer

平井辰典早稲田大学, 理工学術院, 特別研究員(PD)