人間の行動状況理解のための時空間の非線形圧縮による映像の情報要約機構

Research Project

Project/Area Number	11230205
Research Category	Grant-in-Aid for Scientific Research on Priority Areas (B)
Allocation Type	Single-year Grants
Review Section	Science and Engineering
Research Institution	Kyoto University
Principal Investigator	美濃導彦京都大学, 総合情報メディアセンター, 教授 (70166099)
Co-Investigator(Kenkyū-buntansha)	西口敏司京都大学, 法学研究科, 助手河原達也京都大学, 情報学研究科, 助教授 (00234104) 松山隆司京都大学, 情報学研究科, 教授 (10109035) 東海彰吾福井大学, 工学部, 講師 (50283627) 亀田能成京都大学, 総合情報メディアセンター, 助手 (70283637)
Project Period (FY)	1999 – 2001
Project Status	Completed (Fiscal Year 2001)
Budget Amount *help	¥38,300,000 (Direct Cost: ¥38,300,000) Fiscal Year 2001: ¥8,100,000 (Direct Cost: ¥8,100,000) Fiscal Year 2000: ¥16,100,000 (Direct Cost: ¥16,100,000) Fiscal Year 1999: ¥14,100,000 (Direct Cost: ¥14,100,000)
Keywords	講義アーカイブ / 自動撮影 / 分散マイク / カメラワーク / 状況理解 / 被写体選択 / 複数カメラ / 映像要約 / 視覚メディア / 時空間の圧縮 / 分散視覚 / 情報要約 / 教材制約 / スタジオ撮影 / シナリオ制約
Research Abstract	本計画研究の目標は、講義を様々な観点から蓄積してアーカイブ化するシステムを構築することと、利用者自身がアーカイブの中から必要な部分を選択し、要約する枠組みを提案することである。様々な利用者の目的に応じることを可能とするために、講義アーカイブにはできるだけ多様な映像を記録しておくことが望ましい。そこで、受講者の動きを反映した活性度に基づく撮影と、話者位置を推定することによって、質問等をしている受講者の撮影を実現した。まず、受講者の活性度に応じた多様な映像を撮影するために、確率密度関数に基くカメラワークの決定手法を提案した。できるだけ多くの受講者を映像フレーム内に納めるため、位置的に近くに存在する受講者群を被写体の侯補とし、多数の被写体侯補に対して、受講者の動きに基づく活性度を割り当てた。そして、被写体候補を活性度に基いて順序付けたのち、撮影に利用できるカメラ台数に応じて等分し、それぞれのグループ内に含まれる被写体侯補を選択して撮影する確率を高めた確率密度関数を各カメラに対して割り当てることにより、ある時刻における受講者を、活性度に基き多様に撮影する手法を提案した。また、質問等をしている受講者の表情を撮影するために、講義室に複数設置した分散マイクシステムを用いた話者位置推定を実現した。二本のマイクからなるマイク対に到達する音の位相差を利用して音源方向を推定し、これを複数のマイク対に対して求めることによって、各音源方向の交点付近の領域に存在する話者位置を推定することが可能となった。さらに、利用者自身が視聴したい講義映像の被写体の数や撮影サイズなどの傾向を設定することにより、上記の手法によって撮影した複数の講義映像から、設定を満たすように自動選択して提示する枠組みについて検討した。

Report

(3 results)

Research Products
(16 results)

All Other

All Publications (16 results)

[Publications] 椋木雅之, 西口敏司, 美濃導彦: "複数台カメラによる講義の観測と撮像"コンピュータビジョンとイメージメディア. 131-11. 77-84 (2002)
- Related Report
  2001 Annual Research Report
[Publications] Satoshi NISHIGUCHI et al.: "A Camera Controlling Method for Lecture Archive"Proc. International Workshop on Pattern Recognition and Understanding for Visual Information Media. 37-42 (2002)
- Related Report
  2001 Annual Research Report
[Publications] 美濃導彦: "遠隔講義システムにおけるパターン認識"InterLab. 38. 42 (2001)
- Related Report
  2001 Annual Research Report
[Publications] 毛呂功, 水田忍, 美濃導彦: "表面作成のための点群データからの人体姿勢推定"電子情報通信学会技術研究報告(パターン認識・メディア理解). PRMU2001-128. 105-112 (2001)
- Related Report
  2001 Annual Research Report
[Publications] 美濃導彦: "Imaging a 3D Lecture Room by Interpreting Its Dynamic Situation"Proceedings of The Fourth International Workshop on the Cooperative Distributed Vision. 371-412 (2001)
- Related Report
  2000 Annual Research Report
[Publications] 亀田能成: "High Speed 3D Reconstruction by Spatio-Temporal Division of Video Image Processing "IEICE Transaction on Information and Systems. Vol.E83-D No.7. 1422-1428 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 亀田能成: "A Live Video Imaging Method for Capturing Presentation Information in Distance Learning"IEEE International Conference on Multimedia and Expo. Vol.3. 1237-1240 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 河原達也: "Phonetic Tied-Mixtureモデルを用いた大語彙連続音声認識"電子情報通信学会論文誌. J83-DII. 2517-2525 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 河原達也: "Flexible mixed-initiative dialogue management using concept-level Confidence measures of speech recognizer output"Proc. Int'l Conf. Computational Linguistics (COLING). 467-473 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 河原達也: "A new phonetic tied-mixture model for efficient decoding"Proc. IEEE-ICASSP. 1269-1272 (2000)
- Related Report
  2000 Annual Research Report
[Publications] 亀田能成: "複数のカメラを用いた複数ユーザに対する講義の実時間映像化法"電子情報通信学会論文誌. Vol.J82-D-II,No.10. 1598-1605 (1999)
- Related Report
  1999 Annual Research Report
[Publications] 亀田能成: "講義の自動撮影系における音声・映像インデキシング"電子情報通信学会技術研究報告. PRMU99-258. 91-98 (2000)
- Related Report
  1999 Annual Research Report
[Publications] Takashi Matsuyama: "Active Image Capturing and Dynamic Scene Visualization by Cooperative Distributed Vision"Advanced Multimedia Content Processing (S. Nishio and F. Kishino eds.) Lecture Notes in Computer Science. No. 1554. 252-288 (1999)
- Related Report
  1999 Annual Research Report
[Publications] Takashi Matsuyama: "Human Head Tracking using Adaptive Appearance Models with a Fixed-Viewpoint Pan-Tilt-Zoom Camera"Proc. of 4th International Conference on Face and Gesture Recognition. (2000)
- Related Report
  1999 Annual Research Report
[Publications] 河原達也: "日本語ディクテーション基本ソフトウェア(98年度版)"日本音響学会誌. Vol.56,No.4. (2000)
- Related Report
  1999 Annual Research Report
[Publications] 河原達也: "音声認識結果の信頼度を用いた頑健な混合主導対話の実現法"情報処理学会研究報告. 2000-SLP-30-9. (2000)
- Related Report
  1999 Annual Research Report

人間の行動状況理解のための時空間の非線形圧縮による映像の情報要約機構

Principal Investigator

美濃 導彦 京都大学, 総合情報メディアセンター, 教授 (70166099)

¥38,300,000 (Direct Cost: ¥38,300,000)

Report

Research Products

[Publications] 椋木 雅之, 西口 敏司, 美濃 導彦: "複数台カメラによる講義の観測と撮像"コンピュータビジョンとイメージメディア. 131-11. 77-84 (2002)

Related Report

[Publications] Satoshi NISHIGUCHI et al.: "A Camera Controlling Method for Lecture Archive"Proc. International Workshop on Pattern Recognition and Understanding for Visual Information Media. 37-42 (2002)

Related Report

[Publications] 美濃 導彦: "遠隔講義システムにおけるパターン認識"InterLab. 38. 42 (2001)

Related Report

[Publications] 毛呂 功, 水田 忍, 美濃 導彦: "表面作成のための点群データからの人体姿勢推定"電子情報通信学会技術研究報告(パターン認識・メディア理解). PRMU2001-128. 105-112 (2001)

Related Report

[Publications] 美濃導彦: "Imaging a 3D Lecture Room by Interpreting Its Dynamic Situation"Proceedings of The Fourth International Workshop on the Cooperative Distributed Vision. 371-412 (2001)

Related Report

[Publications] 亀田能成: "High Speed 3D Reconstruction by Spatio-Temporal Division of Video Image Processing "IEICE Transaction on Information and Systems. Vol.E83-D No.7. 1422-1428 (2000)

Related Report

[Publications] 亀田能成: "A Live Video Imaging Method for Capturing Presentation Information in Distance Learning"IEEE International Conference on Multimedia and Expo. Vol.3. 1237-1240 (2000)

Related Report

[Publications] 河原達也: "Phonetic Tied-Mixtureモデルを用いた大語彙連続音声認識"電子情報通信学会論文誌. J83-DII. 2517-2525 (2000)

Related Report

[Publications] 河原達也: "Flexible mixed-initiative dialogue management using concept-level Confidence measures of speech recognizer output"Proc. Int'l Conf. Computational Linguistics (COLING). 467-473 (2000)

Related Report

[Publications] 河原達也: "A new phonetic tied-mixture model for efficient decoding"Proc. IEEE-ICASSP. 1269-1272 (2000)

Related Report

[Publications] 亀田能成: "複数のカメラを用いた複数ユーザに対する講義の実時間映像化法"電子情報通信学会論文誌. Vol.J82-D-II,No.10. 1598-1605 (1999)

Related Report

[Publications] 亀田能成: "講義の自動撮影系における音声・映像インデキシング"電子情報通信学会 技術研究報告. PRMU99-258. 91-98 (2000)

Related Report

[Publications] Takashi Matsuyama: "Active Image Capturing and Dynamic Scene Visualization by Cooperative Distributed Vision"Advanced Multimedia Content Processing (S. Nishio and F. Kishino eds.) Lecture Notes in Computer Science. No. 1554. 252-288 (1999)

Related Report

[Publications] Takashi Matsuyama: "Human Head Tracking using Adaptive Appearance Models with a Fixed-Viewpoint Pan-Tilt-Zoom Camera"Proc. of 4th International Conference on Face and Gesture Recognition. (2000)

Related Report

[Publications] 河原達也: "日本語ディクテーション基本ソフトウェア(98年度版)"日本音響学会誌. Vol.56,No.4. (2000)

Related Report

[Publications] 河原達也: "音声認識結果の信頼度を用いた頑健な混合主導対話の実現法"情報処理学会研究報告. 2000-SLP-30-9. (2000)

Related Report

美濃導彦京都大学, 総合情報メディアセンター, 教授 (70166099)

[Publications] 椋木雅之, 西口敏司, 美濃導彦: "複数台カメラによる講義の観測と撮像"コンピュータビジョンとイメージメディア. 131-11. 77-84 (2002)

[Publications] 美濃導彦: "遠隔講義システムにおけるパターン認識"InterLab. 38. 42 (2001)

[Publications] 毛呂功, 水田忍, 美濃導彦: "表面作成のための点群データからの人体姿勢推定"電子情報通信学会技術研究報告(パターン認識・メディア理解). PRMU2001-128. 105-112 (2001)

[Publications] 亀田能成: "講義の自動撮影系における音声・映像インデキシング"電子情報通信学会技術研究報告. PRMU99-258. 91-98 (2000)