Development of Semi-Supervised Learning Method using Compressed Video for Real-Time Animal Behavior Analysis

Research Project

Project/Area Number	23K24893
Project/Area Number (Other)	22H03637 (2022-2023)
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Multi-year Fund (2024) Single-year Grants (2022-2023)
Section	一般
Review Section	Basic Section 61030:Intelligent informatics-related
Research Institution	Hokkaido University
Principal Investigator	山本雅人北海道大学, 情報科学研究院, 教授 (40292057)
Co-Investigator(Kenkyū-buntansha)	飯塚博幸北海道大学, 人間知・脳・AI研究教育センター, 特任准教授 (30396832) 田村康将北海道大学, 情報科学研究院, 助教 (50773701) 野口渉北海道大学, 数理・データサイエンス教育研究センター, 特任助教 (60868082)
Project Period (FY)	2022-04-01 – 2025-03-31
Project Status	Granted (Fiscal Year 2024)
Budget Amount *help	¥17,420,000 (Direct Cost: ¥13,400,000、Indirect Cost: ¥4,020,000) Fiscal Year 2024: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2023: ¥7,800,000 (Direct Cost: ¥6,000,000、Indirect Cost: ¥1,800,000) Fiscal Year 2022: ¥8,450,000 (Direct Cost: ¥6,500,000、Indirect Cost: ¥1,950,000)
Keywords	深層学習 / 半教師あり学習 / 行動分類 / 群行動解析 / 行動認識 / 圧縮動画 / インタラクション / 群れ行動
Outline of Research at the Start	一般に，学習するデータに行動ラベルを付与しない教師なし学習では，収集されたデータをクラスタリングするなどして同様の性質をもつデータ群に分けることによって行う．しかし，動画の行動分類へ適用するためには，クラスタリングされたデータ群が分類したい行動と対応しているかの保証がないため望ましい分類にはならない．本研究では，この問題を解決するため，一連の動画データに対して，教師あり学習と教師なし学習の利点をうまく融合した半教師あり学習の開発を目指す．すなわち，「恒常的に撮影され，かつ，行動の区切りなどが明確でない動画データをリアルタイムに処理し，高速かつ高精度で行動分類を実現することは可能か」を追求する．
Outline of Annual Research Achievements	本研究は，カメラ等から取得した動画データに映る人や動物などの行動分類をリアルタイムに行うための手法として，多くの教師データを必要としない半教師あり学習にもとづく手法の開発を目的とする．具体的には，動物園の動物を対象とした実証実験をとおして，ホッキョクグマの常同行動の分析と行動分類，また，アジアゾウの採餌行動の分類などを行うシステムを開発する．さらに，動物の群れ行動の解析と動物とリアルタイムにインタラクションするシステムについても開発を行う．二年度目までに，ホッキョクグマ，アジアゾウについてのトラッキング精度の向上，およびホッキョクグマの常同行動検出と行動分類手法の開発，アジアゾウの採餌行動の検出を主に行った．その結果，ホッキョクグマの常同行動については，約91％程度，行動分類については約89％の精度で実現できることを示した．また，半教師あり学習については，単一のネットワークで I-frame, motion vector, residual を同時に処理する multi-stream single network (MussNet) を提案した． Early fusion は単一のネットワークですべての入力を処理するため, Late fusion よりも効率的な圧縮動画分類を実現できる一方, 学習後の分類精度が低くなってしまうという欠点がある. そこで, MussNet は単一ネットワークに対して, 内部でLate fusion がおこなわれるような学習をおこなう. 結果として, MussNet は Early fusion のような単一ネットワークによる効率性を維持しつつ, Late fusion と同等の精度を達成できることを示した．
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 半教師あり学習において， I-frame, motion vector, residual それぞれに対応するディープネットワークの予測をアンサンブルすることでより信頼のおける擬似ラベルを生成する Compressed Video Ensemble based Pseudo Labeling (CoVEnPL) を提案したことに加え，単一のネットワークで I-frame, motion vector, residual を同時に処理する multi-stream single network (MussNet) を提案したことで，効率よく学習する手法を開発したことが主な理由である．また，実際に動物園の動物に対して，ホッキョクグマの行動分類，特に常同行動の検出の精度向上，また，アジアゾウの個体識別を伴うトラッキングと採餌行動の検出を行う手法を開発し，それぞれ十分な精度を示したことによる．
Strategy for Future Research Activity	今後は，まずこれまでの成果を受けて，動物の検出(Detection)，個体識別(Identification)を伴うトラッキング(Tracking)，および，行動分類(Action Classification)を精度良く実現するための手法のまとめを行う．特に，飼育環境や天候の違いによるカメラの映像の不安定さがある状況で，精度良く実現するための手法を開発する．また，圧縮動画を用いた半教師あり学習の精度向上を実現した成果を使って，ラベル付けされていないデータも学習に使用して分類精度の向上を試みる．さらに，動物の行動分析や情報管理システムの開発なども行い，動物園での実証実験に繋げていく予定である．

Report

(2 results)

2023 Annual Research Report
2022 Annual Research Report

Research Products
(16 results)

All 2024 2023 2022

All Journal Article (9 results) (of which Peer Reviewed: 8 results, Open Access: 6 results) Presentation (7 results) (of which Invited: 2 results)

[Journal Article] Multi-Stream Single Network: Efficient Compressed Video Action Recognition With a Single Multi-Input Multi-Output Network2024
- Author(s)
  Terao Hayato、Noguchi Wataru、Iizuka Hiroyuki、Yamamoto Masahito
- Journal Title
  
  IEEE Access
  
  Volume: 12 Pages: 20983-20997
- DOI
  10.1109/access.2024.3363022
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] 動物園のゾウを対象にした単一監視カメラ映像によるトラッキング手法の検討と実践2024
- Author(s)
  西岡　拳, 野口　渉, 飯塚　博幸, 山本　雅人
- Journal Title
  
  情報処理学会デジタルプラクティス
  
  Volume: 採録決定
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Robust Animal Tracking and Stereotypical Behavior Detection Under Real Environment Using Temporal Averaging Background Subtraction2024
- Author(s)
  Wang Ruqin、Noguchi Wataru、Zhang Enzhi、Osada Koki、Yamamoto Masahito
- Journal Title
  
  IntelliSys, Lecture Notes in Networks and Systems
  
  Volume: 823 Pages: 857-875
- DOI
  10.1007/978-3-031-47724-9_57
- ISBN
  9783031477232, 9783031477249
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] 4 動物園におけるAI による動物の行動検出技術の開発と実装2024
- Author(s)
  山本雅人
- Volume
  4月号
- Pages
  944
- DOI
  10.18958/7431-00002-0001391-00
- Related Report
  2023 Annual Research Report
[Journal Article] Deep-learning-based Multi-behavior Classification of Animals for Efficient Health and Welfare Monitoring2023
- Author(s)
  Wang Ruqin、Noguchi Wataru、Osada Koki、Yamamoto Masahito
- Journal Title
  
  Sensors and Materials
  
  Volume: 35 Issue: 11 Pages: 3947
- DOI
  10.18494/SAM4521
- ISSN
  0914-4935, 2435-0869
- Year and Date
  2023-11-30
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Detecting Eating Behavior of Elephants in a Zoo Using Temporal Action Localization2023
- Author(s)
  Nishioka Ken、Noguchi Wataru、Izuka Hiroyuki、Yamamoto Masahito
- Journal Title
  
  Sensors and Materials
  
  Volume: 35 Issue: 11 Pages: 3927
- DOI
  10.18494/SAM4501
- ISSN
  0914-4935, 2435-0869
- Year and Date
  2023-11-30
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Compressed video ensemble based pseudo-labeling for semi-supervised action recognition2022
- Author(s)
  Terao Hayato、Noguchi Wataru、Iizuka Hiroyuki、Yamamoto Masahito
- Journal Title
  
  Machine Learning with Applications
  
  Volume: 9 Pages: 100336-100336
- DOI
  10.1016/j.mlwa.2022.100336
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Superposition mechanism as a neural basis for understanding others2022
- Author(s)
  Noguchi Wataru、Iizuka Hiroyuki、Yamamoto Masahito、Taguchi Shigeru
- Journal Title
  
  Scientific Reports
  
  Volume: 12 Issue: 1
- DOI
  10.1038/s41598-022-06717-3
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Learning shared embedding representation of motion and text using contrastive learning2022
- Author(s)
  Horie Junpei、Noguchi Wataru、Iizuka Hiroyuki、Yamamoto Masahito
- Journal Title
  
  Artificial Life and Robotics
  
  Volume: 28 Issue: 1 Pages: 148-157
- DOI
  10.1007/s10015-022-00840-0
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] 機械学習を利用した実験動物の行動解析2024
- Author(s)
  小林幸司、坂本直観、宮崎優介、原口大雅、山本雅人、村田幸久
- Organizer
  第6回日本獣医薬理毒性学会　春季研究会
- Related Report
  2023 Annual Research Report
[Presentation] 行動分類モデルに基づく動物園内デジタル飼育管理システムの開発2024
- Author(s)
  菊地真優，野口渉，田村康将，山本雅人
- Organizer
  第23回複雑系マイクロシンポジウム
- Related Report
  2023 Annual Research Report
[Presentation] 人工知能を用いたマウスの痛み表情の評価2023
- Author(s)
  小林幸司, 山本雅人, 村田幸久
- Organizer
  第70回実験動物学会
- Related Report
  2023 Annual Research Report
[Presentation] マウスのひっかき行動を長時間解析できる機械学習法の確立2023
- Author(s)
  小林幸司、宮崎優介、坂本直観、山本雅人、村田幸久
- Organizer
  第72回日本アレルギー学会学術大会
- Related Report
  2023 Annual Research Report
[Presentation] Crossover using Backpropagation for Evolutionary Artificial Neural Networks2022
- Author(s)
  Yasumasa Tamura
- Organizer
  Robomec 2022
- Related Report
  2022 Annual Research Report
[Presentation] Path planning in multi-robot systems2022
- Author(s)
  Yasumasa Tamura
- Organizer
  やわらか3D共創コンソーシアム第4回ソフトマシン部会
- Related Report
  2022 Annual Research Report
- Invited
[Presentation] 人工知能技術による社会応用と技能伝承2022
- Author(s)
  山本雅人
- Organizer
  Conference on 4D and Functional Fabrication 2022
- Related Report
  2022 Annual Research Report
- Invited

Development of Semi-Supervised Learning Method using Compressed Video for Real-Time Animal Behavior Analysis

Principal Investigator

山本 雅人 北海道大学, 情報科学研究院, 教授 (40292057)

¥17,420,000 (Direct Cost: ¥13,400,000、Indirect Cost: ¥4,020,000)

Current Status of Research Progress

Reason

Report

Research Products

[Journal Article] Multi-Stream Single Network: Efficient Compressed Video Action Recognition With a Single Multi-Input Multi-Output Network2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 動物園のゾウを対象にした単一監視カメラ映像によるトラッキング手法の検討と実践2024

Author(s)

Journal Title

Related Report

[Journal Article] Robust Animal Tracking and Stereotypical Behavior Detection Under Real Environment Using Temporal Averaging Background Subtraction2024

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] 4 動物園におけるAI による動物の行動検出技術の開発と実装2024

Author(s)

Volume

Pages

DOI

Related Report

[Journal Article] Deep-learning-based Multi-behavior Classification of Animals for Efficient Health and Welfare Monitoring2023

Author(s)

Journal Title

DOI

ISSN

Year and Date

Related Report

[Journal Article] Detecting Eating Behavior of Elephants in a Zoo Using Temporal Action Localization2023

Author(s)

Journal Title

DOI

ISSN

Year and Date

Related Report

[Journal Article] Compressed video ensemble based pseudo-labeling for semi-supervised action recognition2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Superposition mechanism as a neural basis for understanding others2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Learning shared embedding representation of motion and text using contrastive learning2022

Author(s)

Journal Title

DOI

Related Report

[Presentation] 機械学習を利用した実験動物の行動解析2024

Author(s)

Organizer

Related Report

[Presentation] 行動分類モデルに基づく動物園内デジタル飼育管理システムの開発2024

Author(s)

Organizer

Related Report

[Presentation] 人工知能を用いたマウスの痛み表情の評価2023

Author(s)

Organizer

Related Report

[Presentation] マウスのひっかき行動を長時間解析できる機械学習法の確立2023

Author(s)

Organizer

Related Report

[Presentation] Crossover using Backpropagation for Evolutionary Artificial Neural Networks2022

Author(s)

Organizer

Related Report

[Presentation] Path planning in multi-robot systems2022

Author(s)

山本雅人北海道大学, 情報科学研究院, 教授 (40292057)