2023 Fiscal Year Research-status Report

High frame rate and ultra-Low delay video sensing system for interactive applications

Research Project

Project/Area Number	21K11816
Research Institution	Waseda University
Principal Investigator	池永剛早稲田大学, 理工学術院(情報生産システム研究科・センター), 教授 (90367178)
Project Period (FY)	2021-04-01 – 2025-03-31
Keywords	映像認識 / 超低遅延システム / ハードウェアアーキテクチャ / 映像センシング / FA / ロボティクス / スーパーピクセル / LK法
Outline of Annual Research Achievements	インタラクティブ応用のための次世代映像センシングのコアとなる、超高速（1000fps）、超低遅延(1から2ミリ秒)映像認識システム実現のための基盤技術創出を行うことを目的として、様々な映像センシングシステム実現の鍵となる、画像の特徴を用いたマッチング、追跡、分類等の基本処理に対し、実環境下で高い精度が得られる超高速・超低遅延向き映像認識アルゴリズム構成法とそれに基づくハードウェアアーキテクチャの実現を目指した取り組みを行なった。マッチング処理に関しては、オリジナル画像の特徴を保ったまま情報削減が可能なスーパーピクセル処理を対象とした取り組みを行った。空間上の繰り返し処理を時間方向に展開する独自の手法を提案し、FPGA上に実装した結果、ソフトウェアベースのアルゴリズムSLIC（Simple Linear Iterative Clustering)手法と同等の精度を保ったまま、0.985msで処理可能な事を実証した。一方、追跡処理に関しては、ロバストな追跡アルゴリズムとして知られるパーティクルフィルターを用いた多物体追跡を、0.929msで処理可能な技術を考案した。さらに分類処理に関しては、深層学習を用いた果物の欠陥検出を対象とした取り組みを行った。欠陥分類と欠陥位置検出を並行して行う手法を考案し、FPGA実装を行った結果、1.3%の精度低下で、0.948msで処理可能なことを確認した。さらに将来の応用を睨んだ関連技術として、深層学習に基づく人物や物体姿勢推定などに関する検討を行なった。以上の関連成果を８件の原著学術論文、および７件の国際会議にて発信した。また、関連企業との技術交流を積極的に行い、今後、これらの技術を産業につなげていく上で重要となる方向性に関して多くの知見を得た。
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason 最終年度(2023年度）も、前年度までと同様に、超高速（1000fps）、超低遅延(1から2ミリ秒)映像認識システム実現のための基盤技術実現の具体的な目標として掲げる、画像の特徴を用いたマッチング、追跡、分類の３つの基本処理に対する取り組みを行った。FAやロボティクス等の応用を想定し、その実現の鍵となる幅広い課題設定を行い、それぞれを連携させつつ並行して取り組んだ。超低遅延処理実現のために、ブレークスルーとなるストリーム型のアルゴリズム構成法を考案した。さらに、ハードウェア設計並びにFPGA上の実装を行い、実証を終えているテーマもあり、今後、産業化に向けた基盤を構築することができたと考えている。また、これらの成果を今年度は８件（３年間のと合計で１８件）の学術論文として発信することができたが、インパクトファクターの高い、Pattern Recognition (IF: 8.0)やIEEE Transactions on Instrumentation and Measurement(IF: 5.3)などの論文誌に採択されており、本取り組みが世界的に高い技術的インパクトを与えつつあると捉えている。本取り組みは、既に世界的に注目を浴びており、2023年11月にICSMD2023とIWACIII2023の２つの国際会議から招待され、基調講演を行なった。また、2024年3月に中国のトップ大学である、南京大学、東南大学、復旦大学より招待を受け、技術講演を行った。また、関連企業との技術交流により、FAやロボティクスなど実産業につながる取り組みを行った。以上の様に、研究成果面や情報発信面で、想定以上の順調な進捗が果たせていると考えている。
Strategy for Future Research Activity	３年間の取り組みを基盤として、今後は、学術面および産業面の両面からそれらの基盤技術を発展させていく予定である。学術面では、ICSMDやIWACIIIの国際会議の基調講演や各種技術講演等で、多様なインタラクティブ映像認識応用のコアとなる人間の脳を模擬した新たな概念として、人工視覚脳コンピュータ(ABC: Artificial Brain-vision Computer)の提案を行っており、特に小脳の機能を司る超低遅延(1ミリ秒)映像センシング技術の創出を目指す。このため、ルールベース処理、学習ベース処理とそれらの融合処理の３つの対象に対し、実環境下で高い精度が得られる超低遅延向き映像認識アルゴリズム構成法とそれに基づくハードウェアアーキテクチャを確立する。産業面では、FAに加えて、ロボティクス等を含めた幅広い応用展開を試みる。この３年間は、ルールーベース処理として、ハフ変換やLK法などの２次元処理を主な対象としていたが、今後は、３次元のカメラ姿勢推定や３次元の物体姿勢推定、SLAM (Simultaneous Localization And Mapping)や３次元リコンストラクション処理など、３次元に焦点を当てた検討を行う。また、学習ベース処理としては、より複雑かつ大規模な深層学習ネットワークが必須となる応用の実現を検討する。FPGAのリソース（乗算器等）は限られているため、全体の処理を、精度はそれほど必要ないが全フレーム処理が必須となる部分と高精度化がキーとなる部分に分け、前者は、FPGA上で毎フレーム処理し、後者はGPU上で初期フレームとキーフレームのみ処理するヘテロアーキテクチャ上での実現を試みる。また、各種成果を産業に結びつけるための活動も展開していく。
Causes of Carryover	採択されたカナダ開催の国際会議ACM Multimedia 2023参加のための出張旅費を想定していたが、第一著者の学生のvisaが取得できず、航空機代や宿泊代の約56万円が支出できなかったため、次年度支出に回すこととなった。残額は、採択されたIET Image Processingの論文投稿費385,832円、及び国際会議ICASSP2024の参加費119,354円等で、支出予定である。

Research Products
(19 results)

All 2024 2023 Other

All Int'l Joint Research (1 results) Journal Article (7 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 7 results) Presentation (9 results) (of which Int'l Joint Research: 9 results, Invited: 2 results) Remarks (2 results)

[Int'l Joint Research] 南京大学/東南大学/西安電子科技大学(中国)
- Country Name
  CHINA
- Counterpart Institution
  南京大学/東南大学/西安電子科技大学
[Journal Article] Grid Sample Based Temporal Iteration for Fully Pipelined 1-ms SLIC Superpixel Segmentation System2024
- Author(s)
  LI Yuan、HU Tingting、FUCHIKAMI Ryuji、IKENAGA Takeshi
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E107.D Pages: 515～524
- DOI
  10.1587/transinf.2023EDP7128
- Peer Reviewed
[Journal Article] Global to multi‐scale local architecture with hardwired CNN for 1‐ms tomato defect detection2024
- Author(s)
  Li Yuan、Hu Tingting、Fuchikami Ryuji、Ikenaga Takeshi
- Journal Title
  
  IET Image Processing
  
  Volume: - Pages: -
- DOI
  10.1049/ipr2.13084
- Peer Reviewed
[Journal Article] JoyPose: Jointly learning evolutionary data augmentation and anatomy-aware global?local representation for 3D human pose estimation2024
- Author(s)
  Du Songlin、Yuan Zhiwei、Lai Peifu、Ikenaga Takeshi
- Journal Title
  
  Pattern Recognition
  
  Volume: 147 Pages: 110116～110116
- DOI
  10.1016/j.patcog.2023.110116
- Peer Reviewed / Int'l Joint Research
[Journal Article] Kinematics-aware spatial-temporal feature transform for 3D human pose estimation2024
- Author(s)
  Du Songlin、Yuan Zhiwei、Ikenaga Takeshi
- Journal Title
  
  Pattern Recognition
  
  Volume: 150 Pages: 110316～110316
- DOI
  10.1016/j.patcog.2024.110316
- Peer Reviewed / Int'l Joint Research
[Journal Article] Semi-supervised attention based merging network with hybrid dilated convolution module for few-shot HDR video reconstruction2023
- Author(s)
  Zhao Fengshan、Liu Qin、Ikenaga Takeshi
- Journal Title
  
  Multimedia Tools and Applications
  
  Volume: 83 Pages: 37409～37430
- DOI
  10.1007/s11042-023-16885-7
- Peer Reviewed / Int'l Joint Research
[Journal Article] Motion-aware and data-independent model based multi-view 3D pose refinement for volleyball spike analysis2023
- Author(s)
  Liu Yanchao、Cheng Xina、Ikenaga Takeshi
- Journal Title
  
  Multimedia Tools and Applications
  
  Volume: 83 Pages: 22995～23018
- DOI
  10.1007/s11042-023-16369-8
- Peer Reviewed / Int'l Joint Research
[Journal Article] Temporal Prediction-Based Temporal Iterative Tracking and Parallel Motion Estimation for a 1-ms Rotation-Robust LK-Based Tracking System2023
- Author(s)
  Hu Tingting、Fuchikami Ryuji、Ikenaga Takeshi
- Journal Title
  
  IEEE Transactions on Instrumentation and Measurement
  
  Volume: 72 Pages: 1～14
- DOI
  10.1109/tim.2023.3295456
- Peer Reviewed
[Presentation] Artificial brain-vision computer for creating seamless interactive applications between real and virtual worlds2023
- Author(s)
  Takeshi Ikenaga
- Organizer
  The 8th International Workshop on Advanced Computational Intelligence and Intelligent Informatics (IWACIII 2023)
- Int'l Joint Research / Invited
[Presentation] Artificial brain-vision computer for creating seamless interactive applications between real and virtual worlds2023
- Author(s)
  Takeshi Ikenaga
- Organizer
  The fourth International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD 2023)
- Int'l Joint Research / Invited
[Presentation] A Figure Skating Jumping Dataset for Replay-Guided Action Quality Assessment2023
- Author(s)
  Yanchao Liu, Xina Cheng, Takeshi Ikenaga
- Organizer
  ACM Multimedia (MM2023)
- Int'l Joint Research
[Presentation] HDR-LMDA: A LOCAL AREA-BASED MIXED DATA AUGMENTATION METHOD FOR HDR VIDEO RECONSTRUCTION2023
- Author(s)
  Fengshan Zhao, Qin Liu, Takeshi Ikenaga
- Organizer
  IEEE International Conference on Image Processing (ICIP2023)
- Int'l Joint Research
[Presentation] Hierarchical Spatio-temporal Neural Network with Displacement Based Refinement for Monocular Head Pose Prediction2023
- Author(s)
  Zhe Xu, Yuan Li, Yuhong Li, Songlin Du, Takeshi Ikenaga
- Organizer
  18th IAPR International Conference on Machine Vision Applications (MVA2023)
- Int'l Joint Research
[Presentation] Intra-frame Skeleton Constraints Modeling and Grouping Strategy Based Multi-scale Graph Convolution Network for 3D Human Motion Prediction2023
- Author(s)
  Zhihan Zhuang, Yuan Li, Songlin Du, Takeshi Ikenaga
- Organizer
  18th IAPR International Conference on Machine Vision Applications (MVA2023)
- Int'l Joint Research
[Presentation] Grid Sample Based Temporal Iteration and Compactness-coefficient Distance for High Frame and Ultra-low Delay SLIC Segmentation System2023
- Author(s)
  Yuan Li, Tingting Hu, Ryuji Fuchikami, Takeshi Ikenaga
- Organizer
  18th IAPR International Conference on Machine Vision Applications (MVA2023)
- Int'l Joint Research
[Presentation] Multi-prior based Multi-scale Condition Network for Single-image HDR Reconstruction2023
- Author(s)
  Haorong Jiang, Fengshan Zhao, Junda Liao, Qin Liu, Takeshi Ikenaga
- Organizer
  18th IAPR International Conference on Machine Vision Applications (MVA2023)
- Int'l Joint Research
[Presentation] Pyramid Spatial Feature Transform And Shared-Offsets Deformable Alignment Based Convolutional Network for HDR Imaging2023
- Author(s)
  Junda Liao, Qin Liu, Takeshi Ikenaga
- Organizer
  IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)
- Int'l Joint Research
[Remarks] 研究室Web
- URL
  http://ikenaga.w.waseda.jp/
[Remarks] 早稲田大学研究者データーベース
- URL
  https://w-rdb.waseda.jp/html/100000676_ja.html

2023 Fiscal Year Research-status Report

High frame rate and ultra-Low delay video sensing system for interactive applications

Principal Investigator

池永 剛 早稲田大学, 理工学術院(情報生産システム研究科・センター), 教授 (90367178)

Current Status of Research Progress

Reason

Research Products

[Int'l Joint Research] 南京大学/東南大学/西安電子科技大学(中国)

Country Name

Counterpart Institution

[Journal Article] Grid Sample Based Temporal Iteration for Fully Pipelined 1-ms SLIC Superpixel Segmentation System2024

Author(s)

Journal Title

DOI

[Journal Article] Global to multi‐scale local architecture with hardwired CNN for 1‐ms tomato defect detection2024

Author(s)

Journal Title

DOI

[Journal Article] JoyPose: Jointly learning evolutionary data augmentation and anatomy-aware global?local representation for 3D human pose estimation2024

Author(s)

Journal Title

DOI

[Journal Article] Kinematics-aware spatial-temporal feature transform for 3D human pose estimation2024

Author(s)

Journal Title

DOI

[Journal Article] Semi-supervised attention based merging network with hybrid dilated convolution module for few-shot HDR video reconstruction2023

Author(s)

Journal Title

DOI

[Journal Article] Motion-aware and data-independent model based multi-view 3D pose refinement for volleyball spike analysis2023

Author(s)

Journal Title

DOI

[Journal Article] Temporal Prediction-Based Temporal Iterative Tracking and Parallel Motion Estimation for a 1-ms Rotation-Robust LK-Based Tracking System2023

Author(s)

Journal Title

DOI

[Presentation] Artificial brain-vision computer for creating seamless interactive applications between real and virtual worlds2023

Author(s)

Organizer

[Presentation] Artificial brain-vision computer for creating seamless interactive applications between real and virtual worlds2023

Author(s)

Organizer

[Presentation] A Figure Skating Jumping Dataset for Replay-Guided Action Quality Assessment2023

Author(s)

Organizer

[Presentation] HDR-LMDA: A LOCAL AREA-BASED MIXED DATA AUGMENTATION METHOD FOR HDR VIDEO RECONSTRUCTION2023

Author(s)

Organizer

[Presentation] Hierarchical Spatio-temporal Neural Network with Displacement Based Refinement for Monocular Head Pose Prediction2023

Author(s)

Organizer

[Presentation] Intra-frame Skeleton Constraints Modeling and Grouping Strategy Based Multi-scale Graph Convolution Network for 3D Human Motion Prediction2023

Author(s)

Organizer

[Presentation] Grid Sample Based Temporal Iteration and Compactness-coefficient Distance for High Frame and Ultra-low Delay SLIC Segmentation System2023

Author(s)

Organizer

[Presentation] Multi-prior based Multi-scale Condition Network for Single-image HDR Reconstruction2023

Author(s)

Organizer

[Presentation] Pyramid Spatial Feature Transform And Shared-Offsets Deformable Alignment Based Convolutional Network for HDR Imaging2023

Author(s)

Organizer

[Remarks] 研究室Web

URL

[Remarks] 早稲田大学研究者データーベース

URL

池永剛早稲田大学, 理工学術院(情報生産システム研究科・センター), 教授 (90367178)