2021 Fiscal Year Annual Research Report

Removing the Burden of Data Labeling: Automatic Surgical Video Understanding with Unsupervised Learning

Research Project

Project/Area Number	20K23343
Research Institution	Osaka University
Principal Investigator	李良知大阪大学, データビリティフロンティア機構, 特任助教(常勤) (10875545)
Project Period (FY)	2020-09-11 – 2022-03-31
Keywords	Few-shot Learning / Semantic Segmentation / Surgical Videos / Unsupervised Learning / Medical Images / Computer Vision / Surgical Analysis / Deep Learning
Outline of Annual Research Achievements	As most of the existing automatic surgical video analysis models require a large number of manually labeled data for training, this project aims to design a learning method to perform spatial and temporal segmentations with smaller requirements of humans’ input. During this project, I mainly studied the following sub-topics towards the goal. 1.Surgical images/frames analysis using very few training samples. I developed an explainable few-shot learning method to give accurate recognition labels (as well as the explanations) to the input samples, which is very important for risk-sensitive areas like medicine. This work is presented at CVPRW 2021. 2.Surgical images/frames semantic segmentation in a weakly-supervised way. I developed a new training strategy for video semantic segmentation models to utilized unlabeled data to improve their segmentation performance. This work is published in IEEE Access. 3. Computer vision models that can output prediction results as well as visual explanations for not only medical images but also natural images. The explanations can help downstream tasks like semantic segmentation, etc. Therefore, it has the potential to enable the weakly-supervised surgical images/frames semantic segmentation when only frame-level labels are available. This work is presented at IEEE ICCV. 4.Surgical videos temporal analysis using no labels. I developed a retrieval-based method to automatically predict surgical duration. This work is under submission.

Research Products

(1 results)

All Presentation (1 results) (of which Int'l Joint Research: 1 results)

[Presentation] SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition2021
- Author(s)
  Liangzhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara
- Organizer
  IEEE/CVF International Conference on Computer Vision (ICCV)
- Int'l Joint Research