Zero-shot recognition of generic objects
Project/Area Number | 19K24344 |
Research Category | Grant-in-Aid for Research Activity Start-up |
Allocation Type | Multi-year Fund |
Review Section |
1001:Information science, computer engineering, and related fields
|
Research Institution | Kobe University |
Principal Investigator |
Project Period (FY) | 2019-08-30 – 2022-03-31 |
Project Status | Granted (Fiscal Year 2020) |
Budget Amount | ¥2,860,000 (Direct Cost: ¥2,200,000, Indirect Cost: ¥660,000) |
Fiscal Year 2020: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2019: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Keywords | Zero-Shot Learning / Self-Supervised Learning / Visual Representation / Feature Extraction / Semantic representation / Resource Efficiency / CNN / Deep learning / Computational efficiency / Computer vision / Object recognition |
Outline of Research at the Start |
This study focuses on deriving new principles for optimization and semantic feature learning applied to generic object recognition. On the optimization front, we will focus on improving the computational and algorithmic efficiency of training deep learning models, in order to expand our search space by enabling quicker iteration over different architectural designs. On the semantic learning front, we aim to achieve a better understanding of the visual features that can be derived from semantic data, which we believe to be the key missing element enabling practical Zero-Shot recognition.
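As a toy illustration of the zero-shot setting described above, an unseen class can be recognized by matching a visual feature against per-class semantic embeddings (e.g. attribute vectors) with cosine similarity. All class names, attribute values, and the feature vector below are hypothetical, not from the project:

```python
import numpy as np

def zero_shot_predict(visual_feature, class_embeddings):
    """Return the class whose semantic embedding has the highest
    cosine similarity with the visual feature."""
    v = visual_feature / np.linalg.norm(visual_feature)
    names = list(class_embeddings)
    E = np.stack([class_embeddings[n] for n in names])
    E = E / np.linalg.norm(E, axis=1, keepdims=True)  # row-normalize
    scores = E @ v                                    # cosine similarities
    return names[int(np.argmax(scores))]

# Hand-made attribute vectors (hypothetical): [striped, four-legged, flying]
classes = {
    "zebra": np.array([1.0, 1.0, 0.0]),
    "eagle": np.array([0.0, 0.0, 1.0]),
    "horse": np.array([0.0, 1.0, 0.0]),
}
feature = np.array([0.9, 0.8, 0.1])         # extracted visual feature (toy)
print(zero_shot_predict(feature, classes))  # → zebra
```

No "zebra" images are needed at training time; only its attribute vector, which is the essence of the zero-shot recipe.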
Outline of Annual Research Achievements |
In this academic year, our research efforts were focused along two axes. Along the first axis, self-supervised visual representations of the Generic Object ZSL (GOZ) Dataset images were proposed and compared with traditional supervised representations on the GOZ benchmark. The self-supervised representations tend to perform better on the standard zero-shot learning task, but do not match the supervised representations in the generalized zero-shot learning setting. A promising research question we have identified is how to close this gap: supervised representations cluster tightly and perform well on the training classes, whereas the more scattered self-supervised representations retain higher accuracy on the unseen test classes. The second axis concerns the computational efficiency of Convolutional Neural Network (CNN) training. Indeed, training CNNs on ImageNet-scale datasets is computationally very expensive, which hinders the investigation of different training and fine-tuning strategies. Towards that end, we have focused our efforts on reducing both the amount of computation and the memory footprint of CNN training, in order to enable larger-batch training and hence shorter training times.
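A back-of-the-envelope sketch of why reducing the memory footprint enables larger batches: if activation storage dominates GPU memory, halving the bytes per value (e.g. 16-bit instead of 32-bit storage) roughly doubles the batch that fits. All sizes below are hypothetical, and the model deliberately ignores weights and optimizer state:

```python
def max_batch_size(memory_budget_bytes, activations_per_sample, bytes_per_value):
    """Largest batch whose activation storage fits the budget
    (toy model: counts activation memory only)."""
    return memory_budget_bytes // (activations_per_sample * bytes_per_value)

BUDGET = 16 * 1024**3   # 16 GiB of GPU memory (hypothetical)
ACTS = 25_000_000       # activation values stored per sample (hypothetical)

fp32_batch = max_batch_size(BUDGET, ACTS, 4)  # 4 bytes per float32
fp16_batch = max_batch_size(BUDGET, ACTS, 2)  # 2 bytes per float16
print(fp32_batch, fp16_batch)                 # 171 343
```

The same budget fits roughly twice the batch at half the precision, which is one route (among others, such as recomputing activations) to the shorter training times mentioned above.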
Current Status of Research Progress |
3: Progress in research has been slightly delayed.
Reason
Zero-Shot Learning and Self-Supervised Learning of visual representations are two topics at the frontier of computer vision research. Catching up with recent research on self-supervision while evaluating its efficiency for Zero-Shot Learning has been a very time-consuming task.
Strategy for Future Research Activity |
In our final year of study, we plan to publish the results of our investigation, including a publication on GPU memory optimization and self-supervised visual representation learning.
Report (2 results)
Research Products (3 results)