Super multi-class object recognition system using a large amount of image data

Research Project

Project/Area Number	24680017
Research Category	Grant-in-Aid for Young Scientists (A)
Allocation Type	Partial Multi-year Fund
Research Field	Perception information processing/Intelligent robotics
Research Institution	The University of Tokyo
Principal Investigator	HARADA Tatsuya 東京大学, 情報理工学(系)研究科, 教授 (60345113)
Project Period (FY)	2012-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount *help	¥26,520,000 (Direct Cost: ¥20,400,000、Indirect Cost: ¥6,120,000) Fiscal Year 2014: ¥8,710,000 (Direct Cost: ¥6,700,000、Indirect Cost: ¥2,010,000) Fiscal Year 2013: ¥11,310,000 (Direct Cost: ¥8,700,000、Indirect Cost: ¥2,610,000) Fiscal Year 2012: ¥6,500,000 (Direct Cost: ¥5,000,000、Indirect Cost: ¥1,500,000)
Keywords	画像認識 / コンピュータビジョン / 機械学習 / 人工知能 / ビッグデータ / パターン認識
Outline of Final Research Achievements	The goal of our research is the construction of the super multi-class generic object recognition system by learning the relationship between a large amount of image and text data statistically. A method to continuously learn the classifiers from a huge amount of data without breakdown is crucial to realize this system. If there are many objects in one image, it is important to recognize where and what they are. A cost to construct high quality training dataset is so expensive that reducing the construction cost is also crucial. Moreover, a technique to find novel classes is a bottleneck for the continuously growing recognition system. In this research, we have tackled the above mentioned topics and produced some results.

Report

(4 results)

2014 Annual Research Report Final Research Report ( PDF )
2013 Annual Research Report
2012 Annual Research Report

Research Products
(21 results)

All 2014 2013 2012 Other

All Journal Article (9 results) (of which Peer Reviewed: 9 results, Open Access: 2 results) Presentation (10 results) (of which Invited: 1 results) Remarks (2 results)

[Journal Article] Three Guidelines of Online Learning for Large-Scale Visual Recognition2014
- Author(s)
  Yoshitaka Ushiku, Masatoshi Hidaka, Tatsuya Harada
- Journal Title
  
  The Twenty-Seventh IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2014)
  
  Volume: 1 Pages: 3574-3581
- DOI
  10.1109/cvpr.2014.457
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Hard Negative Classes for Multiple Object Detection2014
- Author(s)
  Asako Kanezaki, Sho Inaba, Yoshitaka Ushiku, Yuya Yamashita, Hiroshi Muraoka, Yasuo Kuniyoshi, Tatsuya Harada
- Journal Title
  
  2014 IEEE International Conference on Robotics and Automation (ICRA 2014)
  
  Volume: 1 Pages: 3066-3073
- DOI
  10.1109/icra.2014.6907300
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Elastic Net Constraints for Shape Matching2013
- Author(s)
  Emanuele Rodola, Andrea Torsello, Tatsuya Harada, Yasuo Kuniyoshi, Daniel Cremers
- Journal Title
  
  The 14th International Conference on Computer Vision (ICCV 2013)
  
  Volume: 1 Pages: 1169-1176
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Shape Matching using Vector Extrapolation2013
- Author(s)
  Emanuele Rodola, Tatsuya Harada, Yasuo Kuniyoshi, and Daniel Cremers
- Journal Title
  
  The British Machine Vision Conference (BMVC 2013)
  
  Volume: 1
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Weakly-supervised Multi-class Object Detection Using Multi-type 3D Features2013
- Author(s)
  Asako Kanezaki, Yasuo Kuniyoshi, and Tatsuya Harada
- Journal Title
  
  the 21th Annual ACM International Conference on Multimedia (ACMMM 2013)
  
  Volume: 1 Pages: 605-608
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Causal Flow2012
- Author(s)
  Yuya Yamashita, Tatsuya Harada, and Yasuo Kuniyoshi
- Journal Title
  
  IEEE TRANSACTIONS ON MULTIMEDIA
  
  Volume: 14 Issue: 3 Pages: 619-629
- DOI
  10.1109/tmm.2012.2191396
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Image Annotation for Automatic Sentence Generation2012
- Author(s)
  Yoshitaka Ushiku, Tatsuya Harada, and Yasuo Kuniyoshi
- Journal Title
  
  the 20th Annual ACM International Conference on Multimedia (ACMMM 2012)
  
  Pages: 549-558
- DOI
  10.1145/2393347.2393424
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Visual Anomaly Detection from Small Samples for Mobile Robots2012
- Author(s)
  Hiroharu Kato, Tatsuya Harada, and Yasuo Kuniyoshi
- Journal Title
  
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2012)
  
  Volume: - Pages: 3171-3178
- DOI
  10.1109/iros.2012.6386031
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Dialog System Using Real-Time Crowdsourcing and Twitter Large-Scale Corpus2012
- Author(s)
  Fumihiro Bessho, Tatsuya Harada and Yasuo Kuniyoshi
- Journal Title
  
  Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2012)
  
  Pages: 227-231
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Presentation] MIL at ImageCLEF 2014: Scalable System for Image Annotation2014
- Author(s)
  Atsushi Kanehira, Masatoshi Hidaka, Yusuke Mukuta, Yuichiro Tsuchiya, Tetsuaki Mano, and Tatsuya Harada
- Organizer
  CLEF 2014 Evaluation Labs and Workshop
- Place of Presentation
  Sheffield, England
- Year and Date
  2014-09-15 – 2014-09-18
- Related Report
  2014 Annual Research Report
[Presentation] Bridging the Gap between Visual Contents and Natural Language2014
- Author(s)
  Tatsuya Harada
- Organizer
  The First Kyoto University-Inamori Foundation Joint Kyoto Prize Symposium
- Place of Presentation
  Kyoto, Japan
- Year and Date
  2014-07-12 – 2014-07-13
- Related Report
  2014 Annual Research Report
- Invited
[Presentation] 確率的偏正準相関分析2013
- Author(s)
  椋田悠介, 原田達也
- Organizer
  信学技報, vol. 113, no. 286, IBISML2013-58
- Place of Presentation
  東京都・東京工業大学
- Related Report
  2013 Annual Research Report
[Presentation] マルチモーダル情報の相互情報量に基づく異常検出に関する研究2013
- Author(s)
  鎌田智恵, 原田達也
- Organizer
  信学技報, vol. 113, no. 196, PRMU2013-48
- Place of Presentation
  鳥取県・鳥取大学
- Related Report
  2013 Annual Research Report
[Presentation] ラベル間の階層構造を考慮したWeb画像アノテーション手法に関する研究2013
- Author(s)
  日高雅俊, 郡司直之, 原田達也
- Organizer
  信学技報, vol. 113, no. 196, PRMU2013-52
- Place of Presentation
  鳥取県・鳥取大学
- Related Report
  2013 Annual Research Report
[Presentation] 大規模コーパスと対話の相互作用を活用した感情遷移推定2012
- Author(s)
  水落大, 原田達也, 國吉康夫
- Organizer
  第30回日本ロボット学会学術講演会
- Place of Presentation
  札幌コンベンションセンター(札幌)
- Year and Date
  2012-09-19
- Related Report
  2012 Annual Research Report
[Presentation] 大規模画像データセットを用いたマルチクラス物体検出器の同時学習～物体毎に特化した負例クラスの導入～2012
- Author(s)
  金崎朝子, 稲葉翔, 牛久祥孝, 山下裕也, 村岡宏是, 原田達也, 國吉康夫
- Organizer
  パターン認識・メディア理解研究会(PRMU)
- Place of Presentation
  東京農工大学(東京)
- Year and Date
  2012-09-03
- Related Report
  2012 Annual Research Report
[Presentation] 学習時間に着目した効率的な大規模画像分類2012
- Author(s)
  稲葉翔, 村岡宏是, 山下裕也, 牛久祥孝, 金崎朝子, 原田達也, 國吉康夫
- Organizer
  画像の認識・理解シンポジウム(MIRU2012)
- Place of Presentation
  福岡国際会議場(福岡)
- Year and Date
  2012-08-08
- Related Report
  2012 Annual Research Report
[Presentation] 多種類の三次元特徴量を用いた物体セグメンテーションの弱教師付き学習2012
- Author(s)
  金崎朝子, 原田達也, 国吉康夫
- Organizer
  画像の認識・理解シンポジウム(MIRU2012)
- Place of Presentation
  福岡国際会議場(福岡)
- Year and Date
  2012-08-08
- Related Report
  2012 Annual Research Report
[Presentation] キーフレーズ推定と文法モデルによる画像説明文生成2012
- Author(s)
  牛久祥孝, 原田達也, 國吉康夫
- Organizer
  画像の認識・理解シンポジウム(MIRU2012)
- Place of Presentation
  福岡国際会議場(福岡)
- Year and Date
  2012-08-06
- Related Report
  2012 Annual Research Report
[Remarks]
- URL
  http://www.mi.t.u-tokyo.ac.jp/#publication
- Related Report
  2014 Annual Research Report
[Remarks] Publication
- URL
  http://www.mi.t.u-tokyo.ac.jp/#publication
- Related Report
  2013 Annual Research Report

Super multi-class object recognition system using a large amount of image data

Principal Investigator

HARADA Tatsuya 東京大学, 情報理工学(系)研究科, 教授 (60345113)

¥26,520,000 (Direct Cost: ¥20,400,000、Indirect Cost: ¥6,120,000)

Report

Research Products

[Journal Article] Three Guidelines of Online Learning for Large-Scale Visual Recognition2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Hard Negative Classes for Multiple Object Detection2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Elastic Net Constraints for Shape Matching2013

Author(s)

Journal Title

Related Report

[Journal Article] Efficient Shape Matching using Vector Extrapolation2013

Author(s)

Journal Title

Related Report

[Journal Article] Weakly-supervised Multi-class Object Detection Using Multi-type 3D Features2013

Author(s)

Journal Title

Related Report

[Journal Article] Causal Flow2012

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Efficient Image Annotation for Automatic Sentence Generation2012

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Visual Anomaly Detection from Small Samples for Mobile Robots2012

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Dialog System Using Real-Time Crowdsourcing and Twitter Large-Scale Corpus2012

Author(s)

Journal Title

Related Report

[Presentation] MIL at ImageCLEF 2014: Scalable System for Image Annotation2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Bridging the Gap between Visual Contents and Natural Language2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 確率的偏正準相関分析2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] マルチモーダル情報の相互情報量に基づく異常検出に関する研究2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] ラベル間の階層構造を考慮したWeb画像アノテーション手法に関する研究2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 大規模コーパスと対話の相互作用を活用した感情遷移推定2012

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 大規模画像データセットを用いたマルチクラス物体検出器の同時学習～物体毎に特化した負例クラスの導入～2012