Super multi-class object recognition system using a large amount of image data

Research Project

Project/Area Number 24680017
Research Category

Grant-in-Aid for Young Scientists (A)

Allocation Type Partial Multi-year Fund
Research Field Perception information processing/Intelligent robotics
Research Institution The University of Tokyo

Principal Investigator

HARADA Tatsuya  The University of Tokyo, Graduate School of Information Science and Technology, Professor (60345113)

Project Period (FY) 2012-04-01 – 2015-03-31
Project Status Completed (Fiscal Year 2014)
Budget Amount
¥26,520,000 (Direct Cost: ¥20,400,000, Indirect Cost: ¥6,120,000)
Fiscal Year 2014: ¥8,710,000 (Direct Cost: ¥6,700,000, Indirect Cost: ¥2,010,000)
Fiscal Year 2013: ¥11,310,000 (Direct Cost: ¥8,700,000, Indirect Cost: ¥2,610,000)
Fiscal Year 2012: ¥6,500,000 (Direct Cost: ¥5,000,000, Indirect Cost: ¥1,500,000)
Keywords Image recognition / Computer vision / Machine learning / Artificial intelligence / Big data / Pattern recognition
Outline of Final Research Achievements

The goal of this research is to construct a super multi-class generic object recognition system by statistically learning the relationship between large amounts of image and text data. Realizing such a system requires a method that can keep learning classifiers from a huge amount of data without breaking down. When an image contains many objects, the system must recognize both what they are and where they are. Building a high-quality training dataset is expensive, so reducing the construction cost is also crucial. Moreover, finding novel classes is a bottleneck for a continuously growing recognition system. This research addressed each of these topics and produced results on them.

Report

(4 results)
  • 2014 Annual Research Report / Final Research Report (PDF)
  • 2013 Annual Research Report
  • 2012 Annual Research Report
  • Research Products

    (21 results)

Journal Article (9 results) (of which Peer Reviewed: 9 results, Open Access: 2 results) Presentation (10 results) (of which Invited: 1 result) Remarks (2 results)

  • [Journal Article] Three Guidelines of Online Learning for Large-Scale Visual Recognition (2014)

    • Author(s)
      Yoshitaka Ushiku, Masatoshi Hidaka, Tatsuya Harada
    • Journal Title

      The Twenty-Seventh IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2014)

      Volume: 1 Pages: 3574-3581

    • DOI

      10.1109/cvpr.2014.457

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Hard Negative Classes for Multiple Object Detection (2014)

    • Author(s)
      Asako Kanezaki, Sho Inaba, Yoshitaka Ushiku, Yuya Yamashita, Hiroshi Muraoka, Yasuo Kuniyoshi, Tatsuya Harada
    • Journal Title

      2014 IEEE International Conference on Robotics and Automation (ICRA 2014)

      Volume: 1 Pages: 3066-3073

    • DOI

      10.1109/icra.2014.6907300

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Elastic Net Constraints for Shape Matching (2013)

    • Author(s)
      Emanuele Rodola, Andrea Torsello, Tatsuya Harada, Yasuo Kuniyoshi, Daniel Cremers
    • Journal Title

      The 14th International Conference on Computer Vision (ICCV 2013)

      Volume: 1 Pages: 1169-1176

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Efficient Shape Matching using Vector Extrapolation (2013)

    • Author(s)
      Emanuele Rodola, Tatsuya Harada, Yasuo Kuniyoshi, and Daniel Cremers
    • Journal Title

      The British Machine Vision Conference (BMVC 2013)

      Volume: 1

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Weakly-supervised Multi-class Object Detection Using Multi-type 3D Features (2013)

    • Author(s)
      Asako Kanezaki, Yasuo Kuniyoshi, and Tatsuya Harada
    • Journal Title

      The 21st Annual ACM International Conference on Multimedia (ACMMM 2013)

      Volume: 1 Pages: 605-608

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Causal Flow (2012)

    • Author(s)
      Yuya Yamashita, Tatsuya Harada, and Yasuo Kuniyoshi
    • Journal Title

      IEEE Transactions on Multimedia

      Volume: 14 Issue: 3 Pages: 619-629

    • DOI

      10.1109/tmm.2012.2191396

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Efficient Image Annotation for Automatic Sentence Generation (2012)

    • Author(s)
      Yoshitaka Ushiku, Tatsuya Harada, and Yasuo Kuniyoshi
    • Journal Title

      The 20th Annual ACM International Conference on Multimedia (ACMMM 2012)

      Pages: 549-558

    • DOI

      10.1145/2393347.2393424

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Visual Anomaly Detection from Small Samples for Mobile Robots (2012)

    • Author(s)
      Hiroharu Kato, Tatsuya Harada, and Yasuo Kuniyoshi
    • Journal Title

      IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2012)

      Pages: 3171-3178

    • DOI

      10.1109/iros.2012.6386031

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Dialog System Using Real-Time Crowdsourcing and Twitter Large-Scale Corpus (2012)

    • Author(s)
      Fumihiro Bessho, Tatsuya Harada and Yasuo Kuniyoshi
    • Journal Title

      Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2012)

      Pages: 227-231

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Presentation] MIL at ImageCLEF 2014: Scalable System for Image Annotation (2014)

    • Author(s)
      Atsushi Kanehira, Masatoshi Hidaka, Yusuke Mukuta, Yuichiro Tsuchiya, Tetsuaki Mano, and Tatsuya Harada
    • Organizer
      CLEF 2014 Evaluation Labs and Workshop
    • Place of Presentation
      Sheffield, England
    • Year and Date
      2014-09-15 – 2014-09-18
    • Related Report
      2014 Annual Research Report
  • [Presentation] Bridging the Gap between Visual Contents and Natural Language (2014)

    • Author(s)
      Tatsuya Harada
    • Organizer
      The First Kyoto University-Inamori Foundation Joint Kyoto Prize Symposium
    • Place of Presentation
      Kyoto, Japan
    • Year and Date
      2014-07-12 – 2014-07-13
    • Related Report
      2014 Annual Research Report
    • Invited
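  • [Presentation] Probabilistic Partial Canonical Correlation Analysis (2013)

    • Author(s)
      Yusuke Mukuta, Tatsuya Harada
    • Organizer
      IEICE Technical Report, vol. 113, no. 286, IBISML2013-58
    • Place of Presentation
      Tokyo Institute of Technology, Tokyo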
    • Related Report
      2013 Annual Research Report
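  • [Presentation] A Study on Anomaly Detection Based on Mutual Information of Multimodal Information (2013)

    • Author(s)
      鎌田智恵, Tatsuya Harada
    • Organizer
      IEICE Technical Report, vol. 113, no. 196, PRMU2013-48
    • Place of Presentation
      Tottori University, Tottori Prefecture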
    • Related Report
      2013 Annual Research Report
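  • [Presentation] A Study on a Web Image Annotation Method Considering the Hierarchical Structure among Labels (2013)

    • Author(s)
      Masatoshi Hidaka, 郡司直之, Tatsuya Harada
    • Organizer
      IEICE Technical Report, vol. 113, no. 196, PRMU2013-52
    • Place of Presentation
      Tottori University, Tottori Prefecture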
    • Related Report
      2013 Annual Research Report
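  • [Presentation] Emotion Transition Estimation Utilizing a Large-Scale Corpus and Dialogue Interaction (2012)

    • Author(s)
      水落大, Tatsuya Harada, Yasuo Kuniyoshi
    • Organizer
      The 30th Annual Conference of the Robotics Society of Japan
    • Place of Presentation
      Sapporo Convention Center (Sapporo)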
    • Year and Date
      2012-09-19
    • Related Report
      2012 Annual Research Report
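  • [Presentation] Simultaneous Learning of Multi-class Object Detectors Using a Large-Scale Image Dataset: Introducing Object-Specific Negative Classes (2012)

    • Author(s)
      Asako Kanezaki, Sho Inaba, Yoshitaka Ushiku, Yuya Yamashita, Hiroshi Muraoka, Tatsuya Harada, Yasuo Kuniyoshi
    • Organizer
      Technical Committee on Pattern Recognition and Media Understanding (PRMU)
    • Place of Presentation
      Tokyo University of Agriculture and Technology (Tokyo)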
    • Year and Date
      2012-09-03
    • Related Report
      2012 Annual Research Report
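  • [Presentation] Efficient Large-Scale Image Classification Focusing on Training Time (2012)

    • Author(s)
      Sho Inaba, Hiroshi Muraoka, Yuya Yamashita, Yoshitaka Ushiku, Asako Kanezaki, Tatsuya Harada, Yasuo Kuniyoshi
    • Organizer
      Meeting on Image Recognition and Understanding (MIRU2012)
    • Place of Presentation
      Fukuoka International Congress Center (Fukuoka)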
    • Year and Date
      2012-08-08
    • Related Report
      2012 Annual Research Report
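  • [Presentation] Weakly Supervised Learning of Object Segmentation Using Multiple Types of 3D Features (2012)

    • Author(s)
      Asako Kanezaki, Tatsuya Harada, Yasuo Kuniyoshi
    • Organizer
      Meeting on Image Recognition and Understanding (MIRU2012)
    • Place of Presentation
      Fukuoka International Congress Center (Fukuoka)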
    • Year and Date
      2012-08-08
    • Related Report
      2012 Annual Research Report
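  • [Presentation] Image Description Sentence Generation by Key Phrase Estimation and a Grammar Model (2012)

    • Author(s)
      Yoshitaka Ushiku, Tatsuya Harada, Yasuo Kuniyoshi
    • Organizer
      Meeting on Image Recognition and Understanding (MIRU2012)
    • Place of Presentation
      Fukuoka International Congress Center (Fukuoka)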
    • Year and Date
      2012-08-06
    • Related Report
      2012 Annual Research Report
  • [Remarks]

    • URL

      http://www.mi.t.u-tokyo.ac.jp/#publication

    • Related Report
      2014 Annual Research Report
  • [Remarks] Publication

    • URL

      http://www.mi.t.u-tokyo.ac.jp/#publication

    • Related Report
      2013 Annual Research Report


Published: 2012-04-24   Modified: 2019-07-29  
