Image Reconstruction from Bag of Visual Words using Large-scale Image Data
Project/Area Number |
26540079
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Perceptual information processing
|
Research Institution | The University of Tokyo |
Principal Investigator |
Harada Tatsuya 東京大学, 情報理工学(系)研究科, 教授 (60345113)
|
Project Period (FY) |
2014-04-01 – 2016-03-31
|
Project Status |
Completed (Fiscal Year 2015)
|
Budget Amount *help |
¥3,640,000 (Direct Cost: ¥2,800,000、Indirect Cost: ¥840,000)
Fiscal Year 2015: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Fiscal Year 2014: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
|
Keywords | コンピュータビジョン / 機械学習 / 画像復元 / 画像認識 / 人工知能 |
Outline of Final Research Achievements |
The objective of this study is to reconstruct images from Bag-of-Visual-Words (BoVW). BoVW is defined here as a histogram of quantized local descriptors extracted densely on a regular grid at a single scale. This task is challenging for two reasons: 1) BoVW includes quantization errors. 2) BoVW lacks spatial information of local descriptors. To tackle this difficult task, we use a large-scale image database to estimate the spatial arrangement of local descriptors. Then this task creates a jigsaw puzzle problem with adjacency and global location costs of visual words. Solving this optimization problem is also challenging because it is known as an NP-Hard problem. We propose a heuristic but efficient method to optimize it. To underscore the effectiveness of our method, we apply it to BoVWs extracted from about 100 different categories and demonstrate that it can reconstruct the original images.
|
Report
(3 results)
Research Products
(6 results)