2018 Fiscal Year Final Research Report

Multi-Input Deep Learning and Its Application to Video Recognition

Research Project

Project/Area Number	15K16019
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Perceptual information processing
Research Institution	Tokyo Institute of Technology
Principal Investigator	Inoue Nakamasa Tokyo Institute of Technology, School of Computing, Assistant Professor (10733397)
Project Period (FY)	2015-04-01 – 2019-03-31
Keywords	Deep learning / Video recognition
Outline of Final Research Achievements	In this project, we proposed a deep learning method for video recognition. The proposed method is based on vocabulary expansion using word vectors. Its performance is demonstrated on the TRECVID video dataset. We presented this work at ACM Multimedia.
Free Research Field	Multimedia information processing
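Academic Significance and Societal Importance of the Research Achievements	The results of this research concern artificial intelligence technology for recognizing video and images. We showed that combining information from image data and text data improves recognition accuracy. This technology is useful for next-generation search systems that search in detail for what appears in which part of a video.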