2017 Fiscal Year Annual Research Report

Deep semantic annotation of video contents

Research Project

Project/Area Number	17H01831
Research Institution	Waseda University
Principal Investigator	林良彦早稲田大学, 理工学術院, 教授(任期付) (80379156)
Co-Investigator(Kenkyū-buntansha)	加藤恒昭東京大学, 大学院総合文化研究科, 教授 (60334299) 小川哲司早稲田大学, 理工学術院, 准教授 (70386598) 植木一也明星大学, 情報学部, 准教授 (80580638)
Project Period (FY)	2017-04-01 – 2021-03-31
Keywords	情報資源の構築・管理 / 動画 / 意味的注釈 / 動詞意味論 / オントロジー / シーングラフ生成 / キャプション生成
Outline of Annual Research Achievements	本研究課題の目的は，画像・動画処理技術と言語・知識処理技術を統合的に用いることにより，動画中に描写されている，エージェントによる意味ある動作区間を検出し，その内容を表す意味注釈を付与することにある．本研究計画の初年度である2017年度は，以下の項目についての検討を進めた． (1) フレーム画像からのシーングラフの生成: 動画におけるフレーム画像に描写されている物体，および，それらの間の関係をシーングラフと呼ぶグラフ構造として抽出することは，動作区間の検出・記述における基本技術である．計算量を押さえつつ，膨大な組み合わせとなる物体識別，この結果を利用した関係識別に取り組み，それぞれ有用な結果を得るとともに，これらを統合的に行う方式の検討を進めた．(国際会議論文 1件，国内会議論文 1件) (2) クエリに基づく動画検索: 動画中に描写される可能性のある物体やその動作などを識別する研究を継続的に研究し，アドホック動画検索タスク (TRECVID AVSタスク) の文脈において検証し，良好な結果を確認した．また，(3)の言語の意味表現に関する基礎研究から得た知見をもとに，検索クエリにおける多義語の語義解消の検索精度向上に与える効果を評価した．(国際会議論文 1件，国内会議論文 2件) (3) 言語の意味表現に関する基礎研究: 深層学習時代における言語学的知識・知見の活用について包括的に再検討した．また，言語情報を活用したゼロショット物体認識，語義・概念の分散表現の意味関係分類や未知語意味推定などへの応用，非テキストモダリティの情報を加味した意味表現に関する検討などを進めた他，本研究課題の主要な課題の一つである動詞のもつ機能・意味に関する分析について有益な指針を得た．(国内誌招待論文 1件，国際会議論文 7件，国内会議論文 8件)
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 当初計画していた，画像に対する意味的注釈付与のためのオントロジー体系に構築については，大規模な注釈付き画像データベースに対するボトムアップ的な分析を進めたが，データに含まれるノイズやバイアスのために有用な結果を得ることはできていない．その一方で，アドホック動画検索との技術的な共通性に注目することにより，動画(フレーム画像群)に対して，多数の識別器を適用することによる物体・シーン・動作の検出について進展が得られた．またそこにおいて，言語の意味に関する基礎的研究の成果が適用できることが確認できた．以上より，総じて概ね順調に進展したと評価する．
Strategy for Future Research Activity	本研究課題の申請時からの研究展開の顕著な方向性として，深層学習を応用した画像・動画からのキャプション生成技術が進展してきたことが挙げられる．そこで，申請時の方針 (シーングラフを時間方向に展開して動作の意味記述を得る) に加え，まずキャプションを言語生成し，それから意味記述を求める手法を並行して検討する．この方法では，評定者によって付与されたキャプションを学習データとするため，そのバイアスに強く影響されるという問題があるが，一方で学習データが得やすいという利点もある．以上から，本研究計画の2年目となる2018年度は以下の方針による研究を推進する． (1) 動画からの動作区間の抽出とキャプション生成: 現在，大きな進展を見せつつある動作に関する画像データ (ActivityNet) を利用し，動作区間の検出・分類，これを制約として利用する動作キャプションの生成に関する研究を立ち上げる． (2) キーとなるフレーム画像からのシーングラフ生成の高度化: 本年度までの成果に基づき，計算効率を保ちつつシーングラフ生成の精度を改善する方式の研究を進める．このために，知識グラフや画像の描写するシーン分類などの先験的・体系的な知識の利用法を明らかにする． (3) 意味・知識基盤技術の研究の継続: 特に動作を表す動詞の時間的構造，意味注釈に置いて利用するオントロジー体系，オブジェクト・エンティティの意味属性・制約に関する基礎基盤的研究をさらに強化する．

Research Products
(23 results)

All 2018 2017

All Journal Article (1 results) Presentation (20 results) (of which Int'l Joint Research: 9 results) Book (2 results)

[Journal Article] 言語学とAI2017
- Author(s)
  林良彦
- Journal Title
  
  人工知能学会誌
  
  Volume: 32 Pages: 384--393
[Presentation] Undersampling Improves Hypernymy Prototypicality Learning2018
- Author(s)
  Koki Washio and Tsuneaki Kato
- Organizer
  LREC 2018 (accepted, May 2018)
- Int'l Joint Research
[Presentation] Filling Missing Paths: Modeling Co-occurrences of Word Pairs and Dependency Paths for Recognizing Lexical Semantic Relations2018
- Author(s)
  Koki Washio and Tsuneaki Kato
- Organizer
  NAACL HLT 2018 (accepted June 2018)
- Int'l Joint Research
[Presentation] Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation2018
- Author(s)
  Mika Hasegawa, Tetsunori Kobayashi and Yoshihiko Hayashi
- Organizer
  LREC 2018 (accepted, May 2018)
- Int'l Joint Research
[Presentation] Speaker invariant feature extraction for zero-resource languages with adversarial training2018
- Author(s)
  Taira Tsuchiya, Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa
- Organizer
  ICASSP2018 (accepted, April 2018)
- Int'l Joint Research
[Presentation] クエリ文を用いた詳細映像検索 -TRECVID 2017 AVSタスクの成果報告-2018
- Author(s)
  植木一也，平川幸司，菊池康太郎，小林哲則
- Organizer
  動的画像処理実用化ワークショップ(DIA2018) (March 2018)
[Presentation] 動詞語義の階層的分類に関する一考察2018
- Author(s)
  加藤恒昭
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] FrameNetを利用した談話関係の認識2018
- Author(s)
  李凌寒, 加藤恒昭
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] 単語ペアと依存構造パスの共起モデリングを用いた語の意味関係の分類2018
- Author(s)
  鷲尾光樹, 加藤恒昭
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] 語義・概念の分散表現を利用したSemantic Taxonomy Enrichment2018
- Author(s)
  金田健太郎, 小林哲則, 林良彦
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] 共起性を利用した物体認識における言語情報の有効性2018
- Author(s)
  黒澤郁音, 菊池康太郎, 小林哲則, 林良彦
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] ゼロショット物体認識における辞書定義文の援用2018
- Author(s)
  菊池康太郎, 林良彦, 小林哲則
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] クエリ中の単語の語義絞り込みによる動画検索精度の向上2018
- Author(s)
  平川幸司, 菊池康太郎, 植木一也, 林良彦, 小林哲則
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] 機械学習による単語概念の意味属性推定2018
- Author(s)
  長谷川美夏, 小林哲則, 林良彦
- Organizer
  第24回言語処理学会年次大会 (March 2018)
[Presentation] Classifying Lexical-semantic Relationships by Exploiting Sense/Concept Representations2017
- Author(s)
  Kentaro Kanada, Tetsunori Kobayashi and Yoshihiko Hayashi
- Organizer
  Workshop on Sense, Concept and Entity Representations and their Applications (SENSE) (April 2017)
- Int'l Joint Research
[Presentation] Incorporating visual features into word embeddings: A bimodal autoencoder-based approach2017
- Author(s)
  Mika Hasegawa, Tetsunori Kobayashi and Yoshihiko Hayashi
- Organizer
  IWCS 2017 (September 2017)
- Int'l Joint Research
[Presentation] Waseda_Meisei at TRECVID 2017: Ad-hoc Video Search2017
- Author(s)
  Kazuya Ueki, Koji Hirakawa, Kotaro Kikuchi, Tetsuji Ogawa and Tetsunori Kobayashi
- Organizer
  TRECVID 2017 (November 2017)
- Int'l Joint Research
[Presentation] A Neural Network Model for Detecting Inter-object Relationships2017
- Author(s)
  Ikuto Kurosawa, Tatsunori Kobayashi and Yoshihiko Hayashi
- Organizer
  CVPR Language and Vision workshop (July 2017)
- Int'l Joint Research
[Presentation] Word Vector Augmentation by its Definition for Zero-shot Image Classification2017
- Author(s)
  Kotaro Kikuchi, Naohiro Tawara, Tatsunori Kobayashi and Yoshihiko Hayashi
- Organizer
  CVPR Language and Vision workshop (July 2017)
- Int'l Joint Research
[Presentation] 単語、語義、概念：意味タスクにおける分散表現の適用性2017
- Author(s)
  金田健太郎，小林哲則，林良彦
- Organizer
  人工知能学会全国大会 (June 2017)
[Presentation] 辞書定義文を用いたゼロショット一般物体認識2017
- Author(s)
  菊池康太郎，俵直弘，小林哲則
- Organizer
  画像の認識・理解シンポジウム (MIRU) (August 2017)
[Book] Springer2017
- Author(s)
  van Erp, M., Hellmann, S., McCrae, J.P., Chiarcos, C., Choi, K.-S., Gracia, J., Hayashi, Y., Koide, S., Mendes, P., Paulheim, H., Takeda, H. (Eds.)
- Total Pages
  152
- Publisher
  Knowledge Graphs and Language Technology
- ISBN
  978-3-319-68723-0
[Book] 統計的自然言語処理の基礎2017
- Author(s)
  Christopher D.Manning、Hinrich Schutze、加藤恒昭、菊井玄一郎、林良彦、森辰則
- Total Pages
  640
- Publisher
  共立出版
- ISBN
  978-4320124219

2017 Fiscal Year Annual Research Report

Deep semantic annotation of video contents

Principal Investigator

林 良彦 早稲田大学, 理工学術院, 教授(任期付) (80379156)

Current Status of Research Progress

Reason

Research Products

[Journal Article] 言語学とAI2017

Author(s)

Journal Title

[Presentation] Undersampling Improves Hypernymy Prototypicality Learning2018

Author(s)

Organizer

[Presentation] Filling Missing Paths: Modeling Co-occurrences of Word Pairs and Dependency Paths for Recognizing Lexical Semantic Relations2018

Author(s)

Organizer

[Presentation] Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation2018

Author(s)

Organizer

[Presentation] Speaker invariant feature extraction for zero-resource languages with adversarial training2018

Author(s)

Organizer

[Presentation] クエリ文を用いた詳細映像検索 -TRECVID 2017 AVSタスクの成果報告-2018

Author(s)

Organizer

[Presentation] 動詞語義の階層的分類に関する一考察2018

Author(s)

Organizer

[Presentation] FrameNetを利用した談話関係の認識2018

Author(s)

Organizer

[Presentation] 単語ペアと依存構造パスの共起モデリングを用いた語の意味関係の分類2018

Author(s)

Organizer

[Presentation] 語義・概念の分散表現を利用したSemantic Taxonomy Enrichment2018

Author(s)

Organizer

[Presentation] 共起性を利用した物体認識における言語情報の有効性2018

Author(s)

Organizer

[Presentation] ゼロショット物体認識における辞書定義文の援用2018

Author(s)

Organizer

[Presentation] クエリ中の単語の語義絞り込みによる動画検索精度の向上2018

Author(s)

Organizer

[Presentation] 機械学習による単語概念の意味属性推定2018

Author(s)

Organizer

[Presentation] Classifying Lexical-semantic Relationships by Exploiting Sense/Concept Representations2017

Author(s)

Organizer

[Presentation] Incorporating visual features into word embeddings: A bimodal autoencoder-based approach2017

Author(s)

Organizer

[Presentation] Waseda_Meisei at TRECVID 2017: Ad-hoc Video Search2017

Author(s)

Organizer

[Presentation] A Neural Network Model for Detecting Inter-object Relationships2017

Author(s)

Organizer

[Presentation] Word Vector Augmentation by its Definition for Zero-shot Image Classification2017

Author(s)

Organizer

[Presentation] 単語、語義、概念：意味タスクにおける分散表現の適用性2017

Author(s)

Organizer

[Presentation] 辞書定義文を用いたゼロショット一般物体認識2017

Author(s)

Organizer

[Book] Springer2017

Author(s)

Total Pages

Publisher

ISBN

[Book] 統計的自然言語処理の基礎2017

Author(s)

Total Pages

Publisher

ISBN

林良彦早稲田大学, 理工学術院, 教授(任期付) (80379156)