2019 Fiscal Year Annual Research Report

モダリティごとの不確実性を考慮した共有表現学習の研究

Research Project

Project/Area Number	19K21527
Allocation Type	Multi-year Fund
Research Institution	The University of Tokyo
Principal Investigator	鈴木雅大東京大学, 大学院工学系研究科(工学部), 特任研究員 (30823885)
Project Period (FY)	2019-04-01 – 2020-03-31
Keywords	深層学習 / 共有表現学習 / マルチモーダル学習 / 深層生成モデル
Outline of Annual Research Achievements	本研究では，画像や文書，音声などの複数の異なる種類の情報（マルチモーダル情報）を統合する表現を獲得する共有表現学習（およびマルチモーダル学習）に取り組む． 2019年度は，昨年に引き続き，様々なマルチモーダル情報を扱った研究や，それを実現するためのライブラリ開発を行った．画像の表現をラベル情報という別のモダリティを利用してうまく分離するように学習する研究や，与えられた服画像を利用して人物画像の服を着せ替える研究などを共著で行った．これらは国際学会のワークショップにて発表した．また，これまでの研究は2つのモダリティに限定していたが，確率的生成モデルに基づき，人間のように複数のモダリティを統合可能な大規模な認知アーキテクチャの枠組みを共著で提案した．この成果についてはNew Generation Computingに採録された．上記の研究のいくつかは，本研究のサブ研究の一つとして開発した，深層生成モデルライブラリで実装したものである．このライブラリの開発成果については，2019年度人工知能学会全国大会で発表した他，ロボット分野の国際学会であるIROSワークショップの招待講演にて発表した．研究機関全体を通して，深層生成モデルの枠組みによって，マルチモーダル情報を統合でき，さらにこのアプローチを様々な領域に適用可能であると示すことができた．またそれを行う過程で，深層生成モデルを実装するためのライブラリを開発し，その有効性を示すことができた．

Research Products
(5 results)

All 2020 2019

All Journal Article (1 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 1 results) Presentation (4 results) (of which Int'l Joint Research: 3 results, Invited: 1 results)

[Journal Article] Neuro-SERKET: Development of Integrative Cognitive System Through the Composition of Deep Probabilistic Generative Models2020
- Author(s)
  Taniguchi Tadahiro、Nakamura Tomoaki、Suzuki Masahiro、Kuniyasu Ryo、Hayashi Kaede、Taniguchi Akira、Horii Takato、Nagai Takayuki
- Journal Title
  
  New Generation Computing
  
  Volume: 38 Pages: 23～48
- DOI
  https://doi.org/10.1007/s00354-019-00084-w
- Peer Reviewed / Int'l Joint Research
[Presentation] Pixyz: a framework for developing complex deep generative models2019
- Author(s)
  Masahiro Suzuki
- Organizer
  Workshop on Deep Probabilistic Generative Models for Cognitive Architecture in Robotics (IROS2019)
- Int'l Joint Research / Invited
[Presentation] 深層生成モデルと世界モデル2019
- Author(s)
  鈴木雅大
- Organizer
  第4回統計・機械学習若手シンポジウム
[Presentation] UVTON: UV Mapping to Consider the 3D Structure of a Human in Image-Based Virtual Try-On Network2019
- Author(s)
  Shizuma Kubo, Yusuke Iwasawa, Masahiro Suzuki, Yutaka Matsuo
- Organizer
  Workshop on Computer Vision for Fashion, Art and Design, The IEEE International Conference on Computer Vision (ICCV 2019)
- Int'l Joint Research
[Presentation] Dual Space Learning with Variational Autoencoders2019
- Author(s)
  Hirono Okamoto, Masahiro Suzuki, Itto Higuchi, Shohei Ohsawa, Yutaka Matsuo
- Organizer
  Workshop on Deep Generative Models for Highly Structured Data, International Conference on Learning Representation
- Int'l Joint Research

2019 Fiscal Year Annual Research Report

モダリティごとの不確実性を考慮した共有表現学習の研究

Principal Investigator

鈴木 雅大 東京大学, 大学院工学系研究科(工学部), 特任研究員 (30823885)

Research Products

[Journal Article] Neuro-SERKET: Development of Integrative Cognitive System Through the Composition of Deep Probabilistic Generative Models2020

Author(s)

Journal Title

DOI

[Presentation] Pixyz: a framework for developing complex deep generative models2019

Author(s)

Organizer

[Presentation] 深層生成モデルと世界モデル2019

Author(s)

Organizer

[Presentation] UVTON: UV Mapping to Consider the 3D Structure of a Human in Image-Based Virtual Try-On Network2019

Author(s)

Organizer

[Presentation] Dual Space Learning with Variational Autoencoders2019

Author(s)

Organizer

鈴木雅大東京大学, 大学院工学系研究科(工学部), 特任研究員 (30823885)