Development of a Method to Combine Deep Learning and Symbolic Reasoning and its Application to Machine Reading Comprehension

Research Project

Project/Area Number	20K23314
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Multi-year Fund
Review Section	1001:Information science, computer engineering, and related fields
Research Institution	Tohoku University
Principal Investigator	Yoshikawa Masashi 東北大学, タフ・サイバーフィジカルAI研究センター, 助教 (80883470)
Project Period (FY)	2020-09-11 – 2022-03-31
Project Status	Completed (Fiscal Year 2021)
Budget Amount *help	¥2,470,000 (Direct Cost: ¥1,900,000、Indirect Cost: ¥570,000) Fiscal Year 2021: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000) Fiscal Year 2020: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Keywords	自然言語処理 / 深層学習 / 文章読解 / 機械学習 / 記号推論 / 数量推論 / 記号処理 / 言語推論
Outline of Research at the Start	深層学習は大量の教師データと誤差逆伝播法を用いたend-to-end学習により、文章読解を含む多くの自然言語処理課題において高い性能を獲得している。しかし、このようなシステムには扱いが困難な言語の推論タイプが存在し、文章読解における数量推論はその典型例である。本研究では、数量推論を深層学習モデルの微分可能関数内部で行わせるのではなく、数量計算に必要な項を抽出して外部のシンボリックなプログラムに計算させながら、システム全体はend-to-endに学習する仕組みを開発することで、パラメータ効率が良く、モデルの挙動により透明性を持つ深層文章読解モデルを構築する。
Outline of Final Research Achievements	We believe the key to advance further the state of deep learning (DL)-based natural language processing (NLP) is combining its technologies with those of symbolic reasoning. By the combination, it will be possible to develop an AI system that is (1) more data-efficient, and (2) more robust to variations of texts (e.g., domains), with (3) the more visible inference process essential to symbols. However, as a major key to the success of DL is end-to-end training using backpropagation, it is challenging to incorporate discrete symbolic function within a neural network. In this project, we propose a simple extension of the Gumbel-Softmax trick to overcome the problem and enable gradient-based learning of such a neuro-symbolic system. Using the method, we tackle numerical reasoning, which is an unsolved problem in DL-based NLP. We approach this problem by incorporating an arithmetic calculator layer within a DL-based reasoning model.
Academic Significance and Societal Importance of the Research Achievements	深層学習言語モデルの高い推論能力が喧伝される一方で、少し入力を操作するだけで思わぬ挙動を示すなど脆弱性も分野内でよく知られている。記号推論の仕組みを外部知識として組み込むことで、深層モデルのブラックボックス性を軽減でき、推論過程の透明性のみならず、期待される推論過程を教え込むことでより頑健なシステムを構築することが可能になる。数量は言語中に頻出である一方で自然言語処理では見過ごされがちである。金融文章の解析など応用にも直結する課題であるが、本研究の取り組みにより数量処理に大きな前進をもたらすことが期待される。

Report

(3 results)

2021 Annual Research Report Final Research Report ( PDF )
2020 Research-status Report

Research Products
(6 results)

All 2022 2021

All Journal Article (1 results) (of which Peer Reviewed: 1 results, Open Access: 1 results) Presentation (5 results)

[Journal Article] Instance-Based Neural Dependency Parsing2021
- Author(s)
  Ouchi Hiroki、Suzuki Jun、Kobayashi Sosuke、Yokoi Sho、Kuribayashi Tatsuki、Yoshikawa Masashi、Inui Kentaro
- Journal Title
  
  Transactions of the Association for Computational Linguistics
  
  Volume: 9 Pages: 1493-1507
- DOI
  10.1162/tacl_a_00439
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] End-to-End 学習可能な記号処理層の検討と数量推論への応用における課題の分析2022
- Author(s)
  吉川将司, Benjamin Heinzerling, 乾健太郎
- Organizer
  言語処理学会第28回年次大会
- Related Report
  2021 Annual Research Report
[Presentation] 四則演算を用いた Transformer の再帰的構造把握能力の調査2022
- Author(s)
  松本悠太, 吉川将司, Benjamin Heinzerling, 乾健太郎
- Organizer
  言語処理学会第28回年次大会
- Related Report
  2021 Annual Research Report
[Presentation] 多段の数量推論タスクに対する適応的なモデルの振る舞いの検証2022
- Author(s)
  青木洋一, 工藤慧音, Ana Brassard, 栗林樹生, 吉川将司, 乾健太郎
- Organizer
  言語処理学会第28回年次大会
- Related Report
  2021 Annual Research Report
[Presentation] 数量推論モデルに対する人工データを活用した表層的な手がかりに頑健な評価法の検討2021
- Author(s)
  工藤慧音, 青木洋一, 吉川将司, 乾健太郎
- Organizer
  NLP若手の会第16回シンポジウム
- Related Report
  2021 Annual Research Report
[Presentation] 離散記号処理に対する近似的な微分構造の考察と数量推論を要する文章読解問題への応用2021
- Author(s)
  吉川将司, 乾健太郎
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Research-status Report

Development of a Method to Combine Deep Learning and Symbolic Reasoning and its Application to Machine Reading Comprehension

Principal Investigator

Yoshikawa Masashi 東北大学, タフ・サイバーフィジカルAI研究センター, 助教 (80883470)

¥2,470,000 (Direct Cost: ¥1,900,000、Indirect Cost: ¥570,000)

Report

Research Products

[Journal Article] Instance-Based Neural Dependency Parsing2021

Author(s)

Journal Title

DOI

Related Report

[Presentation] End-to-End 学習可能な記号処理層の検討と数量推論への応用における課題の分析2022

Author(s)

Organizer

Related Report

[Presentation] 四則演算を用いた Transformer の再帰的構造把握能力の調査2022

Author(s)

Organizer

Related Report

[Presentation] 多段の数量推論タスクに対する適応的なモデルの振る舞いの検証2022

Author(s)

Organizer

Related Report

[Presentation] 数量推論モデルに対する人工データを活用した表層的な手がかりに頑健な評価法の検討2021

Author(s)

Organizer

Related Report

[Presentation] 離散記号処理に対する近似的な微分構造の考察と数量推論を要する文章読解問題への応用2021

Author(s)

Organizer

Related Report