2020 Fiscal Year Final Research Report

Construction of mathematical optimization methods for discrete data useful in machine learning algorithms.

Project/Area Number	17K19973
Research Category	Grant-in-Aid for Challenging Research (Exploratory)
Allocation Type	Multi-year Fund
Research Field	Information science, computer engineering, and related fields
Research Institution	Kyoto University
Principal Investigator	Yamamoto Akihiro Kyoto University, Graduate School of Informatics, Professor (30230535)
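Co-Investigator (Kenkyū-buntansha)	Nishino Masaaki Nippon Telegraph and Telephone Corporation, NTT Communication Science Laboratories, Innovative Communication Laboratory, Distinguished Researcher (90794529)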
Project Period (FY)	2017-06-30 – 2021-03-31
Keywords	machine learning / context-free grammar / tree structure / first-order predicate logic / inductive logic programming / least generalization
Outline of Final Research Achievements	Machine learning is now a fundamental technology for processing natural language data. If natural language sentences are converted into numerical vectors and the latest machine learning techniques, such as deep learning, are then applied, the learning results are difficult to interpret. Moreover, there is no guarantee that the natural structure of a sentence is adequately represented by vectors, whose structure is flat. In this study, we developed optimization mathematics and algorithms for machine learning on three kinds of data: parse trees of context-free languages, which are mathematical models of natural language data; sentences in first-order predicate logic; and patterns, which are direct algebraic representations of word sequences.
Free Research Field	Intelligent informatics
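Academic Significance and Societal Importance of the Research Achievements	Machine learning has become a fundamental technology for processing natural language data. In particular, methods that convert natural language data into vectors of natural numbers and then apply the latest machine learning techniques, such as deep learning, are achieving major results. However, the results learned by deep learning are difficult to interpret, and there is no guarantee that the natural structure of a sentence can be adequately represented by the flat structure of a vector. The machine learning algorithms treated in this study operate directly on natural language data as word sequences, or on parse trees extracted from such data, and are therefore expected to output results that express interpretable structure.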
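
The keywords above include least generalization and first-order predicate logic. As a purely illustrative aid, the following is a minimal sketch of the classical notion of the least general generalization of two first-order terms (anti-unification in the style of Plotkin). It is not the project's own algorithm. The term encoding (strings for constants, tuples for compound terms), the function name `lgg`, and the generated variable names `X0`, `X1`, ... are all assumptions made for this example, and the inputs are assumed not to already contain names of the form `X0`, `X1`, ....

```python
# Illustrative encoding (an assumption, not taken from the report):
#   a constant is a str, e.g. "a"
#   a compound term is a tuple (functor, arg1, ..., argn), e.g. ("f", "a")

def lgg(s, t, table=None):
    """Return a least general generalization of the terms s and t."""
    if table is None:
        table = {}
    # Identical terms generalize to themselves.
    if s == t:
        return s
    # Same functor and same arity: generalize argument by argument.
    if (isinstance(s, tuple) and isinstance(t, tuple)
            and s[0] == t[0] and len(s) == len(t)):
        return (s[0],) + tuple(lgg(a, b, table) for a, b in zip(s[1:], t[1:]))
    # Otherwise replace the mismatching pair with a variable, reusing the
    # same variable whenever the same pair of subterms occurs again.
    if (s, t) not in table:
        table[(s, t)] = f"X{len(table)}"
    return table[(s, t)]


# p(a, f(a)) and p(b, f(b)) generalize to p(X0, f(X0)).
result = lgg(("p", "a", ("f", "a")), ("p", "b", ("f", "b")))
print(result)
```

Reusing one variable for each distinct pair of mismatching subterms is what keeps the result least general: both occurrences of the pair (a, b) map to the same variable, so the output still records that the two argument positions agree, which is the kind of interpretable structure the summary refers to.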