Detecting phrase similarity based on compositional symbolic computation and statistical knowledge acquisition using a large-scale corpus.
Project/Area Number |
15K16045
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Multi-year Fund |
Research Field |
Intelligent informatics
|
Research Institution | Institute of Physical and Chemical Research (2018) Tohoku University (2015-2017) |
Principal Investigator |
Matsubayashi Yuichiroh 国立研究開発法人理化学研究所, 革新知能統合研究センター, 研究員 (20582901)
|
Project Period (FY) |
2015-04-01 – 2019-03-31
|
Project Status |
Completed (Fiscal Year 2018)
|
Budget Amount *help |
¥4,030,000 (Direct Cost: ¥3,100,000、Indirect Cost: ¥930,000)
Fiscal Year 2017: ¥910,000 (Direct Cost: ¥700,000、Indirect Cost: ¥210,000)
Fiscal Year 2016: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2015: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
|
Keywords | 自然言語処理 / 人工知能 / 項構造解析 / 談話 / 知識 / 文の意味構造解析 / 省略解析 / 意味構造解析 / 文章生成 / 項構造アノテーション / 多言語翻訳 / 方言翻訳 / 概念構造 / 類犠牲認識 / 分散意味論 / 選択選考モデル / 時間関係認識 |
Outline of Final Research Achievements |
In this research, in order to establish a semantic parser that can handle various linguistic expressions robustly, we have aimed to build models that can handle semantics of rather longer expressions than words, such as phrases, sentences, and discourse structures of multiple sentences. Specifically, employing distributed word representations as a key technology, we (1) improved semantic structure analysis models for phrases including multiple events, (2) improved semantic models that can handle inter-sentential semantic structures, and (3) proposed computational models that incorporate linguistic and non-linguistic multimodal structures. The contribution in this research project was presented in 26 publications including 11 peer-reviewed papers.
|
Academic Significance and Societal Importance of the Research Achievements |
(1)で開発した意味解析器では、現状の日本語解析における重大なボトルネックである「省略された内容を補う」解析の精度を当初の40%強から60%弱まで飛躍的に向上させ実用レベルに近づけた。開発したシステムは一般公開し、実世界テキスト解析に適用可能である。(2)では文の解析に先行文脈の情報を利用するという近年取り組みが減っていたアイデアに再注目、ハイライトした。このアイデアに基づく研究の数は徐々に増えている。(3) では、音楽、絵本の2つの題材を取り上げ、言語の構造とそれを取り巻く情報の構造を相互に考慮することが言語構造の推定に重要な役割を果たすことを示し、分野での先行事例としての役割を果たした。
|
Report
(5 results)
Research Products
(25 results)
-
[Journal Article] Modeling Storylines in Lyrics2018
Author(s)
Kento WATANABE, Yuichiroh MATSUBAYASHI, Kentaro INUI, Satoru FUKAYAMA, Tomoyasu NAKANO, Masataka GOTO
-
Journal Title
IEICE Transactions on Information and Systems
Volume: E101.D
Issue: 4
Pages: 1167-1179
DOI
NAID
ISSN
0916-8532, 1745-1361
Related Report
Peer Reviewed / Open Access
-
-
-
-
-
-
-
-
[Presentation] A Melody-conditioned Lyrics Language Model2018
Author(s)
Kento Watanabe, Yuichiroh Matsubayashi, Satoru Fukayama, Masataka Goto, Kentaro Inui and Tomoyasu Nakano
Organizer
The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Related Report
-
[Presentation] A Melody-conditioned Lyrics Language Model2018
Author(s)
Kento Watanabe, Yuichiroh Matsubayashi, Satoru Fukayama, Masataka Goto, Kentaro Inui and Tomoyasu Nakano
Organizer
The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)
Related Report
Int'l Joint Research
-
-
-
-
-
-
-
-
-
-
-
-
[Presentation] Modeling Discourse Segments in Lyrics Using Repeated Patterns2016
Author(s)
Kento Watanabe, Yuichiroh Matsubayashi, Naho Orita, Naoaki Okazaki, Kentaro Inui, Satoru Fukayama, Tomoyasu Nakano, Jordan B. L. Smith and Masataka Goto
Organizer
International Conference on Computational Linguistics
Place of Presentation
Osaka International Convention Center, Japan
Year and Date
2016-12-13
Related Report
Int'l Joint Research
-
-
[Presentation] 分散表現に基づく選択選好モデルの文脈化2016
Author(s)
大野雅之, 井之上直也, 松林優一郎, 岡崎直観, 乾健太郎
Organizer
情報処理学会 自然言語処理研究会報告
Place of Presentation
株式会社ミクシィ 住友不動産渋谷ファーストタワー 7F
Year and Date
2016-01-22
Related Report
-