2016 年度実施状況報告書

統辞・意味解析情報タグ付き日本語ツリーバンクからの視覚意味情報の抽出と応用

研究課題

研究課題/領域番号	15K02469
研究機関	大学共同利用機関法人人間文化研究機構国立国語研究所
研究代表者	バトラーアラステア大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, プロジェクト非常勤研究員 (90588873)
研究期間 (年度)	2015-04-01 – 2018-03-31
キーワード	コーパス / 日本語 / 意味論 / 統語論
研究実績の概要	The research aims to develop methods of visualising and making accessible semantic information, e.g., predicate argument information, but also higher levels of analysis, such as propositional connectives that distinguish between coordination and subordination of structure. Such information enables, for example, mapping out binding dependencies, which has proved relevant as a method to reconstruct unpronounced argument information (zero pronouns) for Japanese, and extract valence patterns for predicates, an essential part of word meaning. To carry out this work it has been necessary to continue developing a method for reaching semantic representations automatically from syntactic parsed representations and to create a large base of already analysed and human checked syntactic structures that can be transformed to semantic representations. The establishment of such a base forms training data for creating yet more like data, with the potential to scale to large volumes of data.
現在までの達成度 (区分)	現在までの達成度 (区分) 1: 当初の計画以上に進展している理由 The pipeline for producing analysed data has continued to improve. Models resulting from training are slightly smaller than a year ago despite a large increase in new data, reflecting improvements to the annotation. The work on developing methods of visualising and making accessible semantic information has focused on ways to embed information back into parsed data. This has led to the enrichment of the existing corpus data with a second layer of special-purpose annotation made up of indexing information. This corpus semantic information can now be searched because of a transformation to the TIGER-XML format that includes a structure sharing mechanism (multi-dominance) that can be queried. Research results can be seen in the interfaces of the NINJAL Parsed Corpus of Modern Japanese (NPCMJ; http://npcmj.ninjal.ac.jp/interfaces/), where, aside from a default tree view of the syntactic annotation, examples can be seen (semantic view) as predicate logic formulas capturing semantic content, as well as a view (indexed view) that embeds the calculated semantic content into the trees as indexing information. In addition, there is a visualisation for how the semantics was derived (eval view).
今後の研究の推進方策	The semantic component will continue to be developed, especially in use as a basis for visualising dependencies. The existing indexing component will be extended so as to produce the character-indexed report format of FrameNet. This will allow creation of browsable reports that display semantic dependencies in a very intuitive way. A new "scaffolding" component will be built as a layer of automated analysis to further specify part-of-speech analysis derived from systems of morphological analysis (mecab/Comainu). It is expected that additional specification will lead to improvements of the automatic parsing. The project will also be extending the range of data analysed to more genres and to historical Japanese texts.
次年度使用額が生じた理由	Money has been carried over to pay for assistance in the process of undertaking human annotation correction.
次年度使用額の使用計画	Money has been carried over to pay for assistance in the process of undertaking human annotation correction.

研究成果
(11件)

すべて 2017 2016 その他

すべて雑誌論文 (4件) (うち国際共著 3件、オープンアクセス 4件、査読あり 3件、謝辞記載あり 3件) 学会発表 (5件) (うち国際学会 3件、招待講演 2件) 備考 (1件) 学会・シンポジウム開催 (1件)

[雑誌論文] Keyaki Treebank segmentation and part-of-speech labelling2017
- 著者名/発表者名
  Alastair Butler and Stephen Wright Horn and Kei Yoshimoto
- 雑誌名
  
  言語処理学会第23回年次大会発表論文集
  
  巻: なしページ: 414-417
- オープンアクセス
[雑誌論文] From meaning representations to syntactic trees2016
- 著者名/発表者名
  Alastair Butler
- 雑誌名
  
  Proceedings of the Thirteenth International Workshop of Logic and Engineering of Natural Language Semantics 13 (LENLS 13)
  
  巻: なしページ: 147-160
- 査読あり / オープンアクセス / 国際共著 / 謝辞記載あり
[雑誌論文] DynamicPower at SemEval-2016 Task 8: Processing syntactic parse trees with a Dynamic Semantics core2016
- 著者名/発表者名
  Alastair Butler
- 雑誌名
  
  Proceedings of SemEval-2016
  
  巻: なしページ: 1148-1153
- 査読あり / オープンアクセス / 国際共著 / 謝辞記載あり
[雑誌論文] Deterministic natural language generation from meaning representations for machine translation2016
- 著者名/発表者名
  Alastair Butler
- 雑誌名
  
  Proceedings of the 2nd Workshop on Semantics-Driven Machine Translation
  
  巻: なしページ: 1-9
- 査読あり / オープンアクセス / 国際共著 / 謝辞記載あり
[学会発表] From meaning representations to syntactic trees2016
- 著者名/発表者名
  Alastair Butler
- 学会等名
  Logic and Engineering of Natural Language Semantics (LENLS 13)
- 発表場所
  Tokyo, Japan
- 年月日
  2016-11-15
- 国際学会
[学会発表] Treebank annotation of FraCaS and JSeM2016
- 著者名/発表者名
  Alastair Butler, Ai Kubota, Shota Hiyama and Kei Yoshimoto
- 学会等名
  Logic and Engineering of Natural Language Semantics (LENLS 13)
- 発表場所
  Tokyo, Japan
- 年月日
  2016-11-13
- 国際学会
[学会発表] Parsed Corpus Semantics2016
- 著者名/発表者名
  Alastair Butler
- 学会等名
  New Landscapes in Theoretical Computational Linguistics
- 発表場所
  Ohio State University, USA
- 年月日
  2016-10-15
- 招待講演
[学会発表] A parsed corpus of Japanese enriched to reach levels of semantic analysis2016
- 著者名/発表者名
  Alastair Butler, Shiro Akasegawa, Prashant Pardeshi and Kei Yoshimoto
- 学会等名
  なし
- 発表場所
  Brandeis University, Boston, USA
- 年月日
  2016-09-02
- 招待講演
[学会発表] Deterministic natural language generation from meaning representations for machine translation2016
- 著者名/発表者名
  Alastair Butler
- 学会等名
  2nd Workshop on Semantics-Driven Machine Translation
- 発表場所
  San Diego, California
- 年月日
  2016-06-16
- 国際学会
[備考] Alastair Butler - Homepage
- URL
  http://www.compling.jp/ajb129/index.html
[学会・シンポジウム開催] Unshared Task at LENLS 13 (Theory and System analysis with FraCaS, MultiFraCaS and JSeM Test Suites)2016
- 発表場所
  National Institute for Japanese Language and Linguistics, Tokyo, Japan
- 年月日
  2016-11-13 – 2016-11-13

2016 年度 実施状況報告書

統辞・意味解析情報タグ付き日本語ツリーバンクからの視覚意味情報の抽出と応用

研究代表者

バトラー アラステア 大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, プロジェクト非常勤研究員 (90588873)

現在までの達成度 (区分)

理由

研究成果

[雑誌論文] Keyaki Treebank segmentation and part-of-speech labelling2017

著者名/発表者名

雑誌名

[雑誌論文] From meaning representations to syntactic trees2016

著者名/発表者名

雑誌名

[雑誌論文] DynamicPower at SemEval-2016 Task 8: Processing syntactic parse trees with a Dynamic Semantics core2016

著者名/発表者名

雑誌名

[雑誌論文] Deterministic natural language generation from meaning representations for machine translation2016

著者名/発表者名

雑誌名

[学会発表] From meaning representations to syntactic trees2016

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] Treebank annotation of FraCaS and JSeM2016

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] Parsed Corpus Semantics2016

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] A parsed corpus of Japanese enriched to reach levels of semantic analysis2016

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] Deterministic natural language generation from meaning representations for machine translation2016

著者名/発表者名

学会等名

発表場所

年月日

[備考] Alastair Butler - Homepage

URL

[学会・シンポジウム開催] Unshared Task at LENLS 13 (Theory and System analysis with FraCaS, MultiFraCaS and JSeM Test Suites)2016

発表場所

年月日

2016 年度実施状況報告書

バトラーアラステア大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, プロジェクト非常勤研究員 (90588873)