
FY2017 Research Status Report

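Extraction and Application of Visual Semantic Information from a Japanese Treebank Tagged with Syntactic and Semantic Analysis Information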

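Research Project

Project/Area Number: 15K02469
Research Institution: National Institute for Japanese Language and Linguistics, National Institutes for the Humanities (Inter-University Research Institute Corporation)

Principal Investigator

Alastair Butler  National Institute for Japanese Language and Linguistics, National Institutes for the Humanities (Inter-University Research Institute Corporation), Departments of Inter-University Research Institutes, etc., Researcher (90588873)

Project Period (FY): 2015-04-01 – 2019-03-31
Keywords: semantic dependencies / parsed corpus / visualisation / annotation / predicate arguments / discourse relations
Outline of Research Achievements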

The aim of the research has been to develop methods for visualising and making accessible semantic information from analyses of Japanese and English. This covers predicate-argument information as well as higher levels of analysis, such as propositional connectives, modals, negation, and factors of discourse.

The key part of this work has been the development of a visualisation tool for semantic relationships derivable from a parsed corpus. The tool enables human annotators to assess whether their interpretations of discourse have been adequately captured by the parsed corpus. As now realised, it is capable of capturing many relationships found in discourse, providing a framework in which a fleshed-out account of semantic roles, quantification, and modality becomes feasible.
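To make concrete what "semantic relationships derivable from a parsed corpus" might look like, here is a minimal sketch in Python. It is not the project's tool or the Treebank Semantics implementation. It assumes a Penn-style bracketed tree whose clause nodes start with `IP`, whose predicates start with `VB`, and whose arguments are `NP` nodes carrying a function tag such as `-SBJ` or `-OB1`. The function names `parse_bracketed`, `leaves`, and `predicate_arguments` and the toy sentence are hypothetical and introduced only for illustration.

```python
import re


def parse_bracketed(text):
    """Parse a bracketed tree string into nested (label, children) tuples."""
    tokens = re.findall(r"\(|\)|[^\s()]+", text)
    pos = 0

    def node():
        nonlocal pos
        assert tokens[pos] == "(", "expected an opening bracket"
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(node())      # nested constituent
            else:
                children.append(tokens[pos])  # terminal word
                pos += 1
        pos += 1  # consume the closing bracket
        return (label, children)

    return node()


def leaves(tree):
    """Return the terminal words under a node, in order."""
    _, children = tree
    words = []
    for child in children:
        if isinstance(child, str):
            words.append(child)
        else:
            words.extend(leaves(child))
    return words


def predicate_arguments(tree):
    """Collect (predicate, role, argument) triples, clause by clause.

    Assumption (illustrative only): a clause is a node whose label starts
    with "IP", its predicate is its first "VB" daughter, and its arguments
    are "NP-<ROLE>" daughters.
    """
    label, children = tree
    subtrees = [c for c in children if not isinstance(c, str)]
    triples = []
    if label.startswith("IP"):
        preds = [c for c in subtrees if c[0].startswith("VB")]
        if preds:
            pred = " ".join(leaves(preds[0]))
            for c in subtrees:
                if c[0].startswith("NP-"):
                    role = c[0].split("-", 1)[1]
                    triples.append((pred, role, " ".join(leaves(c))))
    for c in subtrees:
        triples.extend(predicate_arguments(c))
    return triples


if __name__ == "__main__":
    toy = "(IP-MAT (NP-SBJ (N dogs)) (NP-OB1 (N cats)) (VB chase))"
    for triple in predicate_arguments(parse_bracketed(toy)):
        print(triple)
```

Triples of this kind are the sort of information a visualisation could lay out as labelled arcs for an annotator to check against their own reading of the text. Discourse-level relations such as clause linkage would need considerably more than this tag-matching sketch.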

Current Status of Research Progress (Category)

1: Research has progressed beyond the original plan

Reason

The visualisation tool is now being used as a key part of the creation and presentation chain of three corpus resources: the NINJAL Parsed Corpus of Modern Japanese (NPCMJ; http://npcmj.ninjal.ac.jp), the Oxford-NINJAL Corpus of Old Japanese (ONCOJ; http://oncoj.ninjal.ac.jp/?lang=en), and the Treebank Semantics Parsed Corpus (TSPC; http://www.compling.jp/ajb129/tspc.html).

The developed visualisation tool has revealed layers of dependencies that were not easily visible before. At the same time, the tool has revealed inadequacies of analyses in the present state of the corpus data.

Strategy for Future Research Activity

Until now, two essential components for establishing semantic dependencies (allocation of "sort" information and the specification of clause linkages) have been handled by a small number of specialists who are able to cash out the results of complex grammatical rules (such as those involving an antecedent hierarchy) and build these into annotation information without the aid of visualisation tools.
Now the project is in a position to turn these tasks over to non-specialists who need only have intuitions about meaningful relationships in texts and enough knowledge to be able to spot whether they are represented in the visualisation or not.
Only after reviewing the results of a program of annotation that takes advantage of this new technology can the adequacy of the tool be properly assessed, and the feasibility of including additional layers of semantic information be ascertained.
For the remainder of the term of the project the plan is to increase the volume of relevant data by hiring annotators, and to publicise the results of the project domestically and abroad at academic conferences.

Causes of Carryover

The developed visualisation tool has revealed layers of dependencies that were not easily visible before. At the same time, the tool has revealed inadequacies of analyses in the present state of the corpus data.

For the remainder of the term of the project the plan is to increase the volume and quality of relevant data by hiring annotators, and to publicise the results of the project domestically and abroad at academic conferences.

Remarks

The Treebank Semantics Parsed Corpus (TSPC) and the Keyaki Treebank are corpus resources that can be viewed and downloaded. Treebank Semantics implements a method for obtaining meaning representations.

  • Research Products

    (8 results)


Journal Article (2 results; of which international joint research: 1, open access: 2, peer reviewed: 1), Presentation (3 results), Remarks (3 results)

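  • [Journal Article] 統語解析情報付きコーパス検索用インタフェースの開発 (Development of an interface for searching a syntactically parsed corpus) 2018

    • Author(s)/Presenter(s)
      Iku Nagasaki, Alastair Butler, Stephen Wright Horn, Prashant Pardeshi and Yoshimoto
    • Journal Title

      Proceedings of the 24th Annual Meeting of the Association for Natural Language Processing

      Volume: - Pages: 1123-1126

    • Open Access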
  • [Journal Article] Annotating syntax and lexical semantics with(out) indexing 2017

    • Author(s)/Presenter(s)
      Alastair Butler and Stephen Wright Horn
    • Journal Title

      Proceedings of the Fourteenth International Workshop of Logic and Engineering of Natural Language Semantics (LENLS 14)

      Volume: - Pages: -

    • Peer Reviewed / Open Access / International Joint Research
  • [Presentation] Developing a model of typical Japanese grammar development: The role of parsed corpora and parsing programs 2017

    • Author(s)/Presenter(s)
      Susanne Miyata and Alastair Butler
    • Conference
      Exploiting Parsed Corpora: Applications in Research, Pedagogy, and Processing
  • [Presentation] Developing a model of typical Japanese grammar development: The role of parsed corpora and parsing programs 2017

    • Author(s)/Presenter(s)
      Stephen Wright Horn, Alastair Butler and Iku Nagasaki
    • Conference
      Exploiting Parsed Corpora: Applications in Research, Pedagogy, and Processing
  • [Presentation] Annotating syntax and lexical semantics with(out) indexing 2017

    • Author(s)/Presenter(s)
      Alastair Butler and Stephen Wright Horn
    • Conference
      Logic and Engineering of Natural Language Semantics (LENLS 14)
  • [Remarks] The Treebank Semantics Parsed Corpus (TSPC)

    • URL

      http://www.compling.jp/ajb129/tspc.html

  • [Remarks] Treebank Semantics

    • URL

      http://www.compling.jp/ajb129/ts.html

  • [Remarks] The Keyaki Treebank Homepage

    • URL

      http://www.compling.jp/keyaki/


Published: 2018-12-17
