2020 年度実施状況報告書

Developing a program for language teaching with parsed corpora

研究課題

研究課題/領域番号	19K00541
研究機関	弘前大学
研究代表者	バトラーアラステア弘前大学, 人文社会科学部, 准教授 (90588873)
研究期間 (年度)	2019-04-01 – 2022-03-31
キーワード	grammatical analysis / parsed corpora / language teaching / English / Japanese
研究実績の概要	The implementation plan is to develop a program for language teaching with parsed corpora. The components are: 1) a grammar textbook focused on English language learning for Japanese students at university level, 2) a large grammatically analysed corpus of English, also linked to Japanese language analysis for purposes of comparison, and 3) the development of a "toolkit" for analysis creation, for students to start analysing their own written language. The goal is to empower students to critically analyse their own language use and be drawn to explore wider insights from the grammatically analysed corpus. The second year of the project has seen further development in all three components of the project.
現在までの達成度 (区分)	現在までの達成度 (区分) 1: 当初の計画以上に進展している理由 Development of a textbook has branched into two directions: (i) an introductory guide for the English parsed corpus, and (ii) a supplement to a published textbook (Grammar and Beyond 3B, Cambridge University Press) that links the textbook units to queries of the corpus data. The size of the analysed corpus has grown to just under half-a-million words (43,831 trees). Improvements include the further integration of functional information, and verb codes to assist with word sense disambiguation. The "toolkit" has been further developed to include additional post-processing of the parse analysis to reach rich dependency information for meaning extraction. This post-processing was described in a conference presentation (LENLS 17), and is available as a demo on the web.
今後の研究の推進方策	The parsing guide will be further enlarged and refined. Content will be added to the textbook supplement for it to be useful when used alone. When combined with the published textbook, a complete package will be obtained for a full semester course of communicative English organised around insights of grammar and linked to corpus queries for twelve units/chapters. Annotation of the English corpus will aim to add more verb code information for word sense disambiguation to assist students with gaining knowledge of vocabulary and word use. The dependency analysis gained from the "toolkit" will be further advanced to make the output easily readable as a graph, and so serve as an alternative way for students to gain a strong feeling for how words interact with each other.
次年度使用額が生じた理由	The incurring amount was small and will be used to support further annotation work.
備考	The Treebank Semantics Parsed Corpus website is the main portal to the parsed corpus and its documentation. The Treebank Semantics website gives access to the project software and online demo.

研究成果
(5件)

すべて 2020 その他

すべて雑誌論文 (2件) (うち国際共著 2件、査読あり 2件、オープンアクセス 1件) 学会発表 (1件) (うち国際学会 1件) 備考 (2件)

[雑誌論文] From Discourse to Logic with Stanford CoreNLP and Treebank Semantics2020
- 著者名/発表者名
  Alastair Butler
- 雑誌名
  
  New Frontiers in Artificial Intelligence. JSAI-isAI 2019
  
  巻: 12331 ページ: 182-196
- DOI
  10.1007/978-3-030-58790-1_12
- 査読あり / 国際共著
[雑誌論文] Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed Corpus2020
- 著者名/発表者名
  Koichi Takeuchi, Alastair Butler, Iku Nagasaki, Takuya Okamura and Prashant Pardeshi
- 雑誌名
  
  Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020)
  
  巻: - ページ: 3153-3161
- 査読あり / オープンアクセス / 国際共著
[学会発表] Knowledge acquisition from natural language with Treebank Semantics and Flora-22020
- 著者名/発表者名
  Alastair Butler
- 学会等名
  Proceedings of the Seventeenth International Workshop of Logic and Engineering of Natural Language Semantics (LENLS 17)
- 国際学会
[備考] Treebank Semantics Parsed Corpus (TSPC) website
- URL
  http://www.compling.jp/ajb129/tspc.html
[備考] Treebank Semantics website
- URL
  http://www.compling.jp/ajb129/ts.html

2020 年度 実施状況報告書

Developing a program for language teaching with parsed corpora

研究代表者

バトラー アラステア 弘前大学, 人文社会科学部, 准教授 (90588873)

現在までの達成度 (区分)

理由

研究成果

[雑誌論文] From Discourse to Logic with Stanford CoreNLP and Treebank Semantics2020

著者名/発表者名

雑誌名

DOI

[雑誌論文] Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed Corpus2020

著者名/発表者名

雑誌名

[学会発表] Knowledge acquisition from natural language with Treebank Semantics and Flora-22020

著者名/発表者名

学会等名

[備考] Treebank Semantics Parsed Corpus (TSPC) website

URL

[備考] Treebank Semantics website

URL

2020 年度実施状況報告書

バトラーアラステア弘前大学, 人文社会科学部, 准教授 (90588873)