2019 Fiscal Year Research-status Report

Developing a program for language teaching with parsed corpora

Research Project

Project/Area Number	19K00541
Research Institution	Hirosaki University
Principal Investigator	バトラーアラステア弘前大学, 人文社会科学部, 准教授 (90588873)
Project Period (FY)	2019-04-01 – 2022-03-31
Keywords	grammatical analysis / parsed corpora / language teaching / English / Japanese
Outline of Annual Research Achievements	The implementation plan is to develop a program for language teaching with parsed corpora. The components are: 1) a grammar textbook focused on English language learning for Japanese students at university level, 2) a large grammatically analysed corpus of English, also linked to Japanese language analysis for purposes of comparison, and 3) the development of a "toolkit" for analysis creation, for students to start analysing their own written language. The goal is to empower students to critically analyse their own language use and be drawn to explore wider insights from the grammatically analysed corpus.
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason The first year of the project has seen development in all three components of the project. An initial draft of the textbook has been prepared with chapters introducing 1) Words 2) Phrases 3) Clauses and 4) Subordination and coordination of clause content. A lot of work has gone into the creation of an English corpus, and to the development of the scheme that underpins the annotation, which has been optimised to support the textbook description. The "toolkit" has been developed as a post-processor that is able to convert the output of the Stanford CoreNLP parser, a state-of-the-art statistical parser, to correspond to annotations of the corpus/textbook description. This conversion was described in a conference presentation (LENLS 16), and is currently available as a demo on the web.
Strategy for Future Research Activity	Future plans are to further develop the three components. The textbook will be enlarged and trial lessons undertaken. Annotation of the English corpus will aim to grow the overall size. A key part of language learning is to gain knowledge of vocabulary. To support vocabulary learning, the coming year will start adding word sense disambiguation information. So far the "toolkit" has involved adding post-processing to a statistical parser. This buys wide coverage and a way to manage ambiguity. However, this does lead to an unpredictable range of parsing errors, requiring students to be sensitive to miss-analysis. The future plan is to re-orient the toolkit around a logic based grammar approach with predictable properties. The new challenge is to increase coverage for unconstrained input.
Causes of Carryover	The incurring amount was small and will be used to support further annotation work.

Research Products
(4 results)

All 2020 2019

All Journal Article (3 results) (of which Int'l Joint Research: 1 results, Open Access: 3 results, Peer Reviewed: 1 results) Presentation (1 results) (of which Int'l Joint Research: 1 results)

[Journal Article] PropBank形式を考慮したNPCMJに対する意味役割付与~態の違いと経験者の付与~2020
- Author(s)
  竹内孔一, バトラーアラステア, 長崎郁, ホーンスティーブンライト
- Journal Title
  
  言語処理学会第26回年次大会
  
  Volume: - Pages: 633-636
- Open Access
[Journal Article] NPCMJに対する述語項構造シソーラスの意味役割と概念フレームの付与2019
- Author(s)
  竹内孔一, Alastair Butler, 長崎郁, Prashant Pardeshi
- Journal Title
  
  SIG Technical Reports
  
  Volume: 2019-NL-241(4) Pages: 2188--8779
- Open Access
[Journal Article] From discourse to logic with Stanford CoreNLP and Treebank Semantics2019
- Author(s)
  Alastair Butler
- Journal Title
  
  Proceedings of the Sixteenth International Workshop of Logic and Engineering of Natural Language Semantics (LENLS 16)
  
  Volume: - Pages: 1-14
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] From discourse to logic with Stanford CoreNLP and Treebank Semantics2019
- Author(s)
  Alastair Butler
- Organizer
  Logic and Engineering of Natural Language Semantics (LENLS 16)
- Int'l Joint Research

2019 Fiscal Year Research-status Report

Developing a program for language teaching with parsed corpora

Principal Investigator

バトラー アラステア 弘前大学, 人文社会科学部, 准教授 (90588873)

Current Status of Research Progress

Reason

Research Products

[Journal Article] PropBank形式を考慮したNPCMJに対する意味役割付与~態の違いと経験者の付与~2020

Author(s)

Journal Title

[Journal Article] NPCMJに対する述語項構造シソーラスの意味役割と概念フレームの付与2019

Author(s)

Journal Title

[Journal Article] From discourse to logic with Stanford CoreNLP and Treebank Semantics2019

Author(s)

Journal Title

[Presentation] From discourse to logic with Stanford CoreNLP and Treebank Semantics2019

Author(s)

Organizer

バトラーアラステア弘前大学, 人文社会科学部, 准教授 (90588873)