2020 年度実績報告書

英語学習者のための議論構造を考慮した作文支援システム

研究課題

研究課題/領域番号	20J13239
研究機関	東京工業大学
研究代表者	PUTRA JAN WIRA GOTAMA 東京工業大学, 情報理工学院, 特別研究員(DC2)
研究期間 (年度)	2020-04-24 – 2022-03-31
キーワード	EFL students / argumentative structure / annotation tool / TIARA / annotator agreement
研究実績の概要	My research aims to develop an argument-oriented writing assistant for EFL students. The tool extracts and visualises the argumentative structure in essays. Based on the predicted argumentative structure, the tool also presents suggested improvements of the essays by reordering sentences. Currently, I have created a corpus of 434 annotated EFL essays with argumentative structure and sentence reordering. This is the largest annotated EFL essays so far. I also developed novel inter-annotator agreement metrics for a better evaluation of annotation quality. The paper describing this corpus has been accepted at an international journal with minor revision and will be available in the next fiscal year. Alongside this dataset annotation project, I also developed an in-house annotation tool TIARA to accommodate our task (publicly available). The paper describing this tool has been published at an international conference. Using the annotated corpus, I have built a deep learning model for parsing argumentative structure in essays. Given an essay (a sequence of sentences) as input, this module predicts the argumentative relations between sentences, forming a hierarchical structure. I employed a “Biaffine-Attention-Model”, which is the state-of-the-art model for automatic parsing of hierarchical structure. The paper describing automatic parsing experiment has been presented at an international conference as well.
現在までの達成度 (区分)	現在までの達成度 (区分) 2: おおむね順調に進展している理由 In this fiscal year, I annotated argumentative essays sampled from the ICNALE corpus (writings by Asian students). There are two annotation layers: (1) argumentative structure and (2) sentence reordering. I confirmed that the argumentative structure analysis could be very useful for textual assessment. The sentence reordering procedure also helped in improving essays. To realise my goal in developing a writing assistant tool, I then experimented on automatic argumentative structure parsing models using deep learning techniques. The model has not achieved an industrial-level performance yet, but this experiment provided insights and contributions for work on essays written by non-native English speakers (which is a niche research area).
今後の研究の推進方策	The automatic parsing model is very important for the downstream sentence reordering task. To this end, I will still try various techniques to improve the parsing performance in the next fiscal year. Particularly, using multi-task learning technique and training-data expansion. Multi-task learning will help the model to perform better. As my corpus is relatively small for a deep learning experiment, expanding the data using other corpora will be helpful. I will also implement the second module, i.e., sentence reordering. Given an essay and its argumentative structure as input, this module rearranges sentences in the essay to improve text coherence. There are two subtasks involved in this module: (a) pairwise ordering constraint prediction and (b) inference. In the pairwise ordering constraint prediction task, I determine the partial ordering preference between sentences connected by argumentative relations. This subtask outputs a topological graph. In the inference task (b), I then reorder sentences in the text based on the topological graph of (a). Since there is no quantitative metrics to measure whether the reordered essay is better than the original ones written by students, I plan to evaluate the system outputs by manual evaluation in the next fiscal year.

研究成果
(4件)

すべて 2020 その他

すべて国際共同研究 (1件) 学会発表 (2件) (うち国際学会 1件、招待講演 1件) 備考 (1件)

[国際共同研究] University of Cambridge(英国)
- 国名
  英国
- 外国機関名
  University of Cambridge
[学会発表] TIARA: A Tool for Annotating Discourse Relations and Sentence Reordering2020
- 著者名/発表者名
  Jan Wira Gotama Putra, Simone Teufel, Kana Matsumura, Takenobu Tokunaga
- 学会等名
  International Conference on Language Resources and Evaluation (peer-reviewed)
- 国際学会
[学会発表] Annotating Argumentative Structure in English-as-Foreign-Language Essays2020
- 著者名/発表者名
  Jan Wira Gotama Putra
- 学会等名
  Indonesia Association of Computational Linguistics (INACL)
- 招待講演
[備考] TIARA annotation tool open-source page
- URL
  https://github.com/wiragotama/TIARA-annotationTool

2020 年度 実績報告書

英語学習者のための議論構造を考慮した作文支援システム

研究代表者

PUTRA JAN WIRA GOTAMA 東京工業大学, 情報理工学院, 特別研究員(DC2)

現在までの達成度 (区分)

理由

研究成果

[国際共同研究] University of Cambridge(英国)

国名

外国機関名

[学会発表] TIARA: A Tool for Annotating Discourse Relations and Sentence Reordering2020

著者名/発表者名

学会等名

[学会発表] Annotating Argumentative Structure in English-as-Foreign-Language Essays2020

著者名/発表者名

学会等名

[備考] TIARA annotation tool open-source page

URL

2020 年度実績報告書