Summary of Research Achievements |
In Japan, translation needs are growing rapidly because of the sharp increase in foreign tourists and the hosting of the 2020 Tokyo Olympic Games, making machine translation (MT) indispensable. In MT, translation knowledge is acquired from parallel corpora (sentence-aligned bilingual texts). However, because parallel corpora for most language pairs involving Japanese (e.g., Japanese-Indonesian) and for most domains (e.g., the medical domain) are very scarce (only tens of thousands of parallel sentences or fewer), translation quality is unsatisfactory. Improving MT quality in this low-resource scenario is a challenging, unsolved problem. The purpose of this research is to improve MT quality in this low-resource scenario using multiple resources, including parallel corpora of resource-rich language pairs (such as French-English) and domains (such as the parliamentary domain), as well as large-scale monolingual web corpora. In FY2017, we established model adaptation technologies using resource-rich language and domain parallel corpora. Specifically, we obtained the following achievements:
1. Single language/domain adaptation: We developed novel methods and conducted a comprehensive empirical comparison with previous studies. These achievements were published at ACL 2017 (the top conference in natural language processing) and have been accepted for publication in the Journal of Information Processing in June.
2. Multiple language/domain adaptation: We developed methods for domain adaptation using multilingual and multi-domain corpora, and presented this work at NLP 2018.
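To make the idea of domain adaptation with scarce in-domain data concrete, the toy sketch below illustrates one common family of adaptation strategies: starting from a model trained on a large out-of-domain corpus and continuing training on a small, upweighted in-domain corpus. This is a minimal, hypothetical illustration only, not the specific methods developed in this project; the word pairs, weights, and the unigram-lexicon "model" are all invented for exposition.

```python
# Toy illustration of corpus-weighting domain adaptation
# (a common NMT adaptation idea; NOT the project's actual method).
# The "translation model" here is just a unigram lexicon:
# source word -> {target word: score}.
from collections import defaultdict

def train_lexicon(parallel_pairs, lexicon=None, weight=1.0):
    """Accumulate weighted co-occurrence counts from word-aligned pairs.

    Passing an existing lexicon plus a higher weight for in-domain data
    mimics continued training (fine-tuning) on a new domain.
    """
    if lexicon is None:
        lexicon = defaultdict(lambda: defaultdict(float))
    for src, tgt in parallel_pairs:
        lexicon[src][tgt] += weight
    return lexicon

def translate(word, lexicon):
    """Pick the highest-scoring target word; copy unknown words through."""
    if word not in lexicon or not lexicon[word]:
        return word
    return max(lexicon[word], key=lexicon[word].get)

# Large out-of-domain corpus: "bank" usually means a financial bank.
out_of_domain = [("bank", "ginkou")] * 100 + [("bank", "teibou")] * 5
# Small in-domain (e.g., geography) corpus: "bank" means a riverbank.
in_domain = [("bank", "teibou")] * 30

lex = train_lexicon(out_of_domain)               # baseline model
lex = train_lexicon(in_domain, lex, weight=5.0)  # adapt: upweight in-domain data

print(translate("bank", lex))  # adapted model prefers the in-domain sense
```

With the invented counts above, upweighting the 30 in-domain pairs by 5 outweighs the 100 out-of-domain occurrences, so the adapted lexicon prefers the in-domain translation. Real NMT adaptation replaces the count table with neural model parameters, but the trade-off between large out-of-domain and small in-domain data is the same.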
|
Strategy for Future Research Activity |
We will study the remaining two topics as scheduled: data adaptation using large-scale monolingual web corpora, and integration of systems adapted with multiple resources. In our journal paper, to be published in the Journal of Information Processing in June, we have already conducted a comparison of previous studies on these two topics. In addition, we wrote a survey paper on domain adaptation for neural machine translation and submitted it to COLING 2018 (a top conference in natural language processing). We believe that these preliminary studies will help our research in FY2018 proceed smoothly.
|