言語生産性：有効な類推関係クラスターの迅速な抽出・統計的機械翻訳でその評価

研究課題

研究課題/領域番号	15K00317
研究種目	基盤研究(C)
配分区分	基金
応募区分	一般
研究分野	知能情報学
研究機関	早稲田大学
研究代表者	LEPAGE YVES 早稲田大学, 理工学術院(情報生産システム研究科・センター), 教授 (70573608)
研究協力者	楊巍ファムラシェルスサンティゴジャリ
研究期間 (年度)	2015-04-01 – 2018-03-31
研究課題ステータス	完了 (2017年度)
配分額 *注記	4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円) 2017年度: 1,300千円 (直接経費: 1,000千円、間接経費: 300千円) 2016年度: 1,690千円 (直接経費: 1,300千円、間接経費: 390千円) 2015年度: 1,560千円 (直接経費: 1,200千円、間接経費: 360千円)
キーワード	自然言語処理 / 人工知能 / データ構造 / 形態で豊かな言語 / 中国語・日本語
研究成果の概要	本研究の目的は、１。単言語データから類推関係クラスターを構築し、２。そのクラスターから擬似パラレルコーパスを生成し、３。パラレルコーパスに追加することにより４。統計的機械翻訳（SMT）の精度を向上させる。そのため、様々なツールを実装し公開した。新しいデータ構造も導入した：類推関係グリッド。形態的に貧しい言語を始め形態豊かな言語を渡って様々な言語でデータを構築した：欧州連合の１１ヶ国語、中国語、日本語、また追加言語（アラビア語、グルジア語、ナバホ語、ロシア語、トルコ語）。データの一部分は公開した。行った実験で擬似パラレルコーパスの追加により日中SMTの翻訳精度を向上することを明らかにした。

報告書

(4件)

研究成果
(19件)

すべて 2018 2017 2016 その他

すべて雑誌論文 (2件) (うち査読あり 2件、オープンアクセス 2件、謝辞記載あり 2件) 学会発表 (15件) (うち招待講演 3件) 備考 (2件)

[雑誌論文] Inflating a Small Parallel Corpus into a Large Quasi-parallel Corpus Using Monolingual Data for Chinese-Japanese Machine Translation2017
- 著者名/発表者名
  W. Yang, H. Shen, and Y. Lepage
- 雑誌名
  
  Journal of Information Processing
  
  巻: 25 号: 0 ページ: 88-99
- DOI
  10.2197/ipsjjip.25.88
- NAID
  130005292406
- ISSN
  1882-6652
- 関連する報告書
  2016 実施状況報告書
- 査読あり / オープンアクセス / 謝辞記載あり
[雑誌論文] A method of generating translations of unseen n-grams by using proportional analogy2016
- 著者名/発表者名
  J. Luo and Y. Lepage
- 雑誌名
  
  IEEJ Transactions in Electronics, Information and Systems
  
  巻: 11(3) 号: 3 ページ: 325-330
- DOI
  10.1002/tee.22221
- 関連する報告書
  2016 実施状況報告書
- 査読あり / オープンアクセス / 謝辞記載あり
[学会発表] Plausibility of word forms generated from analogical grids in Indonesian2018
- 著者名/発表者名
  R. Fam, A. Purwarianti, and Y. Lepage
- 学会等名
  Proceedings of the 16th International Conference on Computer Applications (ICCA 2018), pages 179--184, Yangon, Myanmar, February 2018.
- 関連する報告書
  2017 実績報告書
[学会発表] Validating analogically generated Indonesian words using Fisher’s exact test2018
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  Proceedings of the 24th Annual Meeting of the Japanese Association for Natural Language Processing, pages 312--315, Okayama, Japan, March 2018.
- 関連する報告書
  2017 実績報告書
[学会発表] Automatic Production of Quasi-parallel Corpora for Machine Translation2018
- 著者名/発表者名
  Y. Lepage
- 学会等名
  International Conference on Natural Language, Signal and Speech Processing 2017, Casablanca, Morocco, 06--07 Dec. 2017
- 関連する報告書
  2017 実績報告書
- 招待講演
[学会発表] Quasi-Parallel Corpora: Hallucinating Translations for the Chinese-Japanese Language Pair2018
- 著者名/発表者名
  Y. Lepage
- 学会等名
  BUCC workshop colocated with LREC 2018, Miyazaki, Japan, May 2018
- 関連する報告書
  2017 実績報告書
- 招待講演
[学会発表] Indonesian unseen words explained by form, morphology and distributional semantics at the same time.2017
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  言語処理学会第23回年次大会(NLP2017)論文集, pages 178--181.
- 発表場所
  筑波大学
- 年月日
  2017-03-14
- 関連する報告書
  2016 実施状況報告書
[学会発表] A study in explaining unseen words in Indonesian using analogical clusters2017
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  In Proceedings of 15th International Conference on Computer Applications (ICCA 2017), pages 416--421.
- 発表場所
  Yangon, Myanmar
- 年月日
  2017-02-16
- 関連する報告書
  2016 実施状況報告書
[学会発表] Character-position arithmetic for analogy questions between word forms2017
- 著者名/発表者名
  Y. Lepage
- 学会等名
  Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case-Based Reasoning (ICCBR-17), pages 17--26, Trondheim, Norway, August 2017
- 関連する報告書
  2017 実績報告書
[学会発表] A study of the saturation of analogical grids agnostically extracted from texts2017
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case-Based Reasoning (ICCBR-17), pages 7--16, Trondheim, Norway, August 2017.
- 関連する報告書
  2017 実績報告書
[学会発表] A holistic approach at a morphological inflection task2017
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  Proceedings of the 8th Language & Technology Conference (LTC’17), pages 88--92, Poznan, November 2017. Fundacja uniwersytetu im. Adama Mickiewicza.
- 関連する報告書
  2017 実績報告書
[学会発表] Confidence of word forms generated in analogical grids2017
- 著者名/発表者名
  P. Liu and Y. Lepage
- 学会等名
  Proceedings of the 11th International collaboration Symposium on Information, Production and Systems (ISIPS 2017), pages 238--240, IPS, Waseda university, nov 2017.
- 関連する報告書
  2017 実績報告書
[学会発表] Tools for the production of analogical grids and a resource of n-gram analogical grids in 11 languages2017
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  Proceedings of the 11th Edition of the Language Resources and Evaluation Conference (LREC 2018), Miyazaki, Japan, May 2018. (accepted, to appear)
- 関連する報告書
  2017 実績報告書
[学会発表] Analogical grids and clusters: assessment with machine translation [in French]2017
- 著者名/発表者名
  Y. Lepage
- 学会等名
  40 ans de traduction automatique, Grenoble, France, July 2017
- 関連する報告書
  2017 実績報告書
- 招待講演
[学会発表] Production of analogical clusters between marker-based chunks in Chinese and Japanese2016
- 著者名/発表者名
  W. Yang, M. Gao, and Y. Lepage
- 学会等名
  In Proceedings of the 10th International collaboration Symposium on Information, Production and Systems (ISIPS 2016), pages 238--241.
- 発表場所
  北九州
- 年月日
  2016-11-09
- 関連する報告書
  2016 実施状況報告書
[学会発表] Morphological predictability of unseen words using computational analogy2016
- 著者名/発表者名
  R. Fam and Y. Lepage
- 学会等名
  Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case-Based Reasoning (ICCBR-16), pages 51--60.
- 発表場所
  Atlanta, Georgia, USA.
- 関連する報告書
  2016 実施状況報告書
[学会発表] Solving analogical equations between strings of symbols using neural networks2016
- 著者名/発表者名
  V. Kaveeta and Y. Lepage
- 学会等名
  In Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case- Based Reasoning (ICCBR-16), pages 67--76.
- 発表場所
  Atlanta, Georgia, USA.
- 関連する報告書
  2016 実施状況報告書
[備考] Grants-in-Aid Kakenhi Kiban C 15K00317
- 関連する報告書
  2017 実績報告書
[備考] Projects / Kakenhi 15K00317 / Experimental results
- URL
  http://lepage-lab.ips.waseda.ac.jp/index.php/2016-08-01-06-37-56/kakenhi-2/kakenhi-2-experiment-result
- 関連する報告書
  2016 実施状況報告書

言語生産性：有効な類推関係クラスターの迅速な抽出・統計的機械翻訳でその評価

研究代表者

LEPAGE YVES 早稲田大学, 理工学術院(情報生産システム研究科・センター), 教授 (70573608)

4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円)

報告書

研究成果

[雑誌論文] Inflating a Small Parallel Corpus into a Large Quasi-parallel Corpus Using Monolingual Data for Chinese-Japanese Machine Translation2017

著者名/発表者名

雑誌名

DOI

NAID

ISSN

関連する報告書

[雑誌論文] A method of generating translations of unseen n-grams by using proportional analogy2016

著者名/発表者名

雑誌名

DOI

関連する報告書

[学会発表] Plausibility of word forms generated from analogical grids in Indonesian2018

著者名/発表者名

学会等名

関連する報告書

[学会発表] Validating analogically generated Indonesian words using Fisher’s exact test2018

著者名/発表者名

学会等名

関連する報告書

[学会発表] Automatic Production of Quasi-parallel Corpora for Machine Translation2018

著者名/発表者名

学会等名

関連する報告書

[学会発表] Quasi-Parallel Corpora: Hallucinating Translations for the Chinese-Japanese Language Pair2018

著者名/発表者名

学会等名

関連する報告書

[学会発表] Indonesian unseen words explained by form, morphology and distributional semantics at the same time.2017

著者名/発表者名

学会等名

発表場所

年月日

関連する報告書

[学会発表] A study in explaining unseen words in Indonesian using analogical clusters2017

著者名/発表者名

学会等名

発表場所

年月日

関連する報告書

[学会発表] Character-position arithmetic for analogy questions between word forms2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] A study of the saturation of analogical grids agnostically extracted from texts2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] A holistic approach at a morphological inflection task2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] Confidence of word forms generated in analogical grids2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] Tools for the production of analogical grids and a resource of n-gram analogical grids in 11 languages2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] Analogical grids and clusters: assessment with machine translation [in French]2017

著者名/発表者名

学会等名

関連する報告書

[学会発表] Production of analogical clusters between marker-based chunks in Chinese and Japanese2016

著者名/発表者名

学会等名

発表場所

年月日

関連する報告書

[学会発表] Morphological predictability of unseen words using computational analogy2016

著者名/発表者名

学会等名

発表場所