
2017 Fiscal Year Annual Research Report

Language productivity: fast extraction of productive analogical clusters and their evaluation using statistical machine translation

Research Project

Project/Area Number 15K00317
Research Institution Waseda University

Principal Investigator

LEPAGE YVES  Waseda University, Faculty of Science and Engineering (Graduate School of Information, Production and Systems), Professor (70573608)

Project Period (FY) 2015-04-01 – 2018-03-31
Keywords natural language processing / artificial intelligence / data structures / morphologically rich languages / Chinese and Japanese
Outline of Annual Research Achievements

Year 3 was dedicated to further experiments in generating new words from analogical grids, an additional and promising data structure. The saturation of a grid is an important notion for identifying which new words or phrases can be created by analogy. Experiments on the quality of newly created word forms were conducted in a morphologically rich language. The experiments announced in the research plan, which aimed to predict the quality of newly created word forms in several languages (English, Finnish, German and Indonesian), showed that Fisher's exact test can help control the quality of newly generated word forms.
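As an illustration of the statistical test mentioned above, here is a minimal sketch of a one-sided Fisher's exact test on a 2x2 contingency table, written with the Python standard library only. The report does not say how the contingency table is built from an analogical grid, so the function name `fisher_right_tail` and the counts in the usage line are hypothetical and serve only to show the computation.

```python
from math import comb


def fisher_right_tail(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the table [[a, b], [c, d]].

    Returns P(X >= a) under the hypergeometric null hypothesis,
    with all row and column totals held fixed.
    """
    row1 = a + b              # total of the first row
    col1 = a + c              # total of the first column
    n = a + b + c + d         # grand total
    denom = comb(n, col1)     # number of ways to fill the first column
    p = 0.0
    # Sum the probabilities of all tables at least as extreme as the observed one.
    for k in range(a, min(row1, col1) + 1):
        p += comb(row1, k) * comb(n - row1, col1 - k) / denom
    return p


# Hypothetical counts, for illustration only (not data from the project).
p_value = fisher_right_tail(3, 1, 1, 3)
print(p_value)  # 17/70, about 0.243
```

A small p-value would indicate that the association in the table is unlikely to be due to chance. Which threshold to apply to generated word forms is a design choice that the report does not specify.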
Tools and data produced during the research will be officially announced at the LREC 2018 conference: fast tools for the computation of analogical clusters and grids, and analogical clusters and grids for N-grams in 11 European languages (both have been available on the project web site since year 2). In addition, a large data set of 65 million formal analogies between word forms in 10 languages will be released.
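For readers unfamiliar with formal analogies between word forms (a : b :: c : d), the sketch below checks one necessary, but not sufficient, condition commonly stated for analogies between strings: every character must occur as many times in a and d together as in b and c together. This is not the project's extraction tool; the function name `satisfies_count_condition` is hypothetical, and a full analogy check would need further constraints on character order.

```python
from collections import Counter


def satisfies_count_condition(a: str, b: str, c: str, d: str) -> bool:
    """Necessary condition for the formal analogy a : b :: c : d.

    For every character x, the number of occurrences of x in a plus d
    must equal the number of occurrences of x in b plus c.
    """
    return Counter(a) + Counter(d) == Counter(b) + Counter(c)


# walk : walked :: talk : talked passes the check.
print(satisfies_count_condition("walk", "walked", "talk", "talked"))  # True
# walk : walked :: talk : talks fails it.
print(satisfies_count_condition("walk", "walked", "talk", "talks"))   # False
```

Because the condition is only necessary, it is best suited as a cheap filter that discards impossible quadruples before a more expensive verification.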
Dissemination of results. The principal investigator will deliver an invited talk at an LREC workshop to present the results of the research: (1) fast production of analogical clusters from monolingual data, (2) their use in the production of quasi-parallel data, (3) the use of such data in statistical machine translation, and (4) a synthesis of the improvements in translation accuracy. The principal investigator also gave two invited presentations, one at a meeting on machine translation in France and one at an international conference on natural language processing in Morocco.

  • Research Products

    (11 results)


All Presentation (10 results) (of which Invited: 3 results) Remarks (1 results)

  • [Presentation] Plausibility of word forms generated from analogical grids in Indonesian (2018)

    • Author(s)
      R. Fam, A. Purwarianti, and Y. Lepage
    • Organizer
      Proceedings of the 16th International Conference on Computer Applications (ICCA 2018), pages 179--184, Yangon, Myanmar, February 2018.
  • [Presentation] Validating analogically generated Indonesian words using Fisher’s exact test (2018)

    • Author(s)
      R. Fam and Y. Lepage
    • Organizer
      Proceedings of the 24th Annual Meeting of the Japanese Association for Natural Language Processing, pages 312--315, Okayama, Japan, March 2018.
  • [Presentation] Automatic Production of Quasi-parallel Corpora for Machine Translation (2018)

    • Author(s)
      Y. Lepage
    • Organizer
      International Conference on Natural Language, Signal and Speech Processing 2017, Casablanca, Morocco, 06--07 Dec. 2017
    • Invited
  • [Presentation] Quasi-Parallel Corpora: Hallucinating Translations for the Chinese-Japanese Language Pair (2018)

    • Author(s)
      Y. Lepage
    • Organizer
      BUCC workshop colocated with LREC 2018, Miyazaki, Japan, May 2018
    • Invited
  • [Presentation] Character-position arithmetic for analogy questions between word forms (2017)

    • Author(s)
      Y. Lepage
    • Organizer
      Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case-Based Reasoning (ICCBR-17), pages 17--26, Trondheim, Norway, August 2017
  • [Presentation] A study of the saturation of analogical grids agnostically extracted from texts (2017)

    • Author(s)
      R. Fam and Y. Lepage
    • Organizer
      Proceedings of the Computational Analogy Workshop at the 24th International Conference on Case-Based Reasoning (ICCBR-17), pages 7--16, Trondheim, Norway, August 2017.
  • [Presentation] A holistic approach at a morphological inflection task (2017)

    • Author(s)
      R. Fam and Y. Lepage
    • Organizer
      Proceedings of the 8th Language & Technology Conference (LTC’17), pages 88--92, Poznan, November 2017. Fundacja uniwersytetu im. Adama Mickiewicza.
  • [Presentation] Confidence of word forms generated in analogical grids (2017)

    • Author(s)
      P. Liu and Y. Lepage
    • Organizer
      Proceedings of the 11th International Collaboration Symposium on Information, Production and Systems (ISIPS 2017), pages 238--240, IPS, Waseda University, November 2017.
  • [Presentation] Tools for the production of analogical grids and a resource of n-gram analogical grids in 11 languages (2017)

    • Author(s)
      R. Fam and Y. Lepage
    • Organizer
      Proceedings of the 11th Edition of the Language Resources and Evaluation Conference (LREC 2018), Miyazaki, Japan, May 2018. (accepted, to appear)
  • [Presentation] Analogical grids and clusters: assessment with machine translation [in French] (2017)

    • Author(s)
      Y. Lepage
    • Organizer
      40 ans de traduction automatique, Grenoble, France, July 2017
    • Invited
  • [Remarks] Grants-in-Aid Kakenhi Kiban C 15K00317

    • URL

      http://lepage-lab.ips.waseda.ac.jp/ > Projects > Kakenhi 15K00317


Published: 2018-12-17  
