• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2020 Fiscal Year Research-status Report

Neural Machine Translation Based on Bilingual Resources Extracted from Multimodal Data

Research Project

Project/Area Number 19K20343
Research InstitutionKyoto University

Principal Investigator

チョ シンキ  京都大学, 情報学研究科, 特定准教授 (70784891)

Project Period (FY) 2019-04-01 – 2022-03-31
Keywords機械翻訳 / マルチモーダル
Outline of Annual Research Achievements

In FY 2020, we mainly studied the following to improve promote multimodal neural machine translation (NNMT).
1. MNMT with comparable sentences. We propose a new multimodal English-Japanese corpus with comparable sentences that are compiled from existing image captioning datasets. In addition, we supplement our comparable sentences with a smaller parallel corpus for validation and test purposes. To test the performance of this comparable sentence translation scenario, we train several baseline NMT models with our comparable corpus and evaluate their English-Japanese translation performance.
2. MNMT with word-region alignment (WRA). We propose MNMT-WRA focus on semantically relevant image regions during translation. This study advances the semantic correlation between textual and visual modalities in MNMT by integrating WRA. Experimental results on the widely used Multi30k dataset show that our model significantly improves over competitive baselines.
3. Video guided MT (VMT). In this work, we propose our VMT system by using both temporal and spatial representations in a video to cope with both the motion ambiguity problem as well as the object ambiguity problem. To obtain spatial features efficiently, we propose to use a hierarchical attention network encoder to model the spatial information from object-level to video-level. Experiments on the VATEX dataset show improvement over a strong baseline method.

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

We planed to study the following items in FY 2020 and finished them as scheduled:
1. MT with parallel sentences and image/region representation fusion.
2. NMT with comparable sentences.

Strategy for Future Research Activity

1. Improve MNMT with parallel and comparable sentences. Although we have shown that our MNMT system with parallel sentences can improve MT with image regions, the improvement is not significant; for which we plan to design novel models to address. Our MNMT system with comparable sentences are still baseline level, for which we plan to design specific MNMT models for comparable sentences.
2. Improve VMT. The current VATEX validation and test sets contain many noisy sentence pairs. We plan to improve the quality of them via post-editing. After that, we will improve our current model towards better VMT.

  • Research Products

    (18 results)

All 2021 2020 Other

All Int'l Joint Research (3 results) Journal Article (5 results) (of which Int'l Joint Research: 3 results,  Open Access: 5 results) Presentation (9 results) (of which Int'l Joint Research: 2 results) Remarks (1 results)

  • [Int'l Joint Research] 上海交通大学(中国)

    • Country Name
      CHINA
    • Counterpart Institution
      上海交通大学
  • [Int'l Joint Research] Microsoft, Hyderabad(インド)

    • Country Name
      INDIA
    • Counterpart Institution
      Microsoft, Hyderabad
  • [Int'l Joint Research] University of Georgia(米国)

    • Country Name
      U.S.A.
    • Counterpart Institution
      University of Georgia
  • [Journal Article] Preordering Encoding on Transformer for Translation2021

    • Author(s)
      Kawara Yuki、Chu Chenhui、Arase Yuki
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 29 Pages: 644~655

    • DOI

      10.1109/taslp.2020.3042001

    • Open Access
  • [Journal Article] A Survey of Multilingual Neural Machine Translation2020

    • Author(s)
      Dabre Raj、Chu Chenhui、Kunchukuttan Anoop
    • Journal Title

      ACM Computing Surveys

      Volume: 53 Pages: 1~38

    • DOI

      10.1145/3406095

    • Open Access / Int'l Joint Research
  • [Journal Article] A Survey of Domain Adaptation for Machine Translation2020

    • Author(s)
      Chu Chenhui、Wang Rui
    • Journal Title

      Journal of Information Processing

      Volume: 28 Pages: 413~426

    • DOI

      10.2197/ipsjjip.28.413

    • Open Access / Int'l Joint Research
  • [Journal Article] A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences2020

    • Author(s)
      Andrew Merritt, Chenhui Chu, Yuki Arase
    • Journal Title

      arXiv:2010.08725

      Volume: - Pages: -

    • Open Access / Int'l Joint Research
  • [Journal Article] Lexically Cohesive Neural Machine Translation with Copy Mechanism2020

    • Author(s)
      Vipul Mishra, Chenhui Chu, Yuki Arase
    • Journal Title

      arXiv:2010.05193

      Volume: - Pages: -

    • Open Access
  • [Presentation] Multilingual Neural Machine Translation (Tutorial)2020

    • Author(s)
      Raj Dabre , Chenhui Chu , Anoop Kunchukuttan
    • Organizer
      In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)
    • Int'l Joint Research
  • [Presentation] Double Attention-based Multimodal Neural Machine Translation with Semantic Image Regions2020

    • Author(s)
      Yuting Zhao, Mamoru Komachi, Tomoyuki Kajiwara, Chenhui Chu
    • Organizer
      In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020)
    • Int'l Joint Research
  • [Presentation] Video-guided Machine Translation with Spatial Hierarchical Attention Network Encoder2020

    • Author(s)
      Weiqi Gu, Haiyue Song, Chenhui Chu, Sadao Kurohashi
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] Self-supervised Dynamic Programming Encoding for Neural Machine Translation2020

    • Author(s)
      Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi, Eiichiro Sumita
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] Learning Cross-lingual Sentence Representations for Multilingual Document Classification with Token-level Reconstruction2020

    • Author(s)
      Zhuoyuan Mao, Prakhar Gupta, Chenhui Chu, Martin Jaggi, Sadao Kurohashi
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] Non-Autoregressive Translationモデルにおける事前並び替え適用手法の検討2020

    • Author(s)
      瓦 祐希, Chenhui Chu, 荒瀬 由紀
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] End-to-End Speech Translation with Cross-lingual Transfer Learning2020

    • Author(s)
      Shuichiro Shimizu, Chenhui Chu, Sheng Li, Sadao Kurohashi
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] Neural Machine Translation with Semantic Relevant Image Regions2020

    • Author(s)
      Yuting Zhao, Mamoru Komachi, Tomoyuki Kajiwara, Chenhui Chu
    • Organizer
      言語処理学会 第27回年次大会
  • [Presentation] 日本語話し言葉書き言葉変換による大学講義の日英翻訳の精度向上2020

    • Author(s)
      中尾 亮太, Chenhui Chu, 黒橋 禎夫
    • Organizer
      言語処理学会 第27回年次大会
  • [Remarks]

    • URL

      https://researchmap.jp/chu/

URL: 

Published: 2021-12-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi