• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2020 Fiscal Year Research-status Report

Multilingual Knowledge Discovery in Digital Cultural Collections

Research Project

Project/Area Number 20K20135
Research InstitutionRitsumeikan University

Principal Investigator

SONG Yuting  立命館大学, 情報理工学部, 助教 (50849388)

Project Period (FY) 2020-04-01 – 2023-03-31
KeywordsWord embeddings / MT evaluation / Metadata translation / Entity recognition / Relation extraction
Outline of Annual Research Achievements

This year we focused on improving bilingual word embeddings models and collecting datasets of metadata records. First, we proposed a method to improve the accuracy of Japanese-English bilingual word embeddings. Second, we did preliminary attempts to evaluate machine translations on translating ukiyo-e metadata records from Japanese to English. In addition, in order to conduct further experiments, we collected English human translations of Japanese ukiyo-e metadata records by using a crowdsourcing platform. Moreover, the machine translations of ukiyo-e metadata records were evaluated by both Japanese and English native speakers through a crowdsourcing platform (Lancers). Overall, the project has been smoothly conducted step by step according to the research proposal.

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

The project progress is going smoothly as planned. We have proposed a method to improve Japanese-English word embedding. Besides, we have evaluated the performance of online machine translation systems (i.e., Google Translator, Microsoft Translate, DeepL Translator) on translating Japanese ukiyo-e metadata to English. In addition, we have collected Japanese-English metadata records for future research. What's more, we have investigated the current neural network based models of entity and relation extraction, which can be applied to the dataset of ukiyo-e metadata in the next year.

Strategy for Future Research Activity

For future work, we will focus on developing neural network based methods for learning multilingual representations of metadata and extracting named entities from Japanese and English textual metadata in cultural collections. We will also manually annotated named entities in metadata records, which are essential for training and evaluating entity extraction models.

Causes of Carryover

We will use the budget to purchase hardware such as GPUs to be able to conduct research based on deep neural networks. Besides, some funds will be spent on crowdsourcing jobs for data annotations. Finally, we will attend the conferences to disseminate research results.

  • Research Products

    (5 results)

All 2021 2020

All Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (4 results) (of which Int'l Joint Research: 2 results)

  • [Journal Article] Learning Japanese-English Bilingual Word Embeddings by Using Language Specificity2020

    • Author(s)
      Song Yuting、Batjargal Biligsaikhan、Maeda Akira
    • Journal Title

      International Journal of Asian Language Processing

      Volume: 3 Pages: 14 pages

    • DOI

      10.1142/S2717554520500149

    • Peer Reviewed
  • [Presentation] Linking Ukiyo-e Records across Languages: An Application of Cross-Language Record Linkage Techniques to Digital Cultural Collections2021

    • Author(s)
      Yuting Song, Biligsaikhan Batjargal, and Akira Maeda
    • Organizer
      The 5th Anniversary International Symposium of Asia-Japan Research at Ritsumeikan University
  • [Presentation] Joint Entity and Relation Extraction from Clinical Records Using Pre-trained Language Model2021

    • Author(s)
      FANG Xintao, SONG Yuting, Maeda Akira
    • Organizer
      第13回データ工学と情報マネジメントに関するフォーラム(DEIM2021)
  • [Presentation] A Preliminary Attempt to Evaluate Machine Translations of Ukiyo-e Metadata Records2020

    • Author(s)
      Yuting Song, Biligsaikhan Batjargal, and Akira Maeda
    • Organizer
      The 22nd International Conference on Asia-Pacific Digital Libraries
    • Int'l Joint Research
  • [Presentation] Finding Identical Ukiyo-e Prints across Databases in Japanese, English and Dutch2020

    • Author(s)
      Yuting Song, Biligsaikhan Batjargal, and Akira Maeda
    • Organizer
      Digital Humanities 2020
    • Int'l Joint Research

URL: 

Published: 2021-12-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi