研究課題/領域番号 |
20K20135
|
研究機関 | 立命館大学 |
研究代表者 |
SONG Yuting 立命館大学, 情報理工学部, 助教 (50849388)
|
研究期間 (年度) |
2020-04-01 – 2023-03-31
|
キーワード | Word embeddings / MT evaluation / Metadata translation / Entity recognition / Relation extraction |
研究実績の概要 |
This year we focused on improving bilingual word embeddings models and collecting datasets of metadata records. First, we proposed a method to improve the accuracy of Japanese-English bilingual word embeddings. Second, we did preliminary attempts to evaluate machine translations on translating ukiyo-e metadata records from Japanese to English. In addition, in order to conduct further experiments, we collected English human translations of Japanese ukiyo-e metadata records by using a crowdsourcing platform. Moreover, the machine translations of ukiyo-e metadata records were evaluated by both Japanese and English native speakers through a crowdsourcing platform (Lancers). Overall, the project has been smoothly conducted step by step according to the research proposal.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
2: おおむね順調に進展している
理由
The project progress is going smoothly as planned. We have proposed a method to improve Japanese-English word embedding. Besides, we have evaluated the performance of online machine translation systems (i.e., Google Translator, Microsoft Translate, DeepL Translator) on translating Japanese ukiyo-e metadata to English. In addition, we have collected Japanese-English metadata records for future research. What's more, we have investigated the current neural network based models of entity and relation extraction, which can be applied to the dataset of ukiyo-e metadata in the next year.
|
今後の研究の推進方策 |
For future work, we will focus on developing neural network based methods for learning multilingual representations of metadata and extracting named entities from Japanese and English textual metadata in cultural collections. We will also manually annotated named entities in metadata records, which are essential for training and evaluating entity extraction models.
|
次年度使用額が生じた理由 |
We will use the budget to purchase hardware such as GPUs to be able to conduct research based on deep neural networks. Besides, some funds will be spent on crowdsourcing jobs for data annotations. Finally, we will attend the conferences to disseminate research results.
|