• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2019 Fiscal Year Research-status Report

Development of a System for Collecting Context Data for Large-Scale Inverse Reinforcement Learning

Research Project

Project/Area Number 17K00295
Research InstitutionHokkaido University

Principal Investigator

RZEPKA Rafal  北海道大学, 情報科学研究院, 助教 (80396316)

Project Period (FY) 2017-04-01 – 2021-03-31
Keywordsknowledge completion / natural language / language models / context processing
Outline of Annual Research Achievements

I proceeded with further development of a system to acquire contextual common sense knowledge. I have develop a module that automatically completes given sentence to add contextual information to predict its changes, and it is possible to input any situation. The task is performed according to not only to the standard semantic categories like actors or places but the system is also able to predict possible causes, effects, problems and costs.
I conducted experiments to compare automatic and human evaluations to measure the reliability of the system. Ethically ambiguous input sentences were used but the algorithm achieved rather low agreement with human evaluators. More natural sentences (I used self-generated ones to avoid bias) and more evaluators will be needed.
The context retrieval and processing were used in other tasks involving affective processing, motivation-oriented dialog, metaphor understanding, persuasiveness estimation. Trials with event completion were continued and event chain generation was extended to four related events which was not achieved by previous works.
A Slack-based chatbot for demonstrating context processing was developed, but still the retrieved knowledge base size is not big enough for open dialog. I tried utilizing BERT model language directly, but one sentence input needs several queries which takes too much time. I used help from researchers and students from Queensland University of Technology to acquire tools for more sophisticated language processing in English language, but the cooperation was halted by the Coronavirus outbreak.

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

The algorithms creation went rather smoothly but the context-related data for Japanese is still relatively small. I have automatically translated almost 100.000 short stories corpus from English to alleviate this problem after hopefully unsupervised post-editing process. Toolkit for English is still not fully created but a new multilingual NLP toolset Stanza was published and it should put processing other languages back on track. Also more language models are now available which will accelerate the comparison and sharing knowledge between languages.
As mentioned above, new insights about context help in various tasks, combined together with standard methods like deep-learning, led to improving existing systems and publishing several papers, therefore I assess the output as rather satisfactory.

Strategy for Future Research Activity

As the methods are tested, the last year of the project will concentrate on three topics. First will be further accumulation of knowledge, also in languages other than Japanese, second will be implementing the knowledge into dialog, third will be experimentation. More empirical prove showing that the knowledge acquired accurately reflects the real world thanks to contextual processing is needed. In addition, an evaluation experiment will be conducted with a scenario-type dialogue system to confirm the authenticity of contextualized output. Also a new type of evaluation experiment will be conducted in which the naturalness of the acquired knowledge can be confirmed while conversing and giving each subject slightly different information. Finally, I plan to release the generated contextual data and publish the results domestically and internationally.

Remarks

As the collaboration was prematurely ended due to the COVID-19 outbreak, there is not any concrete material yet.

  • Research Products

    (13 results)

All 2020 2019 Other

All Int'l Joint Research (1 results) Journal Article (1 results) Presentation (11 results) (of which Int'l Joint Research: 7 results,  Invited: 2 results)

  • [Int'l Joint Research] Queensland University of Technology(オーストラリア)

    • Country Name
      AUSTRALIA
    • Counterpart Institution
      Queensland University of Technology
  • [Journal Article] Stepwise Noise Elimination for Better Motivational and Advisory Texts Classification2020

    • Author(s)
      Patrycja Swieczkowska, Rafal Rzepka, and Kenji Araki
    • Journal Title

      J. Adv. Comput. Intell. Intell. Inform.

      Volume: 24 Pages: 156-168

    • DOI

      https://doi.org/10.20965/jaciii.2020.p0156

  • [Presentation] Bacteria Lingualis on BERToids - Concept Expansion for Cognitive Architectures2020

    • Author(s)
      Rafal Rzepka, Sho Takishita and Kenji Araki
    • Organizer
      Technical Report of JSAI Special Interest Group for Artificial General Intelligence, SIG-AGI
  • [Presentation] Convolutional Neural Network for Chinese Sentiment Analysis Considering Chinese Slang Lexicon and Emoticons2019

    • Author(s)
      Da Li, Rafal Rzepka, Michal Ptaszynski and Kenji Araki
    • Organizer
      Proceedings The 33th Annual Conference of the Japanese Society for Artificial Intelligence
  • [Presentation] Comparing Conceptual Metaphor Theory-Related Features in Searching for Figurative Expressions in Japanese Literary Texts2019

    • Author(s)
      Mateusz Babieno, Rafal Rzepka and Kenji Araki
    • Organizer
      IJCAI Workshop on Language Sense on Computer
    • Int'l Joint Research
  • [Presentation] A Convolutional Neural Network For Ranking Advice Quality In Texts For A Motivational Dialogue System2019

    • Author(s)
      Patrycja Swieczkowska, Rafal Rzepka and Kenji Araki
    • Organizer
      IJCAI Workshop on Linguistic and Cognitive Approaches To Dialog Agents Workshop
    • Int'l Joint Research
  • [Presentation] Debate Outcome Prediction using Automatic Persuasiveness Evaluation and Counterargument Relations2019

    • Author(s)
      Daiki Shirafuji, Rafal Rzepka and Kenji Araki
    • Organizer
      IJCAI Workshop on Linguistic and Cognitive Approaches To Dialog Agents Workshop
    • Int'l Joint Research
  • [Presentation] Emoji-Aware Attention-based Bi-directional GRU Network Model for Chinese Sentiment Analysis2019

    • Author(s)
      Da Li, Rafal Rzepka, Michal Ptaszynski and Kenji Araki
    • Organizer
      IJCAI Workshop on Linguistic and Cognitive Approaches To Dialog Agents Workshop
    • Int'l Joint Research
  • [Presentation] Implicit Knowledge Completion Using Relevance Calculation of Distributed Word Representations2019

    • Author(s)
      Sho Takishita, Rafal Rzepka and Kenji Araki
    • Organizer
      IJCAI Workshop on Bridging The Gap Between Human and Automated Reasoning
    • Int'l Joint Research
  • [Presentation] Evaluating Classification Methods for Recognizing Figurative Expressions Within Japanese Literary Texts2019

    • Author(s)
      Mateusz Babieno, Rafal Rzepka and Kenji Araki
    • Organizer
      Proceedings of the 16th International Conference of the Pacific Association for Computational Linguistics, PACLING 2019
    • Int'l Joint Research
  • [Presentation] Creating Reverse Dictionary of English Idiomatic Expressions by Mapping Word Embeddings to Singular Vectors2019

    • Author(s)
      Xiaodong Liu, Rafal Rzepka, Kenji Araki
    • Organizer
      Technical Report of JSAI Special Interest Group for Interactive Information Access and Visualization, SIG-AM
  • [Presentation] Humanizing Machines by Simulating Empathy and Understanding Human Needs"2019

    • Author(s)
      Rafal Rzepka
    • Organizer
      IJCAI Humanizing AI Workshop
    • Int'l Joint Research / Invited
  • [Presentation] AI Ethics for Developers, Users and Legislators - Problems and Solutions2019

    • Author(s)
      Rafal Rzepka
    • Organizer
      Nitobe School, Hokkaido University
    • Invited

URL: 

Published: 2021-01-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi