研究課題/領域番号 |
19K20345
|
研究機関 | 徳島大学 |
研究代表者 |
康 シン 徳島大学, 大学院社会産業理工学研究部(理工学域), 助教 (80777350)
|
研究期間 (年度) |
2019-04-01 – 2022-03-31
|
キーワード | long-vision conversation |
研究実績の概要 |
The purpose of this research plan is to learn the long-vision conversation agents from scratch by allowing them to make sequential decisions in a multi-round communication. For these agents, the language ability, the self-concept identification ability, and the strategic conversation ability would be developed through three strategic communication games respectively. In the second year, we collected the data of conversation for learning the self-concept identification ability for the conversation agents. The data was composed of two parts, which were the conversation turns collected from Twitter and the tags that should represent the meaning of the conversation. We employed a latent dirichlet allocation (LDA) model and a rule based method for the tag extraction. These tags were treated as the self-identities for the agents to learn. The agents learned two neural networks which are a generator and a discriminator. The generator network was responsible to generate sequences of word tokens which represents its self-identity, that is, the tag given by the LDA model or the rule-based method, and the discriminator network was responsible to discriminate the identities of the other agents by analyzing the word tokens generated by those agents.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
3: やや遅れている
理由
The experiment is based on an inter-conversation between agents, which requires them to generate the reasonable and meaningful responses for each conversation turn. Because generation of the agent responses were purely based on a generative model with a generative adversarial learning framework, controlling the quality of responses was extremely difficult. The quality of agents' inter-conversations has shown significant defect compared to the quality of human conversations.
|
今後の研究の推進方策 |
In the future work, the quality of inter-conversation should be improved, while the strategic conversation ability of conversation agents should be learned. To improve the conversation quality, we would consider replacing the generator with GPT-based models, the retrieval-based models, and other slot-based approaches. To learn the strategic conversation ability of conversation agents, we will design a long-vision reward for the agents through a reinforcement learning framework. Three agents will be given two types of social identities, and they must learn to identify the similarity of each other and to avoid to be recognized as the distinct one, by following the conversation rules. Agents get rewarded separately based on their decisions made through out the reinforcement learning framework.
|
次年度使用額が生じた理由 |
理由:文房具が3月に納品となり、支払いが完了していないため。 使用計画:文房具の支払が4月に完了する予定である。
|