研究課題/領域番号 |
23K16864
|
研究種目 |
若手研究
|
配分区分 | 基金 |
審査区分 |
小区分60050:ソフトウェア関連
|
研究機関 | 九州大学 |
研究代表者 |
王 棟 九州大学, システム情報科学研究院, 助教 (30965075)
|
研究期間 (年度) |
2023-04-01 – 2025-03-31
|
研究課題ステータス |
中途終了 (2023年度)
|
配分額 *注記 |
4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円)
2025年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
2024年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
2023年度: 1,690千円 (直接経費: 1,300千円、間接経費: 390千円)
|
キーワード | Code Review / Information Need / Software Engineering / Information Needs / Repository Mining |
研究開始時の研究の概要 |
To meet the developers’ information needs and facilitate an effective code review, the applicant proposes a framework of an intelligent and non-intrusive notification mechanism to automatically recommend developers the needed information seamlessly that they should be aware of instantly and dynamically during the code review process.
|
研究実績の概要 |
In FY2023, I established the research environment and began mining information from social coding platforms such as GitHub and OpenStack. As part of this, I have been investigating developers' activities across various development channels, including code review channels, GitHub Discussion and GitHub Issue. We have now collected data from over 10 million GitHub repositories and are ready for the next stage. Here is a summary of achieved publications.
-Information need of continuous integration. I have worked with international collaborators on an empirical study to understand the software waste resulting from the misuse of recheck command on continuous integration failures. -Information spread across various channels. Specifically, I conducted a study investigating developer activities on GitHub Discussion. The results suggested that, in addition to issues, many code reviews were mentioned or converted in the GitHub Discussion. -Other developer activities. Meanwhile, I focus on the developer communication through issues (i.e., use of visuals to report bugs) and code comments (i.e., self-admitted technique debt)
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
2: おおむね順調に進展している
理由
So far, I have collected a large amount of data from open-source software ecosystems. This has resulted in several publications in top international journals and conferences.
These publications better complements the gap lying in information needs by developers during the code review process (such as continuous integration information, cross-channel knowledge).
Large language models have demonstrated impressive performance in a variety of recommendation tasks. This could further prove the feasible of the automated information recommendation and accelerate research progress.
|
今後の研究の推進方策 |
The next step is to further mine the developers' information needs from other communication channels, particular issues, in order to establish a relationship between code review and issues. Inspired by the state-of-the-art Retrieval-Augmented Generation(RAG) technology, I plan to construct a high-quality knowledge graph that is specifically devised for the code review activities, based on the information/knowledge across various communications. The knowledge graph would be the premise for employing large language models to fulfill the automation of information recommendation for code reviews.
|