Project/Area Number |
19H04215
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Review Section |
Basic Section 62020:Web informatics and service informatics-related
|
Research Institution | University of Tsukuba |
Principal Investigator |
于 海涛 筑波大学, 図書館情報メディア系, 准教授 (30751052)
|
Co-Investigator(Kenkyū-buntansha) |
吉川 正俊 大阪成蹊大学, データサイエンス学部, 教授 (30182736)
康 シン 徳島大学, 大学院社会産業理工学研究部(理工学域), 助教 (80777350)
Adam Jatowt 京都大学, 情報学研究科, 特定准教授 (00415861)
|
Project Period (FY) |
2019-04-01 – 2024-03-31
|
Project Status |
Granted (Fiscal Year 2023)
|
Budget Amount *help |
¥15,860,000 (Direct Cost: ¥12,200,000、Indirect Cost: ¥3,660,000)
Fiscal Year 2023: ¥2,990,000 (Direct Cost: ¥2,300,000、Indirect Cost: ¥690,000)
Fiscal Year 2022: ¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)
Fiscal Year 2021: ¥2,340,000 (Direct Cost: ¥1,800,000、Indirect Cost: ¥540,000)
Fiscal Year 2020: ¥3,640,000 (Direct Cost: ¥2,800,000、Indirect Cost: ¥840,000)
Fiscal Year 2019: ¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000)
|
Keywords | privacy-preserving / federated learning / semantic matching / online learning-to-rank / interactive search / on-device learning / Metric Optimization / Policy Gradient / Differential Privacy / Privacy-preserving IR / Federated Optimization / On-device Learning / policy gradient / learning to rank / probabilistic regression / Semantic Matching / Learning-To-Rank / Metric Optimisation / Privacy-preserving / Semantic matching / Federated learning / On-device intelligence |
Outline of Research at the Start |
This project aims to initiate research into privacy-preserving information retrieval (IR) and develop a scalable privacy-preserving IR system. The proposed IR system builds upon a federated paradigm which fuses on-device machine learning, federated optimization and semantic matching.
|
Outline of Annual Research Achievements |
This year our main task is to develop the federated information retrieval model based on a large-scale dataset. To this end, we firstly view information retrieval as an interactive process between users and the search engine system rather than an independent ranking per query. Secondly, we explore how to effectively integrate online learning-to-rank and federated learning. Specifically, online learning-to-rank enables us to cope with the aforementioned interactive process. Federated learning enables us to deal with the privacy issue by learning the ranking model in an on-device manner. Based on a series of experiments on several benchmark ranking datasets, our experimental results show that this direction is applicable.
|
Current Status of Research Progress |
Current Status of Research Progress
3: Progress in research has been slightly delayed.
Reason
Due to the impact of COVID-19, the number of students is limited in order to guarantee a healthy working environment when using the research room. Sometimes the research work has to be conducted either online or at home. As a result, the efficiency is impacted to some extent.
|
Strategy for Future Research Activity |
The major research objective of the next fiscal year is to develop the federated information retrieval model. Since the large-scale language models have achieved a revolutionary effect on many fields, including but not limited to, natural language processing and information retrieval. For the future work, we will explore whether it is possible to make full use of the newly proposed large-scale language models to enhance our research. Finally, the COVID-19 pandemic has passed away. We plan to have more on-site discussions and attend more top international conferences in order to better conduct the planned research.
|