Project/Area Number |
23K13024
|
Research Category |
Grant-in-Aid for Early-Career Scientists
|
Allocation Type | Multi-year Fund |
Review Section |
Basic Section 12040:Applied mathematics and statistics-related
|
Research Institution | Institute of Physical and Chemical Research |
Principal Investigator |
Wolfer Geoffrey 国立研究開発法人理化学研究所, 革新知能統合研究センター, 基礎科学特別研究員 (10965784)
|
Project Period (FY) |
2023-04-01 – 2026-03-31
|
Project Status |
Granted (Fiscal Year 2023)
|
Budget Amount *help |
¥4,420,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥1,020,000)
Fiscal Year 2025: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2024: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000)
Fiscal Year 2023: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
|
Keywords | Information geometry / Markov chains / Identity testing / Markov Chain Monte Carlo / Data science |
Outline of Research at the Start |
This project has three objectives: (1) to deepen our theoretical understanding of the information geometry of Markov models (2) to develop new geometrical tools and methods for the data-science community (3) to demonstrate their applicability by addressing modern inference problems in Markov chains.
|
Outline of Annual Research Achievements |
The primary focus of my research in FY23 was demonstrating how geometric techniques can be applied to modern problems in Markov chains within the field of data science.
In the theory we developed, allowable mappings between Markov chains all possess an operational (data-processing) interpretation, and as a result, the applicability of our theory follows by design. This year, we showed how our Markov embeddings aid in developing reduction techniques for modern Markov chain inference problems. Specifically, we devised a method to reduce identity testing of reversible Markov chains to that of symmetric ones. Our method recovers state-of-the-art sample complexity in most regimes and applies beyond the immediate problem, including tolerant testing settings.
In parallel work, we observed that one could recover many of the established “reversiblizations” of (non-reversible) Markov chains by regarding them as a geometric projection of the chain onto the reversible set. In particular, although it was previously understood that the Metropolis-Hastings algorithm is a form of reversiblization corresponding to a specific divergence, we demonstrated how exploring different notions of divergences can give rise to other widely used sampling algorithms. Notably, we recover the Markov Chain Monte Carlo (MCMC) algorithm based on Barker dynamics, which has recently gained popularity. This is important because it can help practioners develop new MCMC algorithms using a similar approach.
|
Current Status of Research Progress |
Current Status of Research Progress
1: Research has progressed more than it was originally planned.
Reason
The progress on the applicative side of the project is already considerable.
In terms of output, we successfully secured immediate acceptance of both our works at our top choices of venues: one oral presentation at the international conference on Geometric Science of Information (GSI'2023) and one publication in the IEEE Transactions on Information Theory journal, the reference in the field of information theory. These achievements serve as significant milestones, demonstrating the applicability of our framework to modern problems in probability and statistics.
Furthermore, our research has ignited interest within the statistical physics community, as we were invited to write an academic survey on the topic of information geometry of Markov chains.
|
Strategy for Future Research Activity |
Given the substantial progress in the more applicative aspects of the project in FY23, I will recenter the emphasis on the theoretical facets of my project in FY24.
Specifically, I will make progress on geometric classifications of families of transition matrices, at least for spaces of size up to 3. This will be a significant stepping stone towards a Cencov-type theorem in the context of Markov chains.
In parallel, I will continue to explore connection between information geometry of Markov chains and core questions in statistics such as mixing properties of Markov chains, Markov Chain Monte Carlo algorithms or hypothesis testing in order to further demonstrate the value of my approach.
|