2007 Fiscal Year Final Research Report Summary
The Study on Development and Applicability of Knowledge-Based Learning Algorithm for Route Guidance
Project/Area Number |
18560519
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
交通工学・国土計画
|
Research Institution | Tohoku University (2007) Gifu University (2006) |
Principal Investigator |
MIYAGI Toshihiko Tohoku University, Graduate School of information Sciences, Professor (20092968)
|
Project Period (FY) |
2006 – 2007
|
Keywords | Wardrop Equilibrim / Hannan Consistency / Nash Equilibrium / Approchability Theorm / Reinforcement Learning / Adaptive Learning / Repeated Games / Markov Decision Process |
Research Abstract |
This study aims a fundamental study of the learning algorithm to develop a distributed navigation system. The study was conducted through both theoretical and numerical analysis, and obtained significant results about a characteristic of the traffic equilibrium to be realized when all drivers used this algorithm. The basic assumption of our approach is that each individual driver chooses his or her route based on travel information obtained by one's daily experience. Travel information, far instance a travel time to a destination from an origin of a trip, however, can fluctuate stochastically to be affected by the choice of other drivers. Even with such a situation, each driver can reach a certain stable state if he chooses a route according to the regret matching rule. This conclusion is fir different from the conventional concept of Wardrop equilibrium where drivers' perfect information is assumed. This conclusion implies that a carefully designed information acquisition system allows the transportation system being stable and drivers to acquaint the best route. The core engine that enables this maybe called the intelligent driving algorithm. The intelligent driving algorithm consists of a combination of Markov decision process and the regret matching algorithm. The efficiency of numerical calculation depends on the theory of approximate dynamic programming and stochastic approximation algorithm. We tested the algorithm by applying it to a simple network, but, we assumed possibly realist is link cost functions. For all cases, we obtained successful results; however, the rate of convergence was very slow. This study suggests the possibility of the distributed vehicle navigation system, in which an individual driver collects travel information by self with using GPS and is automatically guided by machine learning to a better route.
|