2012 Fiscal Year Final Research Report

Theory of Reinforcement Learning and Algorithms of Route Choice in Transportation Networks

Research Project

Project/Area Number	22360201
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Civil engineering project/Traffic engineering
Research Institution	Tohoku University
Principal Investigator	MIYAGI Toshihiko 東北大学, 大学院・情報科学研究科, 教授 (20092968)
Co-Investigator(Kenkyū-buntansha)	FUKUMOTO Jyunya 東北大学, 大学院・情報科学研究科, 准教授 (30323447)
Project Period (FY)	2010 – 2012
Keywords	繰り返しゲーム / 強化学習 / 交通行動理論 / 適応学習アルゴリズム / Nash 均衡 / 利用者均衡確率近似理論 / 動的離散的選択モデル
Research Abstract	This research shows that an individual traveler in transportation networks is rigorously modeled as an adaptive learning agent who receives travel information through day-to-day experience and makes his decision so as to reinforce his action depending the realized payoffs. An adaptive learning algorithm consistent with the theory is proposed and proved that it leads the system to a Nash equilibrium with probability one. The proposed algorithms have tested numerically by using example networks with various ill-defined link cost functions and examined a rapid convergence of the algorithms. In addition, we have proposed an estimation method for the structure parameters included in the route choice model. The application to the data of theday-to-day route choice obtained by the indoor experiments was satisfactory.

Research Products
(16 results)

All 2013 2012 2011

All Journal Article (7 results) Presentation (9 results)

[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013
- Author(s)
  Miyagi, T., G.C. Peque, Jr and J. Fukumoto
- Journal Title
  
  the 20th International Symposium on Traffic Theory and Transportation
[Journal Article] 経路選択行動に関する室内実験2013
- Author(s)
  池田愛,宮城俊彦
- Journal Title
  
  交通工学
  
  Volume: Vo1.48,No.2 Pages: 53-62
- URL
  http://www.jste.or.jp/Books/kikan-cont-4802.html
[Journal Article] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2012
- Author(s)
  Kato, H. and T. Miyagi
- Journal Title
  
  Journal of China-US Business Review
  
  Volume: Vol.11, N0.7
[Journal Article] Informed-user algorithms that converges to Nash equilibrium in traffic games2012
- Author(s)
  Miyagi, T., and G.C. Peque,Jr.
- Journal Title
  
  Procedia-Social and Behavioral Sciences
  
  Volume: 54 Pages: 438-449
[Journal Article] DSGEモデルによる公共投資の効果分析とモデルの時変パラメータ推定2012
- Author(s)
  加藤裕人,宮城俊彦
- Journal Title
  
  土木学会論文集D3
  
  Volume: Vol.68,N0.5 Pages: I_121-I_130
- URL
  htts://www.jsce.or.jp/committee/lp/monograph/file/ipv29-2012.pdf
[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011
- Author(s)
  宮城俊彦・遠藤雅人
- Journal Title
  
  土木学会論文集D3
  
  Volume: Vol.67,No.5 Pages: I_541-I_552
- URL
  htts://www.jsce.or.jp/committee/lp/monograph/file/ipv28-2011.pdf
[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011
- Author(s)
  Miyagi,T.
- Journal Title
  
  Urban Transport
  
  Volume: Vol.17 Pages: 43-52
[Presentation] Adaptive Learning Algorithms for Traffic Games with Naive Users2013
- Author(s)
  Miyagi, T., G.C. Peque, Jr and J. Fukumoto
- Organizer
  the 20th International Symposium on Traffic Theory and Transportation
- Place of Presentation
  Noordwijk, the Netherlands
- Year and Date
  2013-07-17
[Presentation] 経路選択行動モデルの構造パラメータ推定について2012
- Author(s)
  池田愛・宮城俊彦
- Organizer
  土木計画学研究・講演集
- Place of Presentation
  埼玉大学
- Year and Date
  2012-11-03
[Presentation] 混雑ゲームにおけるNon-atomicモデルの数値計算特性について2012
- Author(s)
  張洋・G.C.Peque,Jr.・宮城俊彦
- Organizer
  土木計画学研究・講演集土木学会
- Place of Presentation
  埼玉大学
- Year and Date
  2012-11-03
[Presentation] 多数エージェントの動学的離散的選択行動モデルに対する定常均衡アプローチについて2012
- Author(s)
  宮城俊彦
- Organizer
  日本地域学会学術発表論文集49,日本地域学会
- Place of Presentation
  立正大学.
- Year and Date
  2012-10-07
[Presentation] Informed-user algorithms that converges to Nash equilibrium in traffic games2012
- Author(s)
  Miyagi, T., and G.C. Peque, Jr.
- Organizer
  the 15th meeting of the Euro Working Group onTransportation
- Place of Presentation
  CiteDescartes, France.
- Year and Date
  2012-09-11
[Presentation] Traffic Games with Incomplete Travel Information2012
- Author(s)
  Miyagi, T., and G.C. Peque,Jr.
- Organizer
  土木計画学研究講演集、土木学会
- Place of Presentation
  京都大学
- Year and Date
  2012-06-02
[Presentation] 動的経路選択行動の室内実験による検証と分析2011
- Author(s)
  池田愛・宮城俊彦
- Organizer
  土木計画学研究・講演集土木学会
- Place of Presentation
  岐阜大学
- Year and Date
  2011-11-27
[Presentation] 動学的確率的一般均衡モデルの時変パラメータ推定による財政政策の計効果分析2011
- Author(s)
  加藤裕人・宮城俊彦
- Organizer
  土木計画学研究・講演集,土木学会
- Place of Presentation
  岐阜大学
- Year and Date
  2011-11-27
[Presentation] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2011
- Author(s)
  Kato, H. and Miyagi,T.
- Organizer
  the 22nd Pacific Regional Science Conference, RSAI
- Place of Presentation
  Seoul KyoYuk MunHwa HoeKwan, Seoul, Korea
- Year and Date
  2011-07-05

2012 Fiscal Year Final Research Report

Theory of Reinforcement Learning and Algorithms of Route Choice in Transportation Networks

Principal Investigator

MIYAGI Toshihiko 東北大学, 大学院・情報科学研究科, 教授 (20092968)

Research Products

[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013

Author(s)

Journal Title

[Journal Article] 経路選択行動に関する室内実験2013

Author(s)

Journal Title

URL

[Journal Article] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2012

Author(s)

Journal Title

[Journal Article] Informed-user algorithms that converges to Nash equilibrium in traffic games2012

Author(s)

Journal Title

[Journal Article] DSGEモデルによる公共投資の効果分析とモデルの時変パラメータ推定2012

Author(s)

Journal Title

URL

[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011

Author(s)

Journal Title

URL

[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011

Author(s)

Journal Title

[Presentation] Adaptive Learning Algorithms for Traffic Games with Naive Users2013

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 経路選択行動モデルの構造パラメータ推定について2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 混雑ゲームにおけるNon-atomicモデルの数値計算特性について2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 多数エージェントの動学的離散的選択行動モデルに対する定常均衡アプローチについて2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Informed-user algorithms that converges to Nash equilibrium in traffic games2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Traffic Games with Incomplete Travel Information2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 動的経路選択行動の室内実験による検証と分析2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 動学的確率的一般均衡モデルの時変パラメータ推定による財政政策の計効果分析2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2011

Author(s)

Organizer

Place of Presentation

Year and Date