Theory of Reinforcement Learning and Algorithms of Route Choice in Transportation Networks

Research Project

Project/Area Number	22360201
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Civil engineering project/Traffic engineering
Research Institution	Tohoku University
Principal Investigator	MIYAGI Toshihiko 東北大学, 大学院・情報科学研究科, 教授 (20092968)
Co-Investigator(Kenkyū-buntansha)	FUKUMOTO Jyunya 東北大学, 大学院・情報科学研究科, 准教授 (30323447)
Project Period (FY)	2010 – 2012
Project Status	Completed (Fiscal Year 2012)
Budget Amount *help	¥8,190,000 (Direct Cost: ¥6,300,000、Indirect Cost: ¥1,890,000) Fiscal Year 2012: ¥2,340,000 (Direct Cost: ¥1,800,000、Indirect Cost: ¥540,000) Fiscal Year 2011: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000) Fiscal Year 2010: ¥3,640,000 (Direct Cost: ¥2,800,000、Indirect Cost: ¥840,000)
Keywords	繰り返しゲーム / 強化学習 / 交通行動理論 / 適応学習アルゴリズム / Nash 均衡 / 利用者均衡確率近似理論 / 動的離散的選択モデル / Nash均衡 / 利用者均衡 / 確率近似理論 / ゲーム理論 / 強化学習理論 / 実験経済学 / 経路選択行動 / リグレット基準 / ネットワーク均衡 / 離散的交通行動理論 / ロジット均衡
Research Abstract	This research shows that an individual traveler in transportation networks is rigorously modeled as an adaptive learning agent who receives travel information through day-to-day experience and makes his decision so as to reinforce his action depending the realized payoffs. An adaptive learning algorithm consistent with the theory is proposed and proved that it leads the system to a Nash equilibrium with probability one. The proposed algorithms have tested numerically by using example networks with various ill-defined link cost functions and examined a rapid convergence of the algorithms. In addition, we have proposed an estimation method for the structure parameters included in the route choice model. The application to the data of theday-to-day route choice obtained by the indoor experiments was satisfactory.

Report

(4 results)

2012 Annual Research Report Final Research Report ( PDF )
2011 Annual Research Report
2010 Annual Research Report

Research Products
(37 results)

All 2013 2012 2011 2010 Other

All Journal Article (15 results) (of which Peer Reviewed: 8 results) Presentation (22 results)

[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013
- Author(s)
  Miyagi, T., G.C. Peque, Jr and J. Fukumoto
- Journal Title
  
  the 20th International Symposium on Traffic Theory and Transportation
- Related Report
  2012 Final Research Report
[Journal Article] 経路選択行動に関する室内実験2013
- Author(s)
  池田愛,宮城俊彦
- Journal Title
  
  交通工学
  
  Volume: Vo1.48,No.2 Pages: 53-62
- URL
  http://www.jste.or.jp/Books/kikan-cont-4802.html
- Related Report
  2012 Final Research Report
[Journal Article] 経路選択行動に関する室内実験2013
- Author(s)
  池田　愛
- Journal Title
  
  交通工学
  
  Volume: 48巻 Pages: 53-62
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013
- Author(s)
  Miyagi, Toshihiko
- Journal Title
  
  Proc. of the 20th ISTTT
  
  Volume: 20
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2012
- Author(s)
  Kato, H. and T. Miyagi
- Journal Title
  
  Journal of China-US Business Review
  
  Volume: Vol.11, N0.7
- Related Report
  2012 Final Research Report
[Journal Article] Informed-user algorithms that converges to Nash equilibrium in traffic games2012
- Author(s)
  Miyagi, T., and G.C. Peque,Jr.
- Journal Title
  
  Procedia-Social and Behavioral Sciences
  
  Volume: 54 Pages: 438-449
- Related Report
  2012 Final Research Report
[Journal Article] DSGEモデルによる公共投資の効果分析とモデルの時変パラメータ推定2012
- Author(s)
  加藤裕人,宮城俊彦
- Journal Title
  
  土木学会論文集D3
  
  Volume: Vol.68,N0.5
- NAID
  130004559667
- Related Report
  2012 Final Research Report
[Journal Article] Informed-user algorithms that converges to Nash equilibrium2012
- Author(s)
  Miyagi, Toshihiko
- Journal Title
  
  Procedia-Social and Behavioral Sciences
  
  Volume: Vol.54 Pages: 438-449
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011
- Author(s)
  宮城俊彦・遠藤雅人
- Journal Title
  
  土木学会論文集D3
  
  Volume: Vol.67,No.5
- Related Report
  2012 Final Research Report
[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011
- Author(s)
  Miyagi,T.
- Journal Title
  
  Urban Transport
  
  Volume: Vol.17 Pages: 43-52
- Related Report
  2012 Final Research Report
[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011
- Author(s)
  Miyagi, T.
- Journal Title
  
  Urban Transport XVII
  
  Volume: 17巻 Pages: 43-52
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011
- Author(s)
  宮城俊彦, 遠藤雅人
- Journal Title
  
  土木学会論文集D3
  
  Volume: Vol.67, No.5 Pages: 1541-1552
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] テレワーク通勤への転換行動分析のためのリグレット・マッチング・モデル2010
- Author(s)
  宮城俊彦・石黒雅彦
- Journal Title
  
  地域学研究
  
  Volume: 39 Pages: 911-926
- NAID
  130000263139
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Regret Matchingを用いた経路選択行動分析:不完全交通情報を仮定した粒子モデル2010
- Author(s)
  宮城俊彦・石黒雅彦
- Journal Title
  
  土木計画学研究・論文集
  
  Volume: 27 Pages: 531-538
- NAID
  130006275141
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] 社会資本整備を内包した経済成長モデルのパラメータ推定2010
- Author(s)
  加藤裕人・宮城俊彦・仲原由布子
- Journal Title
  
  土木計画学研究・論文集
  
  Volume: 27 Pages: 41-48
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Presentation] Adaptive Learning Algorithms for Traffic Games with Naive Users2013
- Author(s)
  Miyagi, T., G.C. Peque, Jr and J. Fukumoto
- Organizer
  the 20th International Symposium on Traffic Theory and Transportation
- Place of Presentation
  Noordwijk, the Netherlands
- Year and Date
  2013-07-17
- Related Report
  2012 Final Research Report
[Presentation] 経路選択行動モデルの構造パラメータ推定について2012
- Author(s)
  池田愛・宮城俊彦
- Organizer
  土木計画学研究・講演集
- Place of Presentation
  埼玉大学
- Year and Date
  2012-11-03
- Related Report
  2012 Final Research Report
[Presentation] 混雑ゲームにおけるNon-atomicモデルの数値計算特性について2012
- Author(s)
  張洋・G.C.Peque,Jr.・宮城俊彦
- Organizer
  土木計画学研究・講演集土木学会
- Place of Presentation
  埼玉大学
- Year and Date
  2012-11-03
- Related Report
  2012 Final Research Report
[Presentation] 多数エージェントの動学的離散的選択行動モデルに対する定常均衡アプローチについて2012
- Author(s)
  宮城俊彦
- Organizer
  日本地域学会学術発表論文集49,日本地域学会
- Place of Presentation
  立正大学.
- Year and Date
  2012-10-07
- Related Report
  2012 Final Research Report
[Presentation] Informed-user algorithms that converges to Nash equilibrium in traffic games2012
- Author(s)
  Miyagi, T., and G.C. Peque, Jr.
- Organizer
  the 15th meeting of the Euro Working Group onTransportation
- Place of Presentation
  CiteDescartes, France.
- Year and Date
  2012-09-11
- Related Report
  2012 Final Research Report
[Presentation] Traffic Games with Incomplete Travel Information2012
- Author(s)
  Miyagi, T., and G.C. Peque,Jr.
- Organizer
  土木計画学研究講演集、土木学会
- Place of Presentation
  京都大学
- Year and Date
  2012-06-02
- Related Report
  2012 Final Research Report
[Presentation] 動的経路選択行動の室内実験による検証と分析2011
- Author(s)
  池田愛・宮城俊彦
- Organizer
  土木計画学研究・講演集土木学会
- Place of Presentation
  岐阜大学
- Year and Date
  2011-11-27
- Related Report
  2012 Final Research Report
[Presentation] 動学的確率的一般均衡モデルの時変パラメータ推定による財政政策の計効果分析2011
- Author(s)
  加藤裕人・宮城俊彦
- Organizer
  土木計画学研究・講演集,土木学会
- Place of Presentation
  岐阜大学
- Year and Date
  2011-11-27
- Related Report
  2012 Final Research Report
[Presentation] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2011
- Author(s)
  Kato, H. and Miyagi,T.
- Organizer
  the 22nd Pacific Regional Science Conference, RSAI
- Place of Presentation
  Seoul KyoYuk MunHwa HoeKwan, Seoul, Korea
- Year and Date
  2011-07-05
- Related Report
  2012 Final Research Report
[Presentation] 動的経路選択行動の室内実験による検証と分析2011
- Author(s)
  池田愛, 宮城俊彦
- Organizer
  土木計画学研究委員会
- Place of Presentation
  岐阜大学工学部
- Related Report
  2011 Annual Research Report
[Presentation] オンライン凸計画問題と交通ネットワークゲーム2010
- Author(s)
  宮城俊彦
- Organizer
  土木計画学研究発表会春大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2010 Annual Research Report
[Presentation] カルマンフィルターを応用した所要時間推定法の提案実用性2010
- Author(s)
  宮田輝星・宮城俊彦
- Organizer
  土木計画学研究発表会春大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2010 Annual Research Report
[Presentation] DSGEモデルによる社会資本整備効果の計測法2010
- Author(s)
  仲原由布子・宮城俊彦
- Organizer
  土木計画学研究発表会秋大会
- Place of Presentation
  山梨大学
- Related Report
  2010 Annual Research Report
[Presentation] 確率的仮想プレイに基づく強化学習モデルと行動パラメータ推定2010
- Author(s)
  遠藤雅人・宮城俊彦
- Organizer
  土木計画学研究発表会秋大会
- Place of Presentation
  山梨大学
- Related Report
  2010 Annual Research Report
[Presentation] 部分空間同定法を用いた経済成長モデルの構造パラメータ推定2010
- Author(s)
  加藤裕人・宮城俊彦
- Organizer
  日本地域学会第47回年次大会
- Place of Presentation
  政策研究大学院大学
- Related Report
  2010 Annual Research Report
[Presentation] 離散的経路選択モデルとNash均衡2010
- Author(s)
  宮城俊彦
- Organizer
  第24回応用地域学会研究発表大会
- Place of Presentation
  名古屋大学
- Related Report
  2010 Annual Research Report
[Presentation] Traffic Games with Incomplete Travel Information
- Author(s)
  G.C. Peque,Jr.
- Organizer
  土木学会土木計画学研究会
- Place of Presentation
  京都大学
- Related Report
  2012 Annual Research Report
[Presentation] 経路選択行動モデルの構造パラメータ推定について
- Author(s)
  池田　愛
- Organizer
  土木学会土木計画学研究会
- Place of Presentation
  埼玉大学
- Related Report
  2012 Annual Research Report
[Presentation] 混雑ゲームにおけるNon-atomic モデルの数値計算特性について
- Author(s)
  張　洋
- Organizer
  土木学会土木計画学研究会
- Place of Presentation
  埼玉大学
- Related Report
  2012 Annual Research Report
[Presentation] nformed-user algorithms that converges to Nash equilibrium in traffic games
- Author(s)
  G.C. Peque,Jr.
- Organizer
  the 15th meeting of the Euro Working Group on Transportation
- Place of Presentation
  Cite Descartes, France.
- Related Report
  2012 Annual Research Report
[Presentation] 多数エージェントの動学的離散選択行動モデルに対する定常均衡アプローチについて
- Author(s)
  宮城俊彦
- Organizer
  日本地域学会
- Place of Presentation
  埼玉大学
- Related Report
  2012 Annual Research Report
[Presentation] Adaptive Learning Algorithms for Traffic Games with Naive Users
- Author(s)
  G.C. Peque,Jr.
- Organizer
  the 20th International Symposium on Transportation and Traffic Theory
- Place of Presentation
  Delft University
- Related Report
  2012 Annual Research Report

Theory of Reinforcement Learning and Algorithms of Route Choice in Transportation Networks

Principal Investigator

MIYAGI Toshihiko 東北大学, 大学院・情報科学研究科, 教授 (20092968)

¥8,190,000 (Direct Cost: ¥6,300,000、Indirect Cost: ¥1,890,000)

Report

Research Products

[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013

Author(s)

Journal Title

Related Report

[Journal Article] 経路選択行動に関する室内実験2013

Author(s)

Journal Title

URL

Related Report

[Journal Article] 経路選択行動に関する室内実験2013

Author(s)

Journal Title

Related Report

[Journal Article] Adaptive Learning Algorithms for Traffic Games with Naive Users2013

Author(s)

Journal Title

Related Report

[Journal Article] Estimation of Structural Parameters of the Economic Growth Model with Subspace System Identification2012

Author(s)

Journal Title

Related Report

[Journal Article] Informed-user algorithms that converges to Nash equilibrium in traffic games2012

Author(s)

Journal Title

Related Report

[Journal Article] DSGEモデルによる公共投資の効果分析とモデルの時変パラメータ推定2012

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Informed-user algorithms that converges to Nash equilibrium2012

Author(s)

Journal Title

Related Report

[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011

Author(s)

Journal Title

Related Report

[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011

Author(s)

Journal Title

Related Report

[Journal Article] An adaptive learning algorithm for a route choice problem in uncertain traffic environments2011

Author(s)

Journal Title

Related Report

[Journal Article] 適応的経路選択モデルにおける経路集合の限定手法と経路分散パラメータの推定法2011

Author(s)

Journal Title

Related Report

[Journal Article] テレワーク通勤への転換行動分析のためのリグレット・マッチング・モデル2010

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Regret Matchingを用いた経路選択行動分析:不完全交通情報を仮定した粒子モデル2010

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 社会資本整備を内包した経済成長モデルのパラメータ推定2010

Author(s)

Journal Title

Related Report

[Presentation] Adaptive Learning Algorithms for Traffic Games with Naive Users2013

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 経路選択行動モデルの構造パラメータ推定について2012

Author(s)

Organizer

Place of Presentation