動的大規模データ環境における適応推薦

Research Project

Project/Area Number	15J09850
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Research Field	Intelligent informatics
Research Institution	The University of Tokyo
Principal Investigator	小宮山純平東京大学, 情報理工学系研究科, 特別研究員(DC2)
Project Period (FY)	2015-04-24 – 2017-03-31
Project Status	Declined (Fiscal Year 2016)
Budget Amount *help	¥1,700,000 (Direct Cost: ¥1,700,000) Fiscal Year 2016: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2015: ¥900,000 (Direct Cost: ¥900,000)
Keywords	多腕バンディット問題 / 確率的最適化 / 探索と活用のトレードオフ / コールドスタート問題 / 情報推薦 / 情報検索 / ウェブデータ活用 / 大規模データ処理
Outline of Annual Research Achievements	本年度は、機械学習・学習理論分野で３本の論文が採択され、研究結果を発表した。これらの発表は、システムの未知のパラメータ（推薦すべきデータの性質に依存した不確定性）を効率的に学習できる確率的バンディット問題という共通の数理的基盤を持ちながら、複数の問題への応用範囲を持った内容となっている。１本目の論文（ICML2015で発表）はオンライン広告の推薦、２本目の論文（COLT2015で発表）は検索エンジンのランキング最適化を目的とし、いずれも実データを基にしたシミュレーションで既存手法の1/5から1/10のデータで学習が行える、また計算効率も良い手法の提案となっている。３本目の論文（NIPS2015で発表）は、これらの問題にひそむ共通のデータ構造に関する研究を行い、前述の２論文をを含む広いクラスの問題に対する推薦アルゴリズム（PM-DMED，部分モニタリング経験尤度最小化法）を提案した。このアルゴリズムは少ないデータサイズで情報理論的に最適な推薦を行うことができる。ユーザに対してどのようなコンテンツを推薦すればよいかに関して、ウェブサービスのようなフィードバック（ユーザのアクション）を観測できるような推薦をうまく動かし、ユーザの望むコンテンツを提示することができる。これらの論文はいずれも関連分野のトップ国際会議での発表であり、データを逐次学習する機械学習研究の最先端のものであると考える。
Research Progress Status	翌年度、交付申請を辞退するため、記入しない。
Strategy for Future Research Activity	翌年度、交付申請を辞退するため、記入しない。

Report

(1 results)

2015 Annual Research Report

Research Products
(13 results)

All 2016 2015 Other

All Journal Article (3 results) (of which Peer Reviewed: 3 results, Open Access: 3 results, Acknowledgement Compliant: 3 results) Presentation (8 results) (of which Int'l Joint Research: 1 results, Invited: 4 results) Remarks (2 results)

[Journal Article] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays2015
- Author(s)
  J. Komiyama, J. Honda, and H. Nakagawa
- Journal Title
  
  Proceedings of the 32nd International Conference on Machine Learning
  
  Volume: 1 Pages: 1152-1161
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015
- Author(s)
  J. Komiyama, J. Honda, H. Kashima, and H. Nakagawa
- Journal Title
  
  Proceedings of the 28th Annual Conference on Learning Theory
  
  Volume: 1 Pages: 1141-1154
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring2015
- Author(s)
  J. Komiyama, J. Honda, and H. Nakagawa
- Journal Title
  
  Proceedings of the 29th Neural Information Processing Systems
  
  Volume: 1
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Presentation] 確率的バンディット問題の近年の研究動向について2016
- Author(s)
  小宮山純平
- Organizer
  若手研究者のための大学間合同セミナー（STRセミナー）
- Place of Presentation
  北海道大学
- Year and Date
  2016-03-21
- Related Report
  2015 Annual Research Report
[Presentation] 部分モニタリング問題における漸近最適アルゴリズム2016
- Author(s)
  小宮山純平
- Organizer
  統計学と機械学習における数理とモデリング（シンポジウム）
- Place of Presentation
  東京工業大学
- Year and Date
  2016-02-21
- Related Report
  2015 Annual Research Report
[Presentation] 確率的バンディット問題における効率的な学習アルゴリズム2015
- Author(s)
  小宮山純平
- Organizer
  「学習とメカニズムデザイン」ワークショップ
- Place of Presentation
  九州大学
- Year and Date
  2015-12-01
- Related Report
  2015 Annual Research Report
- Invited
[Presentation] 比較バンディット問題における最適アルゴリズム2015
- Author(s)
  小宮山純平
- Organizer
  第23回情報論的学習理論と機械学習研究会
- Place of Presentation
  つくば市（エポカルつくば）
- Year and Date
  2015-11-25
- Related Report
  2015 Annual Research Report
- Invited
[Presentation] 最適コンテンツ提示問題のための効率的なアルゴリズム2015
- Author(s)
  小宮山純平
- Organizer
  日本応用数理学会年会
- Place of Presentation
  金沢大学
- Year and Date
  2015-09-09
- Related Report
  2015 Annual Research Report
- Invited
[Presentation] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015
- Author(s)
  Junpei Komiyama
- Organizer
  Machine Learning Summer School 2015 Kyoto
- Place of Presentation
  京都大学
- Year and Date
  2015-08-23
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015
- Author(s)
  小宮山純平
- Organizer
  ERATO感謝祭SeasonII
- Place of Presentation
  国立情報学研究所（一橋講堂）
- Year and Date
  2015-08-03
- Related Report
  2015 Annual Research Report
- Invited
[Presentation] 比較バンディット問題における最適なアルゴリズム　～ランキング手法比較や選好情報学習を目的として～2015
- Author(s)
  小宮山純平、本多淳也、鹿島久嗣、中川裕志
- Organizer
  第21回情報論的学習理論と機械学習研究会
- Place of Presentation
  沖縄科学技術大学院大学
- Year and Date
  2015-06-23
- Related Report
  2015 Annual Research Report
[Remarks] MultiBanditLib:a multi-play multi-armed bandit lib
- URL
  https://github.com/jkomiyama/multiplaybanditlib
- Related Report
  2015 Annual Research Report
[Remarks] DuelingBanditLib: a simple dueling bandit library
- URL
  https://github.com/jkomiyama/duelingbanditlib
- Related Report
  2015 Annual Research Report

動的大規模データ環境における適応推薦

Principal Investigator

小宮山 純平 東京大学, 情報理工学系研究科, 特別研究員(DC2)

¥1,700,000 (Direct Cost: ¥1,700,000)

Report

Research Products

[Journal Article] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays2015

Author(s)

Journal Title

Related Report

[Journal Article] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015

Author(s)

Journal Title

Related Report

[Journal Article] Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring2015

Author(s)

Journal Title

Related Report

[Presentation] 確率的バンディット問題の近年の研究動向について2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 部分モニタリング問題における漸近最適アルゴリズム2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 確率的バンディット問題における効率的な学習アルゴリズム2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 比較バンディット問題における最適アルゴリズム2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 最適コンテンツ提示問題のための効率的なアルゴリズム2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 比較バンディット問題における最適なアルゴリズム ～ランキング手法比較や選好情報学習を目的として～2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Remarks] MultiBanditLib:a multi-play multi-armed bandit lib

URL

Related Report

[Remarks] DuelingBanditLib: a simple dueling bandit library

URL

Related Report

小宮山純平東京大学, 情報理工学系研究科, 特別研究員(DC2)

[Presentation] 比較バンディット問題における最適なアルゴリズム　～ランキング手法比較や選好情報学習を目的として～2015