This course will cover a mixture of the following topics:
Yisong Yue yyue@caltech.edu
Stephan Zheng stephan@caltech.edu
Hoang Le hmle@caltech.edu
May be interesting for final project
Note: schedule is subject to change.
Date | Papers | Presenters | Materials | |
3/29/2016 | Introduction & Administrivia Follow the Leader Algorithm & Perceptron |
Yisong Yue | [slides] | |
3/31/2016 | Online Learning with Experts & Multiplicative Weights Algorithm | Stephan Zheng | [slides] | |
4/5/2016 | Online Convex Optimization | Ellen Feldman, Gautam Goel, Milan Cvitkovic Mentor: Yisong |
[slides] | |
4/7/2016 | Multi-armed Bandits & UCB1 Algorithm | Connor Lee, Ritvik Mishra, Hoang Le Mentor: Hoang |
[slides] | |
4/12/2016 | Linear Bandits & Applications | Feng Bi, Joon Sik Kim, Leiya Ma, Pengchuan Zhang Mentor: Yisong |
[slides] | |
4/14/2016 | Monte Carlo Tree Search & Applications | Suraj Nair, Peter Kundzicz, Vansh Kumar, Kevin An Mentor: Stephan |
[slides] |
|
4/19/2016 | Q-Learning for Reinforcement Learning & Applications | Timothy Chou, Charlie Tong, Vincent Zhuang Mentor: Stephan |
[slides] |
|
4/21/2016 | Apprenticeship Learning for Reinforcement Learning & Applications | Nick Haliday, Audrey Huang, Ritwik Anand, Dryden Bouamalay Mentor: Hoang |
[slides] |
|
4/26/2016 | Imitation Learning | Richard Zhu, Andrew Kang Mentor: Hoang |
[slides] |
|
4/28/2016 | Active Learning for Supervised Learning | Daniel Gu, Matthew Morgan, Keegan Ryan, Matthew Clark Mentor: Hoang |
[slides] |
|
5/3/2016 | Active Learning for Decision Making | Joe Marino, Grant Van Horn, Alvita Tran, Remy Yang Mentor: Yisong |
[slides] |
|
5/5/2016 | Crowdsourcing | Madhav Mohandas, Vincent Zhuang, Richard Zhu Mentor: Yisong |
[slides] |
|
5/10/2016 | Machine Teaching | Justin Leong, Kevin Tang, Zilong Chen, Kaikai Sheng Mentor: Yisong |
[slides] |
|
5/12/2016 | Machine Teaching for Crowdsourcing | Nancy Cao, Andrew Chico, Betsy Fu, Daniel Wang Mentor: Yisong |
[slides] |
|
5/17/2016 | Modeling Human Decision Making | Zachary Fein, Eric Gorlin, Emily Mazo, Kc Emezie Mentor: Hoang |
[slides] |
|
5/19/2016 | Combinatorial Action Spaces & Adaptive Routing | Luciana Cendon, Tobias Bischoff, Jiyun Ivy Xiao, Brennan Young Mentor: Yisong |
[slides] |
|
5/24/206 | Dueling Bandits | Fabian Boemer, Kushal Agarwal, Jialin Song, Aman Agarwal Mentor: Yisong |
[slides] |
|
5/26/2016 | Coactive Learning | Rohan Batra, Avishek Dutta, Nand Kishore, Siddarth Murching Mentor: Hoang |
[slides] |
|
5/31/2016 | Bayesian Optimization | Dimitar Ho, Danni Ma Mentor: Stephan |
[slides] |
|
6/2/2016 | Off-Policy Evaluation | Miguel Aroca-Ouellete, Akshta Athawale, Mannat Singh Mentor: Hoang |
[slides] |
|
Note: some papers belong to multiple categories.
Basic Online Learning
Online Learning with Experts
More Papers on Full Information Online Learning
Basic Multi-Armed Bandits (Partial Information Online Learning)
Bandit Convex Optimization
Bandits with Dependent Arms
Pure Exploration in Multi-Armed Bandits
Contextual Bandits
Bayesian Optimization
Online Learning in Combinatorial Action Spaces
Active Learning
Online Learning from Preference Feedback
Reinforcement Learning and Imitation Learning
Off Policy Evaluation and Learning
Crowdsourcing
Machine teaching
Modeling Human Decision Making & Interpreting Human Feedback
Safe Exploration
Connections to Game Theory