EE290O

Course Schedule / Syllabus

Event	Date	Description	Course Materials
Lecture	R 8/23	1b. Multi-agent systems. Introduction to reinforcement learning. Course overview.	[slides]
H0	R 8/23	Homework 0 released	[Hw0]
H1	8/28	Homework 1 released	[Homework 1]
Lecture	8/28	2a. Deep RL and Markov Decision Processes.	[slides] Sutton and Barton, Ch 3
Lecture	8/30	Traffic control and dynamics. Microscopic models. Macroscopic models. Formulating MDPs.	[slides] Treiber and Kesting, Ch 6.1 to 6.3, 10.1 & 10.2, and 11.3
Discussion		Flow, SUMO, OpenAI Gym setup	[slides] links to tutorials
Lecture	9/4	Tabular MDPs. Value iteration. Policy iteration.	[slides] Sutton and Barton, Ch 4.1-4.4
Lecture	9/6	Math review. Linear algebra. Calculus review. Matrix calculus. Basic optimization (GD, SGD).	[slides]
Discussion
Lecture	9/11	Introduction to Neural Networks	[slides] Link 1 Link 2
Lecture	9/13	Approximate Dynamic Programming	[slides]
H1	9/13	Homework 1 due
H2	9/13	Homework 2 released
Lecture	9/18	Traffic data, and estimation	[slides] Treiber and Kesting, Ch 16
Lecture	9/20	Traffic congestion	[slides] Treiber and Kesting, Ch 15
Lecture	9/25	Policy optimization: Derivative-free methods + Finite Difference Methods	[slides] ???
H2	9/27	Homework 2 due
H3	9/27	Homework 3 released
Lecture	9/27	Policy optimization: Policy Gradients I	[slides] ???
Lecture	10/2	Policy optimization: Policy Gradients II
Lecture	10/4	Cooperative Multi-agent RL: Nash Equilibria, Non-stationarity, Multi-agent architectures, Project Introductions
Project	10/9	Project Proposals due
Lecture	10/9	Advanced Cooperative Multi-agent RL: Techniques for handling non-stationarity
Lecture	10/11	Monte-Carlo Tree Search
Lecture	10/16	Project Proposal Presentations
Lecture	10/18	Advanced Cooperative Multi-agent RL: Hierarchical RL, Self-play, GANs
H3	10/18	Homework 3 due
H4	10/18	Homework 4 released
Lecture	10/23	Policy optimization: Policy Gradients III
Lecture	10/25	Exploration + Reward shaping
Lecture	10/30	Best practices in RL
Lecture	11/1	Introduction to Game Theory
H4	11/1	Homework 4 due
	11/6	No Class
	11/8	No Class
Lecture	11/13	Lecture from Flow team
Project	11/13	Project Update Due
Lecture	11/15	Guest lecture
	11/20	Open Project Office Hours
	11/22	Thanksgiving \| No Class
	11/27	Guest lecture or Open Project Office Hours
	11/29	Guest lecture or Open Project Office Hours
	12/04	RRR
	12/06	Project paper / Presentations
	12/11	Finals Week
	12/13	Finals Week

EE290O | Deep multi-agent reinforcement learning with applications to autonomous traffic

Course Schedule / Syllabus