CMPT 409/981 - Optimization for Machine Learning (Fall 2022)

Lectures (beginning Sep 8): Monday (2.30 pm - 3.20 pm) (WMC 2200) and Thursday (2.30 pm - 4.20 pm) (AQ 5037).

Instructor: Sharan Vaswani
Instructor office hours: Monday 4 pm - 5 pm (TASC-1 8221)

Course Objective: This course introduces the foundational concepts of convex and non-convex optimization with applications to machine learning. It will give the students experience in 1. Proving theoretical guarantees for optimization algorithms, 2. Analyzing machine learning (ML) problems from an optimization perspective and 3. Developing and analyzing new optimization methods for ML applications.

Prerequisites: Linear Algebra, Multivariable Calculus, (Undergraduate) Machine Learning

Textbook: There is no required textbook. We will use the following resources:

Grading: Assignments 50%, Project 50%

Piazza for course-related questions.

List of topics

Schedule

Date Topics Slides References Homework
Thursday Sep 8 Course logistics, Lipschitz continuity, Smoothness [L1] [Matrix Cookbook] [List of inequalities] [Linear Algebra Recap]
Monday Sep 12 Gradient descent convergence for smooth non-convex functions, Exact line-search [L2]
Thursday Sep 15 Convergence of GD with Back-tracking Armijo Line-search, Convex sets/functions [L3] Nocedal and Wright (3.1, 3.2), Boyd (2, 3)
Monday Sep 19 Holiday
Thursday Sep 22 Convergence of GD for smooth, convex and strongly-convex functions [L4] Bubeck (3.2, 3.4), [Convex Optimization Cheat Sheet] Assignment 1 released
Monday Sep 26 Projections onto convex sets, Projected GD, Nesterov acceleration and its convergence for smooth, convex functions [L5] Bubeck (3.7)
Thursday Sep 29 Holiday
Monday Oct 3 Heavy-Ball momentum and its convergence for quadratics [L6] [Notes]
Thursday Oct 6 Newton method and its convergence for smooth, strongly-convex functions [L7] Boyd (9.5) Assignment 1 due
Monday October 10 Holiday
Thursday Oct 13 Stochastic Gradient Descent and its convergence for smooth (convex) functions [L8] [More refined analysis]
Monday Oct 17 Stochastic Gradient Descent and its convergence for smooth, convex functions [L9] [Equivalent definitions of smoothness] [Equivalent definitions of strong convexity] Assignment 2 released
Thursday Oct 20 Stochastic Gradient Descent and its convergence for smooth, strongly-convex functions, Interpolation [L10] [SGD proof for smooth, strongly convex functions] [More refined analysis]
Monday Oct 24 Convergence of SGD for smooth, strongly-convex functions under interpolation, Convergence of SGD for smooth functions under strong growth condition [L11] [SGD under interpolation]
Thursday Oct 27 Stochastic Line-Search and its convergence under interpolation, Variance reduction, SVRG and its convergence [L12] [Stochastic Line-Search] [SVRG]
Monday Oct 31 Subgradient Descent and its convergence for Lipschitz, convex functions [L13] Bubeck (3.1) Project Proposal due
Thursday Nov 3 Online Convex Optimization - Online Gradient Descent, Follow the (regularized) leader and their regret bounds [L14] Orabona (2.1 - 2.2, 4.1, 7.1-7.3) Assignment 2 due
Monday Nov 7 AdaGrad and its regret bounds for convex, Lipschitz functions [L15] Orabona (4.2) [Original AdaGrad paper]
Thursday Nov 10 AdaGrad and its regret bound for convex, smooth functions; Stochastic minimization of smooth, non-convex functions using AdaGrad Norm [L16] [AdaGrad for smooth functions] [AdaGrad Norm for smooth, non-convex functions] Assignment 3 released
Monday Nov 14 Adam and its non-convergence, AMSGrad [L17] [Original Adam paper] [Non-convergence of Adam and AMSGrad]
Thursday Nov 17 AMSGrad and its convergence for smooth, convex functions [L18] [AMSGrad for smooth, convex functions]
Monday Nov 21 Min-Max Optimization - Gradient Descent Ascent and its convergence for convex-concave games [L19]
Thursday Nov 24 Gradient Descent Ascent, Extra Gradient and their convergence for convex-concave games, Wrapping up [L20] Assignment 4 released
Monday Nov 28 NeurIPS
Thursday Dec 1 NeurIPS
Monday Dec 5 Project Presentations [2.30 pm - 6 pm in TASC-1 9204]
Tuesday Dec 6 Project Presentations [1 pm - 4 pm in TASC-1 9204]
Tuesday Dec 12 Assignments 3, 4 due
Tuesday Dec 15 Final Project Report due

Related Courses