CMPT 409/981 - Optimization for Machine Learning (Fall 2024)

Lectures (beginning Sep 5): Tuesday (1.30 pm - 2.20 pm) and Thursday (12.30 pm - 2.20 pm) (SWH 10051).

Instructor: Sharan Vaswani
Instructor office hours: Thursday, 2.30 pm - 3.30 pm (TASC-1 8221)

Teaching Assistant; Qiushi Lin
TA office hours: Monday, 9.30 am - 10.30 am (ASB 9814)

Course Objective: Numerical optimization plays a key role in designing better algorithms for data science and artificial intelligence. This course introduces the foundational concepts of convex and non-convex optimization with applications to machine learning. It will give the students experience in 1. Proving theoretical guarantees for optimization algorithms, 2. Analyzing machine learning (ML) problems from an optimization perspective and 3. Developing and analyzing new optimization methods for ML applications.

Prerequisites: Probability (e.g. CMPT 210, STAT 270, STAT 271), Linear Algebra (e.g. MATH 240), Basics of Multivariable Calculus (e.g. MATH 251). It is encouraged (though not necessary) for students to have taken (or be simultaneously taking) an introductory course on machine learning (e.g. CMPT 410/726).

Textbook: There is no required textbook. We will use the following resources:

Grading: Assignments 48%, Project 50%, Participation 2%

Piazza for course-related questions.

List of topics

Schedule

Date Topics Slides References Homework
Thu September 5 Course logistics, Lipschitz continuity, Smoothness [L1] [Matrix Cookbook] [List of inequalities] [Linear Algebra Recap] Assignment 0 released
Tue September 10 Gradient descent convergence for smooth non-convex functions [L2] [Multivariable Calculus]
Thursday Sep 12 Exact line-search, Convergence of GD with Back-tracking Armijo Line-search, Convex sets/functions [L3] Nocedal and Wright (3.1, 3.2), Boyd (2, 3) Assignment 0 due
Tue September 17 Convergence of GD for smooth, convex functions [L4] Bubeck (3.2, 3.4), [Convex Optimization Cheat Sheet] Assignment 1 released
Thu September 19 Nesterov acceleration and its convergence, Strongly-convex functions and GD convergence [L5] Bubeck (3.7)
Tue September 24 Heavy-Ball momentum [L6]
Thu September 26 Convergence of Heavy-Ball momentum for quadratics, Preconditioned Gradient Descent and Newton method [L7] [Notes] Assignment 1 due
Tue October 1 Convergence of Newton method for smooth, strongly-convex functions [L8] Boyd (9.5)
Thu October 3 Dealing with constrained domains, Stochastic gradient descent and its convergence for smooth functions [L9] [Equivalent definitions of smoothness] [Equivalent definitions of strong convexity] Assignment 2 released
Tue October 8 Stochastic Gradient Descent and its convergence for smooth, convex functions [L10]
Thu October 10 Convergence of SGD for smooth, strongly-convex functions, Interpolation [L11] [SGD proof for smooth, strongly convex functions] [SGD under interpolation]
Tue October 15 Holiday
Thu October 17 SGD under interpolation, Stochastic Line-Search [L12] [Stochastic Line-Search]
Tue October 22 No class Project Proposal due
Thu October 24 Variance reduction & Convergence of SVRG, Lipschitz, convex functions [L13] [SVRG], Bubeck (3.1) Assignment 2 due
Tue October 29 Convergence of Subgradient Descent, Introduction to Online Optimization [L14] Bubeck (3.1)
Thu October 31 Online Convex Optimization, Online Gradient Descent, Online Mirror Descent [L15] Orabona (2.1 - 2.2, 4.1, 7.1-7.3)
Tue November 5 Online Mirror Descent [L16] Bubeck (4.2-4.3) Assignment 3 released
Thu November 7 Follow the (regularized) leader [L17] Orabona (4.1, 4.2, 7.1-7.3)
Tue November 12 AdaGrad and its regret bound [L18] [Original AdaGrad paper] [AdaGrad for smooth functions]
Thu November 14 Convergence of AdaGrad, Adam [L19] [Original Adam paper]
Tue November 19 Non-convergence of Adam [L20] [Non-convergence of Adam and AMSGrad] Assignment 4 released
Thu November 21 Min-Max Optimization, Gradient Descent Ascent and its convergence, Extra Gradient [L21]
Tue November 26 Convergence of Extra Gradient [L22]
Thu November 28 Project Presentations
Tue December 3 Project Presentations
Tue Dec 10 Assignments 3, 4 due
Tue Dec 17 Final Project Report due

Related Courses