CSE 250A. Principles of Artificial Intelligence:
Probabilistic Reasoning and Decision-Making

Subject

Probabilistic methods for reasoning and decision-making under uncertainty. Topics include: inference and learning in directed probabilistic graphical models; prediction and planning in Markov decision processes; applications to computer vision, robotics, speech recognition, natural language processing, and information retrieval.

Prerequisites

The course is aimed broadly at advanced undergraduates and beginning graduate students in mathematics, science, and engineering. Prerequisites are elementary probability, multivariable calculus, linear algebra, and basic programming ability in some high-level language such as Python, Matlab, R, Julia, Java, or C. Programming assignments are completed in the language of the student's choice.

Relation to other courses

CSE 250A covers largely the same topics as CSE 150A, but at a faster pace and a more advanced mathematical level. The homework assignments and exams in CSE 250A are also longer and more challenging. In general, you should not take CSE 250A if you have already taken CSE 150A.

Administrivia

  1. Instructor: Lawrence Saul
    Office hour: Wed 3-4 pm (zoom)

  2. Teaching assistants:
    1. Aditi Mavalankar
    2. Dongxia (Allen) Wu
    3. Htut Khine Win
    4. Pan Xia
    5. Shubham Chaudhary
    6. Umesh Singla
    7. Xinghan Wang
    8. Zihao Zhou
    9. Zhuowen Zou

  3. Lectures:
    Tue/Thu 3:30-4:50 pm (zoom) [UPDATED 10/21]

  4. TA discussion sessions:
    1. Sun 4-5 pm (zoom) - Xinghan
    2. Mon 10-11 am (zoom) - Shubham
    3. Wed 9-10 am (zoom) - Htut Khine
    4. Sat 2-3 pm (zoom) - Zhuowen
    5. Wed 7:30-8:30 pm (zoom) - Aditi
    6. Thu 10-11 am (zoom) - Zihao
    7. Thu 6-7 pm (zoom) - Umesh
    8. Fri 5:30-6:30 pm (zoom) - Dongxia
    9. Fri 8:30-9:30 am (zoom) - Pan

  5. TA office hours:
    1. Sun 5-6 pm (zoom) - Xinghan
    2. Mon 11 am-noon (zoom) - Shubham
    3. Wed 10-11 am (zoom) - Htut Khine
    4. Wed 3-4 pm (zoom) - Zhuowen
    5. Wed 8:30-9:30 pm (zoom) - Aditi
    6. Thu 11 am-noon (zoom) - Zihao
    7. Thu 7-8 pm (zoom) - Umesh
    8. Fri 7:30-8:30 am (zoom) - Pan
    9. Fri 4:30-5:30 pm (zoom) - Dongxia

  6. Grading:
    (75%) best 8 of 9 homework assignments
    (25%) take-home final exam
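
As a rough illustration of the weighting above, the sketch below combines homework and exam scores into a single course score. It assumes that every score is expressed as a fraction between 0 and 1 and that the eight counted homework assignments are weighted equally; neither assumption is stated on this page, and the function name `course_grade` is hypothetical.

```python
def course_grade(homework_scores, final_exam_score):
    """Combine scores (each a fraction in [0, 1]) into a course score in [0, 1].

    Assumptions (not stated on the course page): the eight counted
    homework assignments carry equal weight, and all scores are fractions.
    """
    if len(homework_scores) != 9:
        raise ValueError("expected exactly 9 homework scores")
    # Keep the best 8 of the 9 homework scores (the lowest one is dropped).
    best_eight = sorted(homework_scores, reverse=True)[:8]
    homework_average = sum(best_eight) / len(best_eight)
    # 75% homework, 25% take-home final exam.
    return 0.75 * homework_average + 0.25 * final_exam_score
```

Under these assumptions, a single missed homework (a score of 0) does not lower the result, since it is the one that gets dropped.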

Textbooks

The course does not closely follow a particular text; the lectures are meant to be self-contained. Nevertheless, the following texts (though not required) may be useful as general references:

Canvas

Enrolled students should monitor Canvas for more information, including course announcements, homework assignments, and additional resources.

Syllabus

Thu Sep 23 Administrivia and course overview.
Tue Sep 28 Modeling uncertainty, review of probability, explaining away. HW 1 out.
Thu Sep 30 Belief networks: from probabilities to graphs.
Tue Oct 05 Representing conditional probability tables. Conditional independence and d-separation. HW 1 due. HW 2 out.
Thu Oct 07 Probabilistic inference in polytrees.
Tue Oct 12 More algorithms for inference: node clustering, cutset conditioning, likelihood weighting. HW 2 due. HW 3 out.
Thu Oct 14 Markov Chain Monte Carlo algorithms for inference. Learning from complete data.
Tue Oct 19 Maximum likelihood estimation. Markov models of language. Naive Bayes models of text. HW 3 due. HW 4 out.
Thu Oct 21 Linear regression and least squares. Detour on numerical optimization.
Tue Oct 26 Logistic regression, gradient descent, Newton's method. Learning from incomplete data. HW 4 due. HW 5 out.
Thu Oct 28 EM algorithm for discrete belief networks: derivation and proof of convergence.
Tue Nov 02 EM algorithms for word clustering and linear interpolation. HW 5 due. HW 6 out.
Thu Nov 04 EM algorithms for noisy-OR and matrix completion. Discrete hidden Markov models.
Tue Nov 09 Computing likelihoods and Viterbi paths in hidden Markov models. HW 6 due. HW 7 out.
Wed Nov 10 Make-up lecture. Forward-backward algorithm in HMMs. Gaussian mixture models.
Thu Nov 11 Veterans Day holiday.
Tue Nov 16 Linear dynamical systems. Reinforcement learning and Markov decision processes. HW 7 due. HW 8 out.
Thu Nov 18 State and action value functions, Bellman equations, policy evaluation, greedy policies.
Tue Nov 23 Policy improvement and policy iteration. Value iteration. Algorithm demos. HW 8 due. HW 9 out.
Thu Nov 25 Thanksgiving holiday.
Tue Nov 30 Convergence of value iteration. Model-free algorithms. Temporal difference prediction.
Thu Dec 02 Q-learning, RL in large state spaces. Bonus topics. Course wrap-up. HW 9 due.
Mon Dec 06 Remote (take-home) final exam.