This webpage is for an old version of the course; content may be out of date!
CSE 158/258: Web Mining and Recommender Systems
Autumn 2020, Monday/Wednesday 17:00-18:20 PST, Twitch
For those unable to access twitch, or attend the lecture time, all recordings will be posted here on the day following the lecture
CSE 158 and 258 are undergraduate and graduate courses devoted to current methods for recommender systems, data mining, and predictive analytics. No previous background in machine learning is required, but all participants should be comfortable with programming (all example code will be in Python), and with basic optimization and linear algebra.
The course meets twice a week on Monday/Wednesday evenings, starting October 5. Meetings are livestreamed on twitch, but recordings will also be made available here.
There is no textbook for the course, though chapter references will be provided from Pattern Recognition and Machine Learning (Bishop), and from Charles Elkan's 2013 course notes. Links are also provided to our Coursera Specialization, which covers similar material.
Office hours:
Office hours (and instructions to access) for each class are posted to Piazza
Assessment:
Grading:
- Each Homework is worth 8%. Your lowest (of four) homework grades is dropped (or one homework can be skipped).
- The (take-home) Midterm is worth 26%.
- Each Assignment is worth 25%.
- Assignment 2 is a group assignment. All other assessment must be completed individually.
- All assessments are due before the Monday lecture on the due date. Late submissions are not accepted.
1 | Supervised Learning: Regression |
---|
Monday October 5 / Wednesday October 7:
- Least-squares regression
- Overfitting and regularization
- Training, validation, and testing
Other resources:
Coursera slides (introductory):
Code examples:
Lecture materials |
lecture 1 video |
lecture 2 video |
slides |
+ annotations |
2 | Supervised Learning: Classification |
---|
Monday October 12 / Wednesday October 14:
- Logistic regression
- SVMs
- Multiclass and multilabel classification
- How to evaluate classifiers
Other resources:
Coursera slides:
Code examples:
Lecture materials |
lecture 3 video |
lecture 4 video |
slides |
+ annotations |
3 | Dimensionality Reduction and Clustering |
---|
Monday October 19 / Wednesday October 21:
- Principal Component Analysis
- K-means & hierarchical clustering
- Community detection
Other resources:
Code examples:
Lecture materials |
lecture 5 video |
lecture 6 video |
slides |
+ annotations |
Monday October 26 / Wednesday October 28:
- Collaborative Filtering
- Latent Factor Models
Other resources:
Coursera slides:
Code examples:
Kaggle pages (Assignment 1):
Lecture materials |
lecture 7 video |
lecture 8 video |
slides |
+ annotations |
Monday November 2 / Wednesday November 4:
- Sentiment analysis
- Bags-of-words
- TF-IDF
- Stopwords, stemming, and low-dimensional representations of text
Other resources:
Code examples:
Lecture materials |
lecture 9 video |
lecture 10 video |
slides |
+ annotations |
- Midterm released 6:30pm Monday Nov 9
- Midterm due on gradescope 6:30pm Tuesday Nov 10
Lecture materials |
lecture 11 video |
No lecture | November 11 (Veteran's Day) |
---|
Monday November 9:
- Crawling and parsing data from the Web
- Manipulating time and date data
- Simple plotting with Matplotlib
- General-purpose gradient descent in Tensorflow
Code examples:
Lecture materials |
slides |
+ annotations |
lecture 12 video |
7 | Data Mining in Social Networks |
---|
Monday November 16 / Wednesday November 18
- Power-laws and small-worlds
- Random graph models
- Triads and weak ties
- HITS and PageRank
Other resources:
Lecture materials |
slides |
+ annotations |
lecture 13 video |
lecture 14 video |
8 | State-of-the-art Recommender Systems |
---|
Monday November 23
- State-of-the-art Recommender Systems
- Bayesian Personalized Ranking
- Factorizing Personalized Markov Chains for Next-Basket Recommendation
- Personalized Ranking Metric Embedding for Next New POI Recommendation
- Real-world Applications
- Recommending product sizes to customers
- Playlist prediction via Metric Embedding
Lecture materials |
slides |
+ annotations |
lecture 15 video |
Monday November 30:
- Matching & marriage problems
Wednesday December 2:
- AdWords
- Bandit algorithms
Lecture materials |
slides |
+ annotations |
lecture 16 video |
10 | Modeling Temporal and Sequence Data |
---|
Monday December 7 / Wednesday December 9
- Sliding windows and autoregression
- Temporal dynamics in recommender systems
- Temporal dynamics in text and social networks
Code examples: