Date | Topic (+ Notes) | Video | Link | Assignment (latex) | Project |
---|---|---|---|---|---|
Mon 8.18 | Class Overview | - | |||
Wed 8.20 | Statistics Phenomena (S) | - | M4D 2.2-2.3 | MMDS 1.2 | FoDS 12.4 | ||
Mon 8.25 | Similarity : Language Embeddings (S) | - | M4D 4.4 | ||
Wed 8.27 | Similarity : Metric Distances (S) | - | M4D 4 - 4.3 | MMDS 3.5 + 7.1 | FoDS 8.1 | Statistical Phenomenon | |
Mon 9.01 | |||||
Wed 9.03 | Similarity : ANN: HNSW GraphSearch (S) | - | M4D 4.4 | MMDS 3.7 + 7.1.3 | ||
Mon 9.08 | Similarity : Jaccard + Distribution Distances (S) | - | M4D 4.4 | ||
Wed 9.10 | Similarity : K-grams -> LSH (S) | - | M4D 4.3-4.4 + 4.6 | MMDS 3.1 + 3.2 + 3.3 + 3.4| FoDS 7.3 | Proposal | |
Mon 9.15 | Clustering : Hierarchical (S) | - | M4D 8.5, 8.2 | MMDS 7.2 | FoDS 7.7 | ||
Wed 9.17 | Clustering : K-Means (S) | - | M4D 8-8.3 | MMDS 7.3 | FoDS 7.2-3 | Similarity | |
Mon 9.22 | Clustering : Spectral (S) | - | M4D 10.3 | MMDS 10.4 | FoDS 7.5 | ||
Wed 9.24 | Clustering : Choosing k | - | M4D 10.3 | Data Collection Report | |
Mon 9.29 | Streaming : Model, Sampling, and Quantiles | - | MMDS 4.3 | ||
Wed 10.01 | Streaming : Misra-Greis and Count-Min (S) | - | M4D 11.1 - 11.2.2 | FoDS 6.2.3 | MMDS 6 | Clustering | |
Mon 10.06 | |||||
Wed 10.08 | |||||
Mon 10.13 | Streaming : Count Sketch, Distinct Counting, and Apriori (S) | - | M4D 11.2.3-4 | FoDS 6.2.3 | MMDS 4.3 | ||
Wed 10.15 | Streaming : Misc | - | MMDS 4.1 | Frequency Estimation | |
Wed 10.15 | |||||
Mon 10.20 | Dim Reduce : SVD + PCA (S) | - | M4D 7-7.3, 7.5 | FoDS 4 | ||
Wed 10.22 | Dim Reduce : Random Projections (S) | - | M4D 7.1, 7.10 | FoDS 2.7 | Intermediate Report | |
Mon 10.27 | Dim Reduce : Matrix Sketching (S) | - | M4D 11.3 | MMDS 9.4 | FoDS 2.7 + 7.2.2 | arXiv | ||
Wed 10.29 | Dim Reduce : Metric Learning (S) | - | M4D 7.6-7.8 | LDA | LDML | ||
Mon 11.03 | Noise : Outliers and Robust Estimation (S) | - | M4D 7.10 + 8.6 | MMDS 9.1 | FoDS 2.9 | Tutorial | robust mean | ||
Wed 11.05 | Noise : Anomaly Detection (S) | - | Dimensionality Reduction | ||
Mon 11.10 | Noise : Encoding Concepts and Bias | [Ethics Read, VERB] | |||
Wed 11.12 | Noise : (Differential) Privacy (S) | - | McSherry | Dwork | ||
Wed 11.19 | Graph Analysis : Markov Chains (S) | - | M4D 10.1 | MMDS 10.1 + 5.1 | FoDS 5 | Weckesser | ||
Mon 11.24 | Graph Analysis : PageRank (S) | - | M4D 10.2 | MMDS 5.1 + 5.4 | ||
Wed 11.26 | Graph Analysis : Graph Embeddings + Review (S) | - | Graphs + Noise | Final Report | |
Mon 12.01 | Graph Analysis : Communities (S) | - | M4D 10.4 | MMDS 10.2 + 5.5 | FoDS 8.8 + 3.4 | ||
Wed 12.03 | |||||
Thu 12.04 | Poster Outline | ||||
Fri 12.12 | Poster Day !!! (1:00-3:00pm) | Poster Presentation |