The following is a list of our units. Slides will be posted here when available.
Unit | Title | Slides | HW |
1 | Intro to Data Mining | Topic 1-DMIntro.ppt | HW1.html, Solution |
2 | EDA/Visualization Tools | Topic 2-EDAViz.ppt | HW2.html, Solution |
3 | Data Mining Concepts | Topic3-DMConcepts | |
4 | Regression | Topic4.1-RegressionZorych.pdf, Topic4.2-RegressionVolinsky.ppt | |
5 | Classification | Topic5.1-ClassificationMadigan.pdf, Topic5.2-ClassificationVolinsky.ppt | HW3.html, Solution |
6 | Clustering and Unsupervised Learning | Topic6-Clustering.ppt, Topic6.2-ClusteringExample.ppt | |
7 | Text Mining | Topic7-TextMining.ppt | |
 | Midterm | Sample Midterm Questions, MidtermReview | Midterm Solutions, R: Tree Code, Class Data |
8 | Web Mining | Topic8-WebMining.ppt | HW4.html, Solution |
9 | Advanced Classification: SVM and Neural Nets | Topic9-AdvancedClassfication.ppt | HW5.pdf, Solution |
10 | Ensemble Methods | Topic10-EnsembleMethods.ppt | |
11 | Bayesian Methods - Ken Shirley | Topic11-BayesianMethods.pdf | |
12 | Recommender Systems and the Netflix Prize | Topic12-Recommender Systems / Netflix Prize | HW6.html, Solution |
13 | Social and other Networks | Topic13-Networks | ExtraCreditHW.html, Extra Credit Solution |
14 | Class Presentations | | Term Project Notes |
15 | Final Exam | Final Review | Final Answer Key |
http://www2.research.att.com/~volinsky/DataMining/Columbia2011/Columbia2011.html