Page 1 of 14 file:///Users/jtleek/Dropbox/Jeff/teaching/2013/coursera/week3/005kmeansClustering/index.html#1
K-means clustering
Jeffrey Leek, Assistant Professor of Biostatistics
Johns Hopkins Bloomberg School of Public Health
2/3/13 8:06 PM K-means clustering
Can we find things that are close together?
How do we define close?
How do we group things?
How do we visualize the grouping?
How do we interpret the grouping?
How do we define close?
Most important step
Distance or similarity
Pick a distance/similarity that makes sense for your problem
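To make the choice of distance concrete, here is a minimal Python sketch comparing two common options. The function names `euclidean` and `manhattan` and the example points are illustrative assumptions, not part of the original slides; which distance is appropriate depends on your problem.

```python
import math

def euclidean(a, b):
    """Straight-line distance between two equal-length numeric sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    """Sum of absolute coordinate differences ("city block" distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))

# Hypothetical example points: the same pair is 5 apart under one
# definition of "close" and 7 apart under the other.
p, q = (0.0, 0.0), (3.0, 4.0)
print(euclidean(p, q))  # 5.0
print(manhattan(p, q))  # 7.0
```

The two functions can rank pairs of points differently, which is one reason the choice of distance matters before any clustering is run.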
K-means clustering
A partitioning approach
Requires
Produces
Pick by eye/intuition
Pick by cross validation/information theory, etc.
Determining the number of clusters
Different # of clusters
Different number of iterations
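As a rough illustration of the points above, the following Python sketch implements a basic k-means loop using squared Euclidean distance. It is a sketch under stated assumptions, not the implementation used in the lecture: the names `kmeans`, `assign`, `within_ss`, the `init` and `seed` parameters, and the toy data are all hypothetical. It takes a number of clusters (and optionally starting centroids) and returns centroids plus a cluster label for each point. The within-cluster sum of squares is one quantity people inspect when picking the number of clusters by eye.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(points, centroids):
    """Label each point with the index of its closest centroid."""
    return [min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
            for p in points]

def kmeans(points, k, n_iter=20, init=None, seed=None):
    """Basic k-means: alternate between assigning points and moving centroids.

    init: optional list of k starting centroids (assumed parameter);
    if omitted, k data points are sampled at random as the starting guess.
    """
    rng = random.Random(seed)
    centroids = list(init) if init is not None else rng.sample(points, k)
    for _ in range(n_iter):
        labels = assign(points, centroids)
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # leave a centroid in place if its cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, assign(points, centroids)

def within_ss(points, centroids, labels):
    """Total within-cluster sum of squared distances."""
    return sum(sq_dist(p, centroids[lab]) for p, lab in zip(points, labels))

# Toy data (hypothetical): two well-separated groups of three points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]

centroids, labels = kmeans(points, k=2, init=[(0, 0), (10, 10)])
print(labels)  # [0, 0, 0, 1, 1, 1]

# Compare the fit for several values of k; results with random starts
# can change with the seed and the number of iterations.
for k in (1, 2, 3):
    c, lab = kmeans(points, k, seed=1)
    print(k, within_ss(points, c, lab))
```

Because the starting centroids are sampled at random when `init` is not given, rerunning with a different seed, a different `k`, or a different `n_iter` can produce a different clustering.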