Unsupervised Learning
we are just given output data, without any inputs. The goal is to discover interesting structure in the data
Discovering clusters
Let denote the number of clusters. Our first goal is to estimate the distribution over the number of clusters,
Estimating which cluster each point belongs to
Let represent the cluster to which data point i is assigned.
Real world applications
- Autoclass system in astronomy
- Clustering users into groups in e-commerce