Unsupervised Learning

we are just given output data, without any inputs. The goal is to discover interesting structure in the data

Discovering clusters

Let KK denote the number of clusters. Our first goal is to estimate the distribution over the number of clusters, p(KD)p(K|D)

K=argmaxK p(KD) K' = argmax_K\space p(K|D)

Estimating which cluster each point belongs to

Let zi{1,...,K}z_i \in \{1, . . . , K\} represent the cluster to which data point i is assigned.

Real world applications

  1. Autoclass system in astronomy
  2. Clustering users into groups in e-commerce

results matching ""

    No results matching ""