What is hierarchical clustering? What is agglomerative hierarchical clustering? What is divisive hierarchical clustering?
The hierarchical clustering method produces a set of nested clusters organized as a hierarchical tree by performing Hierarchical decomposition (merge or split) of data point's base on a similarity or distance matrix Depending on whether the hierarchical decomposition is formed in a bottom-up (merging) or top-down (splitting) fashion a hierarchical clustering method can be classified into two main categories. They are:
Agglomerative hierarchical clustering: It works in a bottom-up manner. That is, each object is initially considered as a single-element cluster. At each step of the algorithm, the two clusters that are the most similar are combined into a new bigger cluster. This procedure is iterated until all points are members of just one single big cluster. The result is a tree as shown in the figure below. Agglomerative clustering is good at identifying small clusters.
- Do not have to assume any particular number of clusters. Any desired number of clusters can be obtained by 'cutting the tree at the proper level.
- They may correspond to meaningful taxonomies for example in biological sciences Sh(e.g., animal kingdom, phylogeny reconstruction).
- Hierarchical clustering methods can encounter difficulties regarding the selection of merge or split points. Such a decision is critical because once a group of objects is merged or split, the process at the next step will operate on the newly generated clusters. It will neither undo what was done previously, nor perform object swapping between clusters. Thus, merge or split decisions, if not well chosen, may lead to low-quality clusters.
- The methods do not scale well because each decision of merge or split needs to examine and evaluate many objects or clusters.
- No global objective function is directly minimized
Comments
Post a Comment