What is hierarchical clustering? What is agglomerative hierarchical clustering? What is divisive hierarchical clustering?

- December 29, 2021

The hierarchical clustering method produces a set of nested clusters organized as a hierarchical tree by performing Hierarchical decomposition (merge or split) of data point's base on a similarity or distance matrix Depending on whether the hierarchical decomposition is formed in a bottom-up (merging) or top-down (splitting) fashion a hierarchical clustering method can be classified into two main categories. They are:

Agglomerative hierarchical clustering: It works in a bottom-up manner. That is, each object is initially considered as a single-element cluster. At each step of the algorithm, the two clusters that are the most similar are combined into a new bigger cluster. This procedure is iterated until all points are members of just one single big cluster. The result is a tree as shown in the figure below. Agglomerative clustering is good at identifying small clusters.

Advantages of Hierarchical clustering:

Do not have to assume any particular number of clusters. Any desired number of clusters can be obtained by 'cutting the tree at the proper level.
They may correspond to meaningful taxonomies for example in biological sciences Sh(e.g., animal kingdom, phylogeny reconstruction).

Disadvantages of Hierarchical clustering:

Hierarchical clustering methods can encounter difficulties regarding the selection of merge or split points. Such a decision is critical because once a group of objects is merged or split, the process at the next step will operate on the newly generated clusters. It will neither undo what was done previously, nor perform object swapping between clusters. Thus, merge or split decisions, if not well chosen, may lead to low-quality clusters.

The methods do not scale well because each decision of merge or split needs to examine and evaluate many objects or clusters.

No global objective function is directly minimized

Search This Blog

Notes for BSc CSIT

What is hierarchical clustering? What is agglomerative hierarchical clustering? What is divisive hierarchical clustering?

Comments

Post a Comment

Popular posts from this blog

Discuss classification or taxonomy of virtualization at different levels.

What is RMI? Discuss stub and skeleton. Explain its role in creating distributed applications.