AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


k-divisive hierarchical clustering

Author

Term

4. term

Publication year

2003

Abstract

I dette speciale introduceres en ny delende (top-down) hierarkisk klyngealgoritme, kaldet den k-divisive hierarkiske klyngealgoritme. Klyngedannelse grupperer lignende datapunkter, og hierarkisk klyngedannelse bygger et træ (et dendrogram), som viser, hvordan grupper deles eller samles. Delende metoder starter med alle data i én klynge og splitter dem i mindre klynger. Den foreslåede metode opbygger hierarkiet ved gentagne gange at splitte en klynge i to underklynger ved hjælp af k-means (en udbredt metode, der opdeler data i k grupper). Der foreslås to stopkriterier, som afgør, hvornår en klynge ikke skal deles yderligere, med målet om at skabe et tydeligt dendrogram af høj kvalitet. Algoritmen og stopkriterierne er implementeret og testet på både syntetiske og virkelige datasæt.

This thesis introduces a new divisive hierarchical clustering algorithm, called the k-divisive hierarchical clustering algorithm. Clustering groups similar data points, and hierarchical clustering builds a tree (a dendrogram) that shows how groups split or merge. Divisive methods start with all data in one cluster and split it into smaller clusters. The proposed method builds the hierarchy by repeatedly splitting a cluster into two sub-clusters using the k-means algorithm (a common method that partitions data into k groups). Two stopping criteria are proposed to decide when a cluster should no longer be split, with the goal of producing a clear, high-quality dendrogram. The algorithm and the stopping criteria were implemented and tested on both synthetic and real-world datasets.

[This abstract was generated with the help of AI]