K-means clustering

The K-means clustering algorithm, a commonly used clustering algorithm, is an iterative unsupervised learning process used to minimize the distance of the data point from the average data point in the cluster.^[1] The k-means algorithm is one of the fastest clustering algorithms available.^[2] K-means can group data only unsupervised based on the similarity of customers to each other. It is a type of partitioning clustering, as it divides the data into K non-overlapping subsets or clusters without any cluster internal structure or labels. The objective of k-means is to form clusters in such a way that similar samples go into a cluster, and dissimilar samples fall into different clusters. It aims to minimize the “intra cluster” distances and maximize the “inter-cluster” distances, and to divide the data into non-overlapping clusters without any cluster-internal structure.^[3]

References

↑ 7 Innovative Uses of Clustering Algorithms in the Real Worlddatafloq.com
↑ KMeansscikit-learn.org
↑ Intro to k-MeansCoursera

[1] 7 Innovative Uses of Clustering Algorithms in the Real Worlddatafloq.com

[2] KMeansscikit-learn.org

[coursera-3] Intro to k-MeansCoursera

[1]

[2]

[3]

See also

References