K-means clustering: Difference between revisions

Latest revision as of 21:10, 31 March 2020

The K-means clustering algorithm, a commonly used clustering algorithm, is an iterative unsupervised learning process used to minimize the distance of the data point from the average data point in the cluster.^[1] The k-means algorithm is one of the fastest clustering algorithms available.^[2] K-means can group data only unsupervised based on the similarity of customers to each other. It is a type of partitioning clustering, as it divides the data into K non-overlapping subsets or clusters without any cluster internal structure or labels. The objective of k-means is to form clusters in such a way that similar samples go into a cluster, and dissimilar samples fall into different clusters. It aims to minimize the “intra cluster” distances and maximize the “inter-cluster” distances, and to divide the data into non-overlapping clusters without any cluster-internal structure.^[3]

References

↑ 7 Innovative Uses of Clustering Algorithms in the Real Worlddatafloq.com
↑ KMeansscikit-learn.org
↑ Intro to k-MeansCoursera

[1] 7 Innovative Uses of Clustering Algorithms in the Real Worlddatafloq.com

[2] KMeansscikit-learn.org

[coursera-3] Intro to k-MeansCoursera

[1]

[2]

[3]

@@ Line 1: / Line 1: @@
-The K-Means clustering algorithm, a commonly used clustering algorithm, is an iterative process used to minimize the distance of the data point from the average data point in the cluster.<cite>[https://datafloq.com/read/7-innovative-uses-of-clustering-algorithms/6224]</cite> The k-means algorithm is one of the fastest clustering algorithms available.<cite>[https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html]</cite>
+The '''K-means clustering''' algorithm, a commonly used clustering algorithm, is an iterative [[unsupervised learning]] process used to minimize the distance of the data point from the average data point in the cluster.<ref>[https://datafloq.com/read/7-innovative-uses-of-clustering-algorithms/6224 7 Innovative Uses of Clustering Algorithms in the Real World]datafloq.com</ref> The k-means algorithm is one of the fastest clustering algorithms available.<ref>[https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html KMeans]scikit-learn.org</ref> K-means can group data only unsupervised based on the similarity of customers to each other. It is a type of partitioning [[clustering]], as it divides the data into K non-overlapping subsets or clusters without any cluster internal structure or labels. The objective of k-means is to form clusters in such a way that similar samples go into a cluster, and dissimilar samples fall into different clusters. It aims to minimize the “intra cluster” distances and maximize the “inter-cluster” distances, and to divide the data into non-overlapping clusters without any cluster-internal structure.<ref name="coursera">[https://www.coursera.org/learn/machine-learning-with-python/lecture/Ky5Wf/intro-to-k-means Intro to k-Means]Coursera</ref>
+== See also ==
+* [[Clustering]]
+* [[Cluster]]
+== References ==