K-means clustering: Difference between revisions

From Machinelearning
(Created page with "The K-Means clustering algorithm, a commonly used clustering algorithm, is an iterative process used to minimize the distance of the data point from the average data point in...")
 
No edit summary
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
The K-Means clustering algorithm, a commonly used clustering algorithm, is an iterative process used to minimize the distance of the data point from the average data point in the cluster.<cite>[https://datafloq.com/read/7-innovative-uses-of-clustering-algorithms/6224]</cite> The k-means algorithm is one of the fastest clustering algorithms available.<cite>[https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html]</cite>
The '''K-means clustering''' algorithm, a commonly used clustering algorithm, is an iterative [[unsupervised learning]] process used to minimize the distance of the data point from the average data point in the cluster.<ref>[https://datafloq.com/read/7-innovative-uses-of-clustering-algorithms/6224 7 Innovative Uses of Clustering Algorithms in the Real World]datafloq.com</ref> The k-means algorithm is one of the fastest clustering algorithms available.<ref>[https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html KMeans]scikit-learn.org</ref> K-means can group data only unsupervised based on the similarity of customers to each other. It is a type of partitioning [[clustering]], as it divides the data into K non-overlapping subsets or clusters without any cluster internal structure or labels. The objective of k-means is to form clusters in such a way that similar samples go into a cluster, and dissimilar samples fall into different clusters. It aims to minimize the “intra cluster” distances and maximize the “inter-cluster” distances, and to divide the data into non-overlapping clusters without any cluster-internal structure.<ref name="coursera">[https://www.coursera.org/learn/machine-learning-with-python/lecture/Ky5Wf/intro-to-k-means Intro to k-Means]Coursera</ref>
 
== See also ==
 
* [[Clustering]]
* [[Cluster]]
 
== References ==

Latest revision as of 21:10, 31 March 2020

The K-means clustering algorithm, a commonly used clustering algorithm, is an iterative unsupervised learning process used to minimize the distance of the data point from the average data point in the cluster.[1] The k-means algorithm is one of the fastest clustering algorithms available.[2] K-means can group data only unsupervised based on the similarity of customers to each other. It is a type of partitioning clustering, as it divides the data into K non-overlapping subsets or clusters without any cluster internal structure or labels. The objective of k-means is to form clusters in such a way that similar samples go into a cluster, and dissimilar samples fall into different clusters. It aims to minimize the “intra cluster” distances and maximize the “inter-cluster” distances, and to divide the data into non-overlapping clusters without any cluster-internal structure.[3]

See also

References