Clustering: Difference between revisions

From Machinelearning
No edit summary
Line 1: Line 1:
'''Clustering''' is an [[unsupervised learning]] technique. It is used for grouping data points, or objects that are somehow similar. Clustering means finding clusters in a dataset, unsupervised.<ref name="MLPython">[https://www.coursera.org/learn/machine-learning-with-python/lecture/Nlxjw/intro-to-clustering Intro to Clustering]Coursera</ref>
'''Clustering''' is an [[unsupervised learning]] technique. It is used for grouping data points, or objects that are somehow similar. Clustering means finding clusters in a dataset, unsupervised.<ref name="MLPython">[https://www.coursera.org/learn/machine-learning-with-python/lecture/Nlxjw/intro-to-clustering Intro to Clustering]Coursera</ref>
== Motivation ==
Generally, clustering can be used for one of the following purposes<ref name="MLPython"/>:
* Exploratory data analysis
* Summary generation
* Outlier detection
* Finding duplicates
* Pre-processing step


== Types pof clustering ==
== Types pof clustering ==

Revision as of 22:25, 31 March 2020

Clustering is an unsupervised learning technique. It is used for grouping data points, or objects that are somehow similar. Clustering means finding clusters in a dataset, unsupervised.[1]

Motivation

Generally, clustering can be used for one of the following purposes[1]:

  • Exploratory data analysis
  • Summary generation
  • Outlier detection
  • Finding duplicates
  • Pre-processing step

Types pof clustering

Some divide clustering into two subgroups[2]:

  • Hard clustering: Each data point either belongs to a cluster completely or not.
  • Soft clustering: A probability or likelihood is assigned for putting data points into separate clusters.

Clustering vs classification

Algorithms

Some of the commonly used clustering algorithms are[3]:


others

Applications

References

References