Data Science

Machine Learning with Python - Clustering

Hiru_93 2022. 10. 7. 20:27
  • What is clustering?

 

A group of objects that are similar to other objects in the cluster, and dissimilar to data points in other clusters.

  • Then what is the difference between classification and clustering?
The main difference is classification is used for labeled data, whereas clustering is used for non-labeled data(비지도 학습에 주로 사용)
  • Where is clustering used?

  • How can we determine the similarity or dissimilarity?

Clustering 의 핵심은 intra cluster distances는 최소화하고, inter cluster distances는 최대화하는 것이다