machinelearning
-
Machine Learning with Python - Clustering
Data Science 2022. 10. 7. 20:27
What is clustering? A cluster is a group of objects that are similar to the other objects in the same cluster and dissimilar to data points in other clusters. Then what is the difference between classification and clustering? The main difference is that classification is used for labeled data, whereas clustering is used for unlabeled data (it is mainly used in unsupervised learning). Where is clustering used? How can we determine the similarity or d..
-
Machine Learning with Python - Classification (in progress)
Data Science 2022. 8. 20. 23:01
- What is classification? A supervised approach that categorizes unknown items into a discrete set of categories, or classes - Normally, an unlabeled test case is assigned a default value and marked as 0 or 1 -> binary classifier. There is also multi-class classification, where there are several categories - Types of classification - What is K-Nearest Neighbor classification (the KNN algorithm)? It assigns a case to the class of the data points nearest to it - K-nearest neighbors algorithm process 1. Pick a..
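The excerpt above cuts off before the full KNN process is listed, so the following is only a minimal sketch of the commonly described procedure (measure distances, keep the k closest points, take a majority vote), not necessarily the exact steps of the post. The function name `knn_predict`, the choice of Euclidean distance, and the toy data are assumptions made for illustration.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest points.

    Hypothetical helper for illustration; Euclidean distance is an assumption.
    """
    # Distance from the query to every labeled training point
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbors
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    # Majority vote over the labels of those neighbors
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data (made up): two small groups labeled 0 and 1
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5), (6.0, 6.0), (6.5, 7.0), (7.0, 6.5)]
labels = [0, 0, 0, 1, 1, 1]

pred_a = knn_predict(points, labels, (1.8, 1.6))
pred_b = knn_predict(points, labels, (6.2, 6.4))
print(pred_a, pred_b)
```

With two classes, an odd k avoids tied votes; in practice k is usually chosen by comparing accuracy on held-out data.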
-
Machine Learning with Python - Regression (Simple, Multiple, Non-linear regression)
Data Science 2022. 8. 19. 11:03
Regression: the process of predicting a continuous value. Types of regression models: Simple Regression / Multiple Regression. Simple Linear Regression: uses one independent variable (x) to estimate one dependent variable (y). Multiple Linear Regression: uses several independent variables to estimate one dependent variable. In the Simple Linear Regression formula, θ1 is called the coefficient and θ0 is called the intercept. How to find t..
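Simple linear regression is usually written as ŷ = θ0 + θ1·x, with θ0 the intercept and θ1 the coefficient. The excerpt is truncated before it says how the parameters are found, so the sketch below assumes ordinary least squares, which is one common choice. The function name `fit_simple_linear_regression` and the sample data are hypothetical.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (theta0, theta1) for y_hat = theta0 + theta1 * x.

    Assumes an ordinary least-squares fit; illustrative helper only.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # theta1 = sum((x - mean_x) * (y - mean_y)) / sum((x - mean_x) ** 2)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    theta1 = sxy / sxx
    # The fitted line passes through the point (mean_x, mean_y)
    theta0 = mean_y - theta1 * mean_x
    return theta0, theta1


# Made-up data that lies exactly on y = 1 + 2x
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

theta0, theta1 = fit_simple_linear_regression(xs, ys)
prediction = theta0 + theta1 * 5.0  # predict y for a new x = 5
print(theta0, theta1, prediction)
```

Multiple linear regression follows the same idea with one coefficient per independent variable.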
-
Machine Learning with Python - Intro
Data Science 2022. 8. 18. 17:50
Python libraries for machine learning: NumPy, Pandas, Scikit-learn. Scikit-learn features: preprocessing, model_selection, building a classifier, fitting the model, confusion_matrix (printing the results). Supervised vs. unsupervised learning. Supervised model: how to teach? By labeling the dataset. Unsupervised learning techniques: dimension reduction / density estimation / market basket analysis / Cluster..
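The Scikit-learn features listed in the excerpt can be chained into one short supervised workflow. This is a minimal sketch, not the post's own code: the iris dataset, the `KNeighborsClassifier` model, and the parameter values (`test_size=0.2`, `random_state=4`, `n_neighbors=5`) are assumptions chosen for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

# Labeled dataset: 150 flowers, 4 features, 3 classes
X, y = load_iris(return_X_y=True)

# model_selection: hold out 20% of the rows for testing,
# keeping the class proportions the same in both splits
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=4, stratify=y
)

# preprocessing: standardize features using statistics from the training split only
scaler = StandardScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Build the classifier and fit the model
clf = KNeighborsClassifier(n_neighbors=5)
clf.fit(X_train_scaled, y_train)

# confusion_matrix: rows are true classes, columns are predicted classes
y_pred = clf.predict(X_test_scaled)
cm = confusion_matrix(y_test, y_pred)
print(cm)
```

Fitting the scaler on the training split only keeps information from the test rows out of the model.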