Machine Learning

Comprehensive Guide to Machine Learning Algorithms and Methods

From foundations to advanced methods - A systematic path to mastering supervised learning, unsupervised learning, and practical ML engineering

Machine Learning
저자

Kwangmin Kim

공개

2024년 11월 10일

1 Learning Path

1.1 Phase 1: Foundations (2-3개월, 수학적 기초와 ML 핵심 개념 확립)

Mathematical Prerequisites - Linear Algebra ⭐ - Calculus and Optimization ⭐ - Probability and Statistics ⭐

ML Fundamentals - ML 개념과 학습 유형 - Bias-Variance Tradeoff ⭐ - Data Preprocessing - Feature Engineering ⭐

Basic Algorithms - Linear Regression ⭐ - Logistic Regression ⭐ - Decision Trees ⭐

1.2 Historical Methods

1.3 Phase 2: Core Algorithms (2-3개월, 주요 알고리즘)

Classification & Regression - SVM ⭐ - KNN - Naive Bayes - Regularization ⭐

Ensemble Methods - Random Forest ⭐ - Gradient Boosting ⭐ - XGBoost ⭐ - LightGBM ⭐

Unsupervised Learning - K-Means ⭐ - Hierarchical Clustering - PCA ⭐ - DBSCAN

1.4 Phase 3: Advanced Methods (2-3개월, 고급 기법)

Model Evaluation - Cross-Validation ⭐ - Performance Metrics ⭐ - Hyperparameter Tuning ⭐

Advanced Topics - Imbalanced Learning ⭐ - Gaussian Processes - Model Interpretability (SHAP) ⭐

Special Applications - Time Series - NLP Basics - Recommender Systems

1.5 Phase 4: ML Engineering (2개월, 프로덕션 배포와 운영)

Production ML - Model Serving - Model Monitoring ⭐ - A/B Testing for Models

Scalability - Distributed Training - Model Compression - Feature Stores

2 Implementation Notes

2.1 Python Libraries

Interpretability - import shap ⭐ - import lime

Production - import mlflow ⭐ - import bentoml

3 Foundations

3.1 Mathematical Prerequisites

3.2 ML Fundamentals

4 Supervised Learning

4.1 Regression (회귀)

4.2 Classification (분류)

4.3 Ensemble Methods (앙상블 방법)

5 Unsupervised Learning

5.1 Clustering (군집화)

5.2 Dimensionality Reduction (차원 축소)

5.3 Anomaly Detection (이상 탐지)

5.4 Association Rule Learning (연관 규칙 학습)

6 Model Evaluation and Selection

6.1 Performance Metrics (성능 지표)

6.2 Model Validation (모델 검증)

6.3 Hyperparameter Tuning (하이퍼파라미터 튜닝)

6.4 Model Selection (모델 선택)

7 Advanced Topics

7.1 Imbalanced Learning (불균형 학습)

7.2 Probabilistic Models (확률 모형)

7.3 Online Learning (온라인 학습)

7.4 Semi-Supervised Learning (준지도학습)

7.5 Transfer Learning (전이학습)

7.6 Interpretability and Explainability (해석가능성)

7.7 Fairness and Bias (공정성과 편향)

8 ML Engineering

8.1 Production ML (프로덕션 ML)

8.2 Scalability (확장성)

9 Special Applications

9.1 ML for Longitudinal Data (종단 데이터 ML)

9.2 Time Series Analysis (시계열 분석)

9.3 Natural Language Processing (자연어처리)

9.4 Computer Vision (컴퓨터 비전)

9.5 Recommender Systems (추천 시스템)

10 Key Resources

10.1 Books

  • Foundations:
    • Bishop (2006). “Pattern Recognition and Machine Learning”
    • Murphy (2012). “Machine Learning: A Probabilistic Perspective”
    • Hastie, Tibshirani, Friedman (2009). “The Elements of Statistical Learning” ⭐
  • Practical:
    • Géron (2019). “Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow”
    • Kuhn and Johnson (2013). “Applied Predictive Modeling”
  • Advanced:
    • Raschka and Mirjalili (2019). “Python Machine Learning”
    • Chollet (2021). “Deep Learning with Python”

10.2 Online Courses

  • Andrew Ng - Machine Learning (Coursera) ⭐
  • Fast.ai - Practical Deep Learning for Coders
  • Stanford CS229 - Machine Learning

10.3 Papers

  • Random Forests: Breiman (2001). “Random Forests”
  • XGBoost: Chen and Guestrin (2016). “XGBoost: A Scalable Tree Boosting System”
  • SHAP: Lundberg and Lee (2017). “A Unified Approach to Interpreting Model Predictions”

Subscribe

Enjoy this blog? Get notified of new posts by email: