Module: Machine Learning Fundamentals


The Maximum Margin Principle

30 min Advanced

Support Vector Machines (SVM)

The Core Idea

An SVM finds the line (or, in higher dimensions, the hyperplane) that separates two classes with the maximum margin, that is, the largest possible distance between the boundary and the nearest points of each class.

Intuition

Class A:  ●●●  ___________  ○○○  : Class B

The position of the line matters!
Too close to A? New A points are likely to land on the wrong side.
Too close to B? New B points are likely to land on the wrong side.
SVM places the boundary midway between the nearest points of the two classes (maximum margin).
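
The balance described above can be checked numerically in one dimension. The sketch below is only an illustration with made-up coordinates: it scores a few candidate boundary positions by their distance to the nearest point of either class. The names `class_a`, `class_b`, and `nearest_distance` are hypothetical and do not come from the lesson.

```python
# Hypothetical 1-D positions for the two classes in the diagram above.
class_a = [1.0, 2.0, 3.0]
class_b = [7.0, 8.0, 9.0]

def nearest_distance(boundary, a_points, b_points):
    """Distance from a candidate boundary to the closest point of either class."""
    return min(abs(p - boundary) for p in a_points + b_points)

# Try boundaries close to A, in the middle, and close to B.
candidates = [3.5, 4.0, 5.0, 6.0, 6.5]
scores = {c: nearest_distance(c, class_a, class_b) for c in candidates}

# The candidate with the largest nearest-point distance has the widest margin.
best = max(scores, key=scores.get)

print(scores)
print("widest margin at", best)
```

Among these candidates, the midpoint between the closest A point and the closest B point gets the highest score, which is the position a maximum-margin boundary would take for this toy data.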

Linear vs Non-Linear

Linear SVM

For linearly separable data:

2x + 3y + 1 = 0  (decision boundary)

In general the boundary is w·x + b = 0; in the example above, w = (2, 3) and b = 1. Training finds the weights (w) and bias (b) that maximize the margin.
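
To make the roles of w and b concrete, here is a minimal sketch that uses the example boundary 2x + 3y + 1 = 0 directly. It does not train anything: the weights are simply read off the equation, and the function names are hypothetical.

```python
import math

# Weights and bias read off the example boundary 2x + 3y + 1 = 0.
W = (2.0, 3.0)
B = 1.0

def decision_value(point, w=W, b=B):
    """Signed score w.x + b: positive on one side of the boundary, negative on the other."""
    return sum(wi * xi for wi, xi in zip(w, point)) + b

def predict(point, w=W, b=B):
    """Label a point +1 or -1 according to the side of the boundary it falls on."""
    return 1 if decision_value(point, w, b) >= 0 else -1

def distance_to_boundary(point, w=W, b=B):
    """Geometric distance from the point to the line w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return abs(decision_value(point, w, b)) / norm

for p in [(1.0, 1.0), (-2.0, -2.0), (1.0, -1.0)]:
    print(p, predict(p), round(distance_to_boundary(p), 3))
```

The distance function is what "margin" measures: an SVM chooses w and b so that the smallest such distance over the training points is as large as possible.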

Non-Linear SVM (Kernel Trick)

For data that no straight line can separate (spirals, concentric circles), a kernel implicitly maps the points into a higher-dimensional space where a linear boundary can separate them:

Original 2D data (not separable)
   ↓ (Kernel transformation)
Higher dimension (separable)
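
The idea can be illustrated with an explicit mapping, assuming two concentric rings of points. This is a simplification: a real kernel SVM never builds the higher-dimensional coordinates explicitly, it only evaluates a kernel function between pairs of points. The `lift` function and the sample points are hypothetical.

```python
def lift(point):
    """Map a 2-D point (x, y) to 3-D by adding its squared distance from the origin."""
    x, y = point
    return (x, y, x * x + y * y)

# Hypothetical data: an inner ring (radius 0.5) surrounded by an outer ring (radius 2).
inner = [(0.5, 0.0), (0.0, -0.5), (-0.3, 0.4)]
outer = [(2.0, 0.0), (0.0, -2.0), (-1.2, 1.6)]

# In 2-D the outer ring surrounds the inner one, so no straight line separates them.
# After lifting, the third coordinate alone separates the classes: the plane z = 1 works.
inner_z = [lift(p)[2] for p in inner]
outer_z = [lift(p)[2] for p in outer]

print("inner z:", inner_z)
print("outer z:", outer_z)
print("separable by z = 1:", max(inner_z) < 1.0 < min(outer_z))
```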

Common Kernels:

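Linear: For data that is already (close to) linearly separable
RBF (Radial Basis Function): The most common default; handles many non-linear boundaries
Polynomial: Useful when classes are separated by polynomial relationships between features
Sigmoid: Based on tanh, the same function used as a neural-network activation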
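
For reference, the four kernels above can be written as plain functions of two points. These follow the standard textbook definitions; the parameter names `gamma`, `coef0`, and `degree` and their default values are assumptions chosen for illustration, not something the lesson specifies.

```python
import math

def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def linear_kernel(a, b):
    return dot(a, b)

def polynomial_kernel(a, b, degree=3, gamma=1.0, coef0=1.0):
    return (gamma * dot(a, b) + coef0) ** degree

def rbf_kernel(a, b, gamma=1.0):
    # Depends only on the squared distance: 1.0 for identical points, near 0 for distant ones.
    squared_distance = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * squared_distance)

def sigmoid_kernel(a, b, gamma=1.0, coef0=0.0):
    return math.tanh(gamma * dot(a, b) + coef0)

p, q = (1.0, 2.0), (3.0, 4.0)
print("linear    ", linear_kernel(p, q))
print("polynomial", polynomial_kernel(p, q, degree=2))
print("rbf       ", rbf_kernel(p, q))
print("sigmoid   ", sigmoid_kernel(p, q, gamma=0.1))
```

Each function returns a single similarity score for a pair of points, which is all a kernel SVM needs in place of explicit higher-dimensional coordinates.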

Support Vectors

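Support vectors are the training points closest to the decision boundary. Only these determine where the boundary goes!

If you move a far-away point (and it stays outside the margin), the decision boundary doesn't change
If you move a support vector, the boundary shifts
SVM is efficient at prediction time: the trained model stores only the support vectors (often a small fraction of the data)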
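
These properties are easy to verify in the simplest possible setting. The sketch below assumes 1-D data in which every Class A value lies to the left of every Class B value; in that special case the maximum-margin boundary is just the midpoint between the two closest opposing points. It is not a general SVM solver, and `max_margin_boundary` is a hypothetical helper.

```python
def max_margin_boundary(class_a, class_b):
    """1-D hard-margin case: return (boundary, margin, support_vectors).

    Assumes every value in class_a is smaller than every value in class_b.
    """
    sv_a = max(class_a)  # the A point nearest to B
    sv_b = min(class_b)  # the B point nearest to A
    if sv_a >= sv_b:
        raise ValueError("classes overlap; this sketch needs separable data")
    boundary = (sv_a + sv_b) / 2
    margin = (sv_b - sv_a) / 2
    return boundary, margin, (sv_a, sv_b)

original = max_margin_boundary([1.0, 2.0, 3.0], [7.0, 8.0, 9.0])

# Move a far-away A point (1.0 -> -10.0): the boundary is unchanged.
far_moved = max_margin_boundary([-10.0, 2.0, 3.0], [7.0, 8.0, 9.0])

# Move a support vector (3.0 -> 4.0): the boundary shifts.
sv_moved = max_margin_boundary([1.0, 2.0, 4.0], [7.0, 8.0, 9.0])

print("original  ", original)
print("far moved ", far_moved)
print("sv moved  ", sv_moved)
```

Only the two returned support vectors are needed to reproduce the boundary, which is why a trained model can discard the remaining points.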

Pros & Cons

Pros:

✓ Often works well on small to medium tabular datasets
✓ Non-linear kernels handle complex boundaries
✓ Memory efficient (the trained model keeps only the support vectors)
✓ Works well in high-dimensional spaces

Cons:

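✗ Slow to train on large datasets (kernel SVM training scales roughly between O(n²) and O(n³) in the number of samples)
✗ Hard to interpret which features matter, especially with non-linear kernels
✗ Sensitive to feature scaling (standardize or normalize features first!)
✗ Hyperparameter tuning is critical (C sets the penalty for margin violations, gamma sets the reach of the RBF kernel)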
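
The scaling problem can be seen without training a model. The sketch below uses hypothetical rows of (age, income) and an RBF kernel with gamma = 1: before scaling, the income column dominates the distance so heavily that two similar people get a similarity of essentially zero. The `standardize` helper is an illustrative stand-in for whatever scaler you use in practice.

```python
import math
from statistics import mean, pstdev

def standardize(rows):
    """Rescale each column to zero mean and unit (population) standard deviation."""
    columns = list(zip(*rows))
    means = [mean(c) for c in columns]
    stds = [pstdev(c) or 1.0 for c in columns]  # fall back to 1.0 for constant columns
    return [tuple((v - m) / s for v, m, s in zip(row, means, stds)) for row in rows]

def rbf_kernel(a, b, gamma=1.0):
    squared_distance = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * squared_distance)

# Hypothetical rows: (age in years, income in dollars).
rows = [(25.0, 40000.0), (30.0, 42000.0), (45.0, 90000.0), (50.0, 95000.0)]
scaled = standardize(rows)

# Rows 0 and 1 describe similar people, but the raw income gap of 2000 swamps the kernel.
raw_similarity = rbf_kernel(rows[0], rows[1])
scaled_similarity = rbf_kernel(scaled[0], scaled[1])

print("raw similarity:   ", raw_similarity)
print("scaled similarity:", scaled_similarity)
```

After standardizing, both features contribute on a comparable scale and the two rows register as similar. The same reasoning explains why gamma needs tuning: it multiplies the squared distance, so it controls how quickly similarity falls off.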
