MACHINE LEARNINGSHIPPED

CIFAR-10 Softmax Classifier

A single-layer softmax classifier from scratch in NumPy — full training pipeline, gradient verification against autograd, and a systematic hyperparameter study. 39.30% test accuracy.

ROLESole author

TEAMIndividual — KTH Deep Learning in Data Science

TIMELINEMar 2026 — Jun 2026

CATEGORYMachine Learning

SOURCE REPOSITORY

39.30%

TEST ACCURACY

~1e-16

GRADIENT MAX ERROR

λ=0.1, η=0.001

BEST CONFIG

Overview

The first project in KTH's Deep Learning in Data Science course: a single-layer softmax classifier built from scratch in NumPy, trained on CIFAR-10 with mini-batch gradient descent and L2 regularization. Final test accuracy: 39.30% with the optimal hyperparameters (λ = 0.1, η = 0.001) — solid for a linear model staring at raw pixels.

The pipeline

Data normalization, forward pass, cross-entropy loss, and analytic backpropagation — all hand-derived and hand-implemented.
Gradients verified against PyTorch autograd to machine precision (max error ~1e-16), so every later experiment ran on provably correct arithmetic.
Weight visualization: rendering each class's weight matrix as an image reveals the ghostly "template" the linear model learns for each class — a blurry car, a sky-over-green-field for deer.

Hyperparameter study

I ran a systematic sweep across learning rates and regularization strengths and documented two clean findings in the LaTeX report:

High learning rates cause oscillation — the loss curve never settles, bouncing around the minimum.
Strong regularization improves weight interpretability at the cost of accuracy — the class templates become visibly smoother and more recognizable as λ grows, while test accuracy drops.

Source

Code and report: github.com/rippyboii/cifar10-softmax-classifier

APPENDIX A // MEDIA

IMAGE SLOT — AWAITING UPLOAD

FIG.01 — LEARNED WEIGHT TEMPLATES PER CIFAR-10 CLASS

IMAGE SLOT — AWAITING UPLOAD

FIG.02 — TRAIN/VALIDATION LOSS ACROSS CONFIGURATIONS