Classifying Digits with Logistic Regression

Personal Projects #Data Science#Python

Overview#

A digit classification project using PyTorch to implement logistic regression on the MNIST dataset, demonstrating foundational deep learning concepts including forward pass, backpropagation, and optimization.

Key Achievements#

Achieved 82.6% accuracy on test set after 5 epochs
Implemented PyTorch training pipeline

Implementation#

Custom Logistic Regression Class#

Built a PyTorch model with:

Constructor: Initializes mapping input pixels to output classes (784px to 10)
Forward Method: Uses linear transformation to calculate softmax probabilities

Training Pipeline#

Data Loading: Batch processing to split MNIST into training and test sets
Loss Function: Cross-entropy for multiclass classification
Optimizer: Stochastic Gradient Descent with learning rate 0.001
Training Loop: Forward pass -> calculate loss -> backward pass -> update weights

Hyperparameters#

Parameter	Value	Reason
Input Size	784	28x28 px flattened
Num Classes	10	Digits 0-9
Epochs	5	Training iterations
Batch Size	100	Memory efficiency
Learning Rate	0.001	Standard SGD

Training Process#

1
# For each epoch:
2
for images, labels in trainingData:
3

4
    # set images and labels
5
    images = Variable(images.view(-1,28*28))
6
    labels = Variable(labels)
7

8
    # reset gradients to 0
9
    optimize.zero_grad()
10

11
    # forward pass
12
    output=model(images)
13
    loss = cross_entropy(output,labels)
14

15
    # backward pass
16
    loss.backward()
17

18
    # update weights
19
    optimize.step()