Intel Scene Classification

PyTorch / EfficientNetV2 / Transfer Learning

ML Project

Overview

A multi-class scene classification system using EfficientNetV2-S with two-phase transfer learning from ImageNet pre-trained weights. The model classifies 224×224 RGB images into 6 scene categories: Buildings, Forest, Glacier, Mountain, Sea, and Street.

Training follows a two-phase strategy: Phase 1 freezes the backbone and trains only the custom classification head for 5 epochs (lr=1e-3), then Phase 2 unfreezes the entire network for fine-tuning over 15 epochs (lr=1e-4). Training uses AdamW optimizer, CosineAnnealingLR scheduling, mixed precision, and label smoothing (0.1) for regularization.

Achieves 95% top-1 accuracy with ~20.3M parameters. Grad-CAM visualizations are included to provide interpretability — highlighting the image regions the model attends to when making predictions.

Technologies

PyTorch

EfficientNetV2-S

Transfer Learning

Grad-CAM

AdamW

Python

Results

Top-1 Accuracy: 95%

Parameters: ~20.3M

6 scene categories

Includes Grad-CAM

← Back to Projects