Custom CNN Medical Image Classification with Transfer Learning PyTorch Tutorial

deep_learning

Custom CNN Medical Image Classification with Transfer Learning PyTorch Tutorial

Learn to build custom CNNs for medical image classification using PyTorch and transfer learning. Master chest X-ray pneumonia detection with preprocessing, evaluation, and deployment techniques.

Jul 31, 2025

Custom CNN Medical Image Classification with Transfer Learning PyTorch Tutorial

I’ve been exploring how deep learning can transform healthcare, particularly in diagnosing conditions like pneumonia from chest X-rays. The potential to assist medical professionals with accurate, rapid analysis motivates this work. Let me guide you through building a robust CNN system using PyTorch - I’ll share practical insights and code from my experiments.

Medical imaging presents distinct hurdles. High-resolution data requires careful handling to preserve critical details. Class imbalance often skews datasets - healthy cases typically outnumber problematic ones. Pathological signs can be incredibly subtle. Why does this matter? Because diagnostic tools demand exceptional reliability. We must ensure models focus on clinically relevant features and perform consistently across diverse patient groups.

Setting up our environment starts with key libraries:

import torch
import torchvision.transforms as T
from torch.utils.data import DataLoader, WeightedRandomSampler
import albumentations as A
from albumentations.pytorch import ToTensorV2
import cv2

Handling medical images requires specialized preprocessing. My custom dataset class incorporates techniques like CLAHE (Contrast Limited Adaptive Histogram Equalization) to enhance subtle features:

class MedicalImageDataset(Dataset):
    def _load_and_preprocess_image(self, img_path):
        image = cv2.imread(str(img_path), cv2.IMREAD_GRAYSCALE)
        image = cv2.resize(image, (224, 224))
        clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
        image = clahe.apply(image)
        return cv2.cvtColor(image, cv2.COLOR_GRAY2RGB)

Augmentation strategies must preserve diagnostic integrity. Notice how these transformations maintain clinical relevance while expanding our dataset:

train_transform = A.Compose([
    A.Rotate(limit=10, p=0.3),
    A.HorizontalFlip(p=0.5),
    A.RandomBrightnessContrast(brightness_limit=0.1, p=0.3),
    A.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
    ToTensorV2()
])

For model architecture, transfer learning accelerates development while custom CNNs offer control. How do we balance these approaches? Here’s a flexible implementation:

def create_model(use_pretrained=True):
    if use_pretrained:
        model = models.densenet121(weights='IMAGENET1K_V1')
        model.classifier = nn.Linear(model.classifier.in_features, 2)
    else:
        model = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.AdaptiveAvgPool2d((1,1)),
            nn.Flatten(),
            nn.Linear(64, 2)
    return model.to(device)

Class imbalance requires strategic handling. Weighted loss functions and sampling techniques prevent model bias toward majority classes:

def get_weighted_sampler(dataset):
    class_weights = dataset.get_class_weights()
    sample_weights = [class_weights[label] for _, label in dataset]
    return WeightedRandomSampler(sample_weights, len(sample_weights))

train_loader = DataLoader(train_data, batch_size=32, 
                         sampler=get_weighted_sampler(train_data))

Evaluation extends beyond accuracy. Medical contexts demand scrutiny of sensitivity, specificity, and AUC-ROC:

def calculate_metrics(y_true, y_pred, y_proba):
    print(classification_report(y_true, y_pred))
    print(f"AUC-ROC: {roc_auc_score(y_true, y_proba[:,1]):.4f}")
    cm = confusion_matrix(y_true, y_pred)
    sensitivity = cm[1,1]/(cm[1,1]+cm[1,0])
    specificity = cm[0,0]/(cm[0,0]+cm[0,1])
    print(f"Sensitivity: {sensitivity:.4f}, Specificity: {specificity:.4f}")

For interpretability, Grad-CAM visualizations reveal where the model focuses - crucial for clinical trust:

from gradcam import GradCAM

def generate_gradcam(model, image, target_layer):
    cam = GradCAM(model=model, target_layer=target_layer)
    heatmap = cam(input_tensor=image.unsqueeze(0))
    return heatmap

Deployment considerations differ significantly in healthcare. Model explanations become non-negotiable. Regulatory compliance requires thorough documentation. Continuous monitoring ensures performance consistency across demographic groups. How might we validate real-world effectiveness before clinical integration?

Through iterative refinement, we achieve models that complement medical expertise. The journey involves constant balancing - between model complexity and explainability, between computational efficiency and diagnostic precision. I’ve found that maintaining a feedback loop with medical professionals substantially improves practical utility.

What possibilities open when we combine deep learning with medical imaging? I’m excited by the potential to expand diagnostic access and reduce human fatigue factors. If you found these insights valuable, share your thoughts below - like this article if it helps your projects, and share with others exploring AI in medicine. Your experiences could help shape better tools for healthcare.

Share: Facebook Twitter Reddit LinkedIn WhatsApp Telegram Pinterest Email Instagram

deep_learning

Custom CNN Medical Image Classification with Transfer Learning PyTorch Tutorial

Our Creations

We are on Medium

Similar Posts

Master PyTorch CNN Image Classification: Build Custom Models with Transfer Learning and Advanced Training

Build Custom CNN with Transfer Learning PyTorch: Complete Image Classification Tutorial 2024

Build Convolutional Autoencoder for Image Denoising with PyTorch: Complete Implementation Guide

Build Real-Time Object Detection System: YOLOv8 OpenCV Python Tutorial for Beginners 2024

How to Build a Sound Classification System with Deep Learning and Python

Custom CNN PyTorch Tutorial: Image Classification with Data Augmentation and Transfer Learning