Project Showcase - CVV_15M

Overview

CVV_15M_SARS-CoV-2 is a convolutional neural network that classifies chest X-ray images with high accuracy on mobile hardware. It is tuned for performance on Apple M-series chips.

Key Features

High-accuracy classification (94.0% test accuracy)
Three-class discrimination: COVID-19, normal, and viral pneumonia
Mixed precision training
TensorFlow Metal acceleration for Apple M-series chips

Project Gallery

Model architecture and training visualization

Confusion matrix and performance metrics

Background & Methodology

The implementation combines ReLU activation functions, local response normalization (LRN), overlapping pooling, and dropout layers. Together these let the model learn discriminative patterns from chest X-ray images.

ReLU Activation Function

The network uses ReLU activations:

\[ f(x) = \max(0, x) \]

ReLU accelerates training on smaller datasets by mitigating vanishing gradients and promoting sparse activations.
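
As a small illustration of the formula above, the following plain-Python sketch applies ReLU to a handful of values and shows its gradient. The helper names `relu` and `relu_grad` are hypothetical and are not taken from the project code.

```python
def relu(x: float) -> float:
    """Return max(0, x), the ReLU activation."""
    return max(0.0, x)


def relu_grad(x: float) -> float:
    """Gradient of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


inputs = [-2.0, -0.5, 0.0, 1.5, 3.0]
activations = [relu(v) for v in inputs]      # negative inputs become 0 (sparse output)
gradients = [relu_grad(v) for v in inputs]   # positive inputs pass gradient unchanged
```

Negative inputs map to zero, which is where the sparsity comes from, while positive inputs keep a gradient of 1 instead of a shrinking one.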

Local Response Normalization (LRN)

To enhance feature selectivity, our model implements LRN, which encourages competition among neighboring neurons:

\[ b_{i,x,y} = \frac{a_{i,x,y}}{\left(k + \alpha \sum_{j=\max(0,i-\frac{n}{2})}^{\min(N-1,i+\frac{n}{2})} (a_{j,x,y})^2 \right)^{\beta}} \]

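Here \( a_{i,x,y} \) is the activation of channel \( i \) at position \( (x, y) \), \( N \) is the number of channels, \( n \) is the size of the normalization window, and \( k \), \( \alpha \), \( \beta \) are hyperparameters. Because each activation is divided by the summed squared activity of its neighboring channels, LRN damps responses that are large across many channels at once, which helps reduce redundancy when image features overlap.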
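
The LRN formula can be sketched in plain Python for the channel activations at a single spatial position. The default values for `n`, `k`, `alpha`, and `beta` below are assumptions (the source does not state the values used), and the half-window is taken as the integer part of n/2.

```python
def lrn(a, n=5, k=2.0, alpha=1e-4, beta=0.75):
    """Apply local response normalization across channels.

    `a` holds the activations of all N channels at one (x, y) position.
    Hyperparameter defaults are illustrative assumptions.
    """
    num_channels = len(a)
    half = n // 2  # integer half-window, an assumption for odd n
    out = []
    for i in range(num_channels):
        lo = max(0, i - half)
        hi = min(num_channels - 1, i + half)
        # Sum of squared activations over the neighboring channels.
        squared_sum = sum(a[j] ** 2 for j in range(lo, hi + 1))
        out.append(a[i] / (k + alpha * squared_sum) ** beta)
    return out


# With k = alpha = beta = 1 and a window of 3, both channels see the same sum 1 + 4 = 5.
normalized = lrn([1.0, 2.0], n=3, k=1.0, alpha=1.0, beta=1.0)
```

In TensorFlow the equivalent operation is available as `tf.nn.local_response_normalization`; whether the project calls it directly is not stated in the source.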

Overlapping Pooling

The pooling layers use overlapping regions: the stride \( s \) is smaller than the window size \( z \) (\( s < z \)), so adjacent windows share inputs and the model retains finer spatial detail from the X-rays.
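
To make the difference concrete, here is a one-dimensional sketch comparing overlapping and non-overlapping max pooling. The window and stride values (z = 3, s = 2 versus z = 2, s = 2) are illustrative assumptions; the source does not state the values used in the model.

```python
def max_pool_1d(values, z, s):
    """Max pooling over a 1-D sequence with window size z and stride s."""
    return [max(values[i:i + z]) for i in range(0, len(values) - z + 1, s)]


row = [9, 1, 2, 3, 8, 1, 4]

# Overlapping (s < z): windows [9,1,2], [2,3,8], [8,1,4] share their edge elements.
overlapping = max_pool_1d(row, z=3, s=2)

# Non-overlapping (s == z): windows [9,1], [2,3], [8,1] are disjoint.
non_overlapping = max_pool_1d(row, z=2, s=2)
```

In the overlapping case the strong response at index 4 contributes to two adjacent outputs, whereas with disjoint windows it appears only once.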

Dropout Regularization

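To counter overfitting, dropout with rate \( p = 0.5 \) is applied in the fully connected layers. During training each hidden activation \( h_j \) is zeroed with probability \( p \) and otherwise rescaled:

\[ \tilde{h}_j = \frac{m_j}{1 - p}\, h_j, \qquad m_j \sim \mathrm{Bernoulli}(1 - p) \]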
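
The effect of a 0.5 dropout rate can be sketched in plain Python. This uses the "inverted dropout" convention, in which kept units are scaled by 1 / (1 - rate) during training so that no rescaling is needed at inference; Keras' `Dropout` layer follows this convention. The function name and the seeded generator are hypothetical and only serve the example.

```python
import random


def dropout(activations, rate=0.5, training=True, rng=None):
    """Inverted dropout: zero each unit with probability `rate`, scale the rest."""
    if not training or rate == 0.0:
        return list(activations)  # inference: activations pass through unchanged
    rng = rng or random.Random()
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


hidden = [0.2, 1.0, 0.7, 0.4]
train_out = dropout(hidden, rate=0.5, rng=random.Random(0))  # seeded for repeatability
eval_out = dropout(hidden, rate=0.5, training=False)
```

Each training output is either zero or twice the input, so the expected value of every unit matches its value at inference time.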

Mixed Precision Training

We use TensorFlow's mixed precision training policy to improve computational efficiency.

This approach allows us to:

Reduce memory consumption by using 16-bit representation during computation
Maintain numerical stability with 32-bit master weights
Leverage the M-series processors' optimized matrix multiplication units
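
In Keras, a mixed precision policy is typically enabled with `tf.keras.mixed_precision.set_global_policy("mixed_float16")`; the source does not show the exact call the project uses. The sketch below does not depend on TensorFlow. It uses only the standard library to illustrate why the weights are kept in 32-bit: small updates that vanish when accumulated in 16-bit survive in a 32-bit master copy. The update size and step count are arbitrary values chosen for the demonstration.

```python
import struct


def to_float16(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_float32(x: float) -> float:
    """Round x to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


update = 1e-4   # illustrative small weight update
steps = 100

# Accumulate directly in float16: near 1.0 the spacing is about 0.00098,
# so each tiny update is rounded away.
w16 = to_float16(1.0)
for _ in range(steps):
    w16 = to_float16(w16 + update)

# Accumulate in a float32 master weight: the updates add up as expected.
master = to_float32(1.0)
for _ in range(steps):
    master = to_float32(master + update)

# The 16-bit copy used for computation is derived from the master weight.
compute_copy = to_float16(master)
```

After the loop the pure 16-bit weight has not moved from 1.0, while the 32-bit master weight has advanced to roughly 1.01 and its 16-bit compute copy reflects that change.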

Implementation Details

Data Pipeline & Preprocessing

The model was trained on a dataset of 17,000 chest X-ray images across three categories.

Model Configuration

The network has the following structure:

Four convolutional blocks with increasing filter depths (64 to 512)
MaxPooling layers after each convolutional block
Two fully-connected layers (1024 neurons each) with dropout regularization
Final classification layer with softmax activation
Approximately 15 million trainable parameters
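
Parameter counts like the one above follow from simple per-layer formulas. The sketch below shows the arithmetic under loudly stated assumptions: 3x3 kernels, one convolution per block, RGB input, and filter depths doubling from 64 to 512. None of these details are given in the source, so the totals are illustrative and are not a reconstruction of the 15 million figure. In particular, the first fully connected layer is omitted because its size depends on the flattened feature map, which the source does not specify.

```python
def conv2d_params(kernel_h, kernel_w, in_channels, out_channels):
    """Weights plus one bias per output filter."""
    return (kernel_h * kernel_w * in_channels + 1) * out_channels


def dense_params(in_features, out_features):
    """Weights plus one bias per output unit."""
    return (in_features + 1) * out_features


filters = [64, 128, 256, 512]  # hypothetical progression within the stated 64 to 512 range
in_channels = 3                # assumption: RGB input
conv_total = 0
for out_channels in filters:
    conv_total += conv2d_params(3, 3, in_channels, out_channels)  # assumed 3x3 kernels
    in_channels = out_channels

# Second 1024-unit layer and the 3-class softmax layer.
head_total = dense_params(1024, 1024) + dense_params(1024, 3)
```

Under these assumptions the convolutional layers account for about 1.55 million parameters and the last two dense layers for about 1.05 million, which suggests that most of the remaining budget sits in the first fully connected layer.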

Performance & Results

The model achieved 94.0% test accuracy (0.9403) across the three categories:

Training Configuration

Image count: 17,000
IMAGE_SIZE: 224
BATCH_SIZE: 16
EPOCHS: 30
NUM_CLASSES: 3
Test accuracy: 0.9403

Confusion Matrix

Actual \ Predicted	Covid	Normal	Pneumonia
Covid	708	10	6
Normal	70	922	8
Pneumonia	11	14	244

Performance Analysis

COVID-19 Detection: The model correctly identified 708 out of 724 COVID-19 cases (97.8% recall), demonstrating exceptional sensitivity for this critical class
Normal Classification: 922 out of 1000 normal cases were correctly identified (92.2% recall), with most misclassifications (70 cases) incorrectly labeled as COVID-19
Pneumonia Identification: The model correctly classified 244 out of 269 pneumonia cases (90.7% recall)
Cross-Class Errors: Very few COVID-19 cases were misclassified as pneumonia (6 cases) and vice versa (11 cases)
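
The figures above can be checked directly against the confusion matrix. The sketch below treats rows as actual classes and columns as predicted classes, which is consistent with the per-class totals quoted in the analysis (724, 1000, and 269 cases). It also computes per-class precision, which the source does not report.

```python
labels = ["Covid", "Normal", "Pneumonia"]

# Rows: actual class. Columns: predicted class.
matrix = [
    [708, 10, 6],
    [70, 922, 8],
    [11, 14, 244],
]

total = sum(sum(row) for row in matrix)
correct = sum(matrix[i][i] for i in range(len(labels)))
accuracy = correct / total

# Recall: correct predictions divided by the number of actual cases (row sum).
recall = {labels[i]: matrix[i][i] / sum(matrix[i]) for i in range(len(labels))}

# Precision: correct predictions divided by the number of predictions (column sum).
precision = {
    labels[j]: matrix[j][j] / sum(matrix[i][j] for i in range(len(labels)))
    for j in range(len(labels))
}

for name in labels:
    print(f"{name}: recall={recall[name]:.3f} precision={precision[name]:.3f}")
print(f"accuracy={accuracy:.4f}")
```

The overall accuracy works out to 1874 correct out of 1993 test images, matching the reported 0.9403. COVID-19 precision is lower than its recall because 70 normal images are predicted as COVID-19.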

Technology Stack

Core Libraries: TensorFlow 2.10, Keras, NumPy, OpenCV
Data Processing: Pandas, scikit-learn, tqdm
Visualization: Matplotlib, TensorBoard
Hardware Acceleration: TensorFlow Metal, Mixed Precision Training
Development Environment: Python 3.9, macOS Monterey, Jupyter Notebook
Hardware: Apple M1 Pro (16GB Unified Memory)