Image Classification using pre-trained DenseNet model in PyTorch | by Bhimraj Yadav
In this tutorial, you will learn how to classify images using a pre-trained DenseNet model in Pytorch. DenseNet is trained on more than a million images from the ImageNet database. This model can classify images into 1000 object categories, such as keyboard, mouse, pencil, etc.
Try it out in Google Colab.
GitHub Link: Image-Recognition-App-using-FastAPI-and-PyTorch
6 Simple Steps to be followed:
- Install and import required libraries
- Download an example Image and ImageNet output classes and load them
- Apply transformations to the image
- Load DenseNet model
- Get the final result/output.
- Plot the result
Step 1: Install and import required libraries
# Optional Part: To create an environment
python -m venv venv
source venv/bin/activate# install these libraries
pip install torch torchvision Pillow matplotlib
After installing these required libraries, we can import them easily as shown below.
import torch
from PIL import Image
from torchvision import transforms,models
import matplotlib.pyplot as plt
Step 2: Download an example Image and ImageNet output classes and load them.
# Download an example image from the pytorch website
!wget "https://github.com/pytorch/hub/raw/master/images/dog.jpg" -O "dog.jpg"# Download ImageNet labels
!wget https://raw.githubusercontent.com/pytorch/hub/master/imagenet_classes.txt# Read the image classes
with open("imagenet_classes.txt", "r") as f:
img_classes = [s.strip() for s in f.readlines()]print("Total Image Classes :{}".format(len(img_classes)))
print(img_classes[:5]) #displaying first five classes
# Printed Output
Total Image Classes :1000
['tench', 'goldfish', 'great white shark', 'tiger shark', 'hammerhead']
Step 3: Apply transformations to the image
# transforming input image
filename = "dog.jpg"
input_image = Image.open(filename)preprocess = transforms.Compose([
transforms.Resize(256),
transforms.CenterCrop(224),
transforms.ToTensor(),
transforms.Normalize(mean=[0.485, 0.456, 0.406],
std=[0.229, 0.224, 0.225]),
])input_tensor = preprocess(input_image)# create a mini-batch as expected by the model
input_batch = input_tensor.unsqueeze(0)
Step 4: Load densenet model
# model
model = models.densenet121(pretrained=True)
model.eval() # sets the model in evaluation mode# For more info about evaluation:# https://stackoverflow.com/questions/60018578/what-does-model-eval-do-in-pytorch# move the input and model to GPU for speed if availableif torch.cuda.is_available():
input_batch = input_batch.to('cuda')
model.to('cuda')
Step 5: Get the final result/output
with torch.no_grad():
output = model(input_batch)# Tensor of shape 1000, with confidence scores over Imagenet's 1000 classes
# The output has unnormalized scores. To get probabilities, you can run a softmax on it.
probabilities = torch.nn.functional.softmax(output[0], dim=0)# Show top categories per image
top5_prob, top5_catid = torch.topk(probabilities, 5)for id,prob in zip(top5_catid,top5_prob):
print(f'Image class: {img_classes[id]}\t----- {prob * 100:.4f} %')# Printed Output
Image class: Samoyed ----- 86.7997 %
Image class: white wolf ----- 4.3434 %
Image class: keeshond ----- 2.5546 %
Image class: Arctic fox ----- 2.3647 %
Image class: Pomeranian ----- 1.7686 %
Step 5: Plot the result
classes = [img_classes[id] for id in top5_catid]
probabilities = [prob * 100 for prob in top5_prob]# creating the bar plot
fig, (ax1,ax2) = plt.subplots(1,2,figsize=(24, 8))
ax1.set_title('Input Image of Dog')
ax1.imshow(input_image)ax2.bar(classes, probabilities)
ax2.set_xlabel("Top 5 Classes")
ax2.set_ylabel("Probabilities")
ax2.set_title("Image Classification")
plt.show()
References:
[1]: Huang et al. (2017). Densely Connected Convolutional Networks. Available: https://arxiv.org/abs/1608.06993 https://github.com/liuzhuang13/DenseNet
[2]: DENSENET By Pytorch Team. Available: https://pytorch.org/hub/pytorch_vision_densenet
[3]: DenseNet. Available: https://paperswithcode.com/method/densenet
Appendices: