Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Image Classification with Vision Transformers (ViT) Using Python

Eran Feit via YouTube

Overview

Coursera Plus Annual Sale: All Certificates & Courses 25% Off!
Learn how to implement image classification using Vision Transformers (ViT) in this 14-minute Python tutorial. Follow along as the instructor demonstrates loading an image with OpenCV, preprocessing it for the ViT model, and performing classification using the ViT-Base-Patch16-224 model from Hugging Face. Watch as the predicted label is displayed on the image and saved as an output file. Access the complete code for this tutorial through the provided link and explore additional computer vision resources on the instructor's blog. The tutorial covers installation requirements and provides a step-by-step coding walkthrough to help you understand how to leverage transformer architecture for image classification tasks.

Syllabus

00:00 Introduction
00:23 Installation
09:13 Let's start coding ...

Taught by

Eran Feit

Reviews

Start your review of Image Classification with Vision Transformers (ViT) Using Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.