Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Fine-Tuning ConvNeXT Vision Transformer for Custom Dog Breed Classification

Eran Feit via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
This tutorial guides you through fine-tuning a ConvNeXT vision transformer model for custom dog breed classification using Hugging Face Transformers and PyTorch. Learn the complete workflow from loading and preprocessing custom image datasets with datasets and torchvision to implementing training loops with validation and early stopping. Master essential techniques including transforming images with AutoImageProcessor for optimal ConvNeXT performance, fine-tuning pre-trained models on new datasets, saving and loading models for inference, and making predictions with fine-tuned Vision Transformer models. The 32-minute video includes practical chapters covering installation, dataset exploration, data transformation and loading, model building and training, and model testing. Access the complete code for the tutorial through the provided link and explore additional computer vision and visual language model tutorials in the creator's playlists.

Syllabus

00:00 Introduction
01:29 Installation
04:13 Discover the dataset
05:34 Transform and load the data
17:37 Build and train the vision transformer model
26:53 Test the model

Taught by

Eran Feit

Reviews

Start your review of Fine-Tuning ConvNeXT Vision Transformer for Custom Dog Breed Classification

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.