Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Image Retrieval System Using Vision Transformer and FAISS with GUI - Tutorial 351

DigitalSreeni via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
Learn to build a powerful content-based image retrieval system in this 11-minute tutorial that demonstrates how to combine Vision Transformer (ViT) and FAISS technologies with a user-friendly graphical interface. Master the implementation of a cross-platform Python application that leverages ViT-B/16 pre-trained models for feature extraction and FAISS IVF indexing for efficient similarity searches. Explore the development of an interactive GUI that supports batch operations, multiple image formats (PNG, JPG, JPEG, WebP), and provides convenient features like double-click image opening and folder access. Discover how to incorporate optional GPU acceleration for enhanced processing speed while ensuring compatibility across Windows, macOS, and Linux operating systems. Follow along with practical code examples and implementation guidance for both GUI and programmatic API approaches to image retrieval.

Syllabus

351 - Image Retrieval Made Easy With GUI. Uses ViT and FAISS

Taught by

DigitalSreeni

Reviews

Start your review of Image Retrieval System Using Vision Transformer and FAISS with GUI - Tutorial 351

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.