Comprehensive Python Beautiful Soup Web Scraping Tutorial - Find-Find All, CSS Select, Scrape Table
Keith Galli via YouTube
Syllabus
- Intro & Video Overview
- What is web scraping?
- Introduction to HTML
Using the Beautiful Soup library
- Loading in a webpage (requests library)
- Starting to scrape
- find & find_all methods
- Finding specific text/strings in our HTML (regex)
- Select method (CSS path selections)
- Grabbing the string/text from an HTML element
- Getting a property of an HTML element (href, src, id, class, etc.)
- Code navigation (parents, children, siblings)
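As a rough preview of the topics in this part of the syllabus, the sketch below runs find, find_all, a regex string match, select, text extraction, attribute access, and parent/sibling/child navigation against a small HTML string. The HTML and all variable names are made up for illustration and are not taken from the video; it parses a literal string with Python's built-in "html.parser" rather than downloading a page with requests.

```python
import re

from bs4 import BeautifulSoup

# Hypothetical HTML, invented for this sketch (not the page used in the video).
HTML = """
<html>
  <body>
    <div id="intro">
      <h1>Sample page</h1>
      <p class="fact">Python is a programming language.</p>
      <p class="fact">Snakes have no eyelids.</p>
      <a href="https://example.com/about" class="link">About</a>
    </div>
  </body>
</html>
"""

soup = BeautifulSoup(HTML, "html.parser")

# find returns the first match; find_all returns every match.
first_paragraph = soup.find("p")
all_facts = soup.find_all("p", class_="fact")

# Regex search on an element's text: paragraphs containing the word "is".
facts_with_is = soup.find_all("p", string=re.compile(r"\bis\b"))

# select takes a CSS selector and returns a list of matching tags.
links = soup.select("div#intro a.link")

# Text of an element, and one of its attributes.
heading_text = soup.find("h1").get_text(strip=True)
link_target = links[0]["href"]

# Navigation: parent, next sibling tag, and direct child tags.
parent_id = first_paragraph.parent["id"]
next_fact = first_paragraph.find_next_sibling("p")
child_names = [child.name for child in soup.find("div").find_all(recursive=False)]

print(heading_text, link_target, parent_id, child_names)
```

For a live page, the same calls would apply to a soup built from the downloaded HTML instead of the literal string.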
Let’s practice our skills!
- Exercise #1: Grab all social links on webpage in 3 different ways
- Exercise #2: Scrape an HTML table into a Pandas Dataframe
- Exercise #3: Grab all fun facts that contain the word “is”
- Exercise #4: Use Beautiful Soup to help download an image from a webpage
- Exercise #5: Solve the mystery challenge!!!
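One possible approach to an exercise like #2 is sketched below: collect the header cells and body rows with find_all, then hand them to pandas. The table contents, column names, and variable names are hypothetical and chosen only for illustration; the video's own table and solution may differ.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical table, invented for this sketch (not the table from the video).
TABLE_HTML = """
<table>
  <thead><tr><th>Name</th><th>Score</th></tr></thead>
  <tbody>
    <tr><td>Ada</td><td>90</td></tr>
    <tr><td>Grace</td><td>85</td></tr>
  </tbody>
</table>
"""

soup = BeautifulSoup(TABLE_HTML, "html.parser")
table = soup.find("table")

# Column names come from the header cells.
columns = [th.get_text(strip=True) for th in table.find_all("th")]

# Each body row becomes a list of cell strings.
rows = [
    [td.get_text(strip=True) for td in tr.find_all("td")]
    for tr in table.find("tbody").find_all("tr")
]

df = pd.DataFrame(rows, columns=columns)

# Scraped cells are strings, so numeric columns need an explicit conversion.
df["Score"] = df["Score"].astype(int)

print(df)
```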
Taught by
Keith Galli