Solving Real World Data Science Tasks With Python Beautiful Soup - Movie Dataset Creation

Solving Real World Data Science Tasks With Python Beautiful Soup - Movie Dataset Creation

Keith Galli via YouTube Direct link

- Video overview

1 of 18

1 of 18

- Video overview

Class Central Classrooms beta

YouTube playlists curated by Class Central.

Classroom Contents

Solving Real World Data Science Tasks With Python Beautiful Soup - Movie Dataset Creation

Automatically move to the next video in the Classroom when playback concludes

  1. 1 - Video overview
  2. 2 - Check out DataCamp! sponsored
  3. 3 - Setup
  4. 4 Task #1: Scrape the infobox from Toy Story 3 wiki page save in python dictionary
  5. 5 Task #2: Scrape infobox for all movies in List of Disney Films save as list of dictionaries
  6. 6 - Robots.txt Are you allowed to scrape a site?
  7. 7 - Task #2: Scrape infobox for all movies in List of Disney Films save as list of dictionaries
  8. 8 - Save & Load dataset checkpoint JSON file
  9. 9 Task #3: Clean our data!
  10. 10 - Task #3.1: Strip out all references [1],[2],etc from HTML
  11. 11 - Task #3.2: Split up the long strings
  12. 12 - Task #3.3: Examine errors we are getting
  13. 13 - Task #3.4: Convert “Running time” field to an integer
  14. 14 - Task #3.5: Convert “Budget” & “Box office” fields to floats
  15. 15 - Task #3.6: Convert dates into datetime objects
  16. 16 - Saving our data again using Pickle
  17. 17 Task #4: Attach IMDB, Metascore, and Rotten Tomatoes scores to dataset working with APIs
  18. 18 Task #5: Save final dataset as a JSON file and as a CSV file

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.