## Master Data Manipulation with Python's Most Powerful Libraries
Unlock the full potential of your data with Python's essential data manipulation libraries: pandas and NumPy. In this Track, you'll learn how to efficiently clean, reshape, and analyze real-world datasets using the most popular tools in the Python data ecosystem. By the end of the Track, you'll have the skills to clean, reshape, and combine the datasets you meet in your own work.
## Become a pandas Power User
Dive deep into the pandas library and learn how to:
* Import and clean data from various sources like CSV, Excel, and SQL databases
* Calculate statistics and create informative visualizations
* Reshape data from wide to long format for efficient analysis
* Combine multiple datasets using advanced joining and merging techniques
* Work with hierarchical data using multi-index DataFrames
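
To give a feel for a few of the skills listed above, here is a minimal sketch of reshaping, merging, and multi-index selection in pandas. The `wide` and `regions` tables and their column names are invented for illustration and are not taken from the Track's exercises.

```python
import pandas as pd

# Hypothetical wide-format data: one row per store, one column per quarter.
wide = pd.DataFrame({
    "store": ["A", "B"],
    "q1": [100, 80],
    "q2": [120, 90],
})

# Wide to long: each (store, quarter) pair becomes its own row.
long = wide.melt(id_vars="store", var_name="quarter", value_name="sales")

# Hypothetical lookup table to join against.
regions = pd.DataFrame({"store": ["A", "B"], "region": ["East", "West"]})

# A left join keeps every sales row and attaches the matching region.
merged = long.merge(regions, on="store", how="left")

# Build a two-level index from region and store, then select one region.
indexed = merged.set_index(["region", "store"]).sort_index()
east = indexed.loc["East"]

print(long)
print(east)
```

Long format tends to be easier to group and aggregate, which is one reason reshaping often comes before joining and analysis.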
## Streamline Your Workflow with NumPy
Discover the power of NumPy, the foundation of Python's data science stack. You'll learn how to create, sort, filter, and update NumPy arrays while efficiently performing complex mathematical operations on large datasets. Additionally, you'll explore how to integrate NumPy with pandas to create a seamless data manipulation workflow and optimize your code for improved speed and performance.
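
As a rough illustration of the array operations described above, the sketch below creates, sorts, filters, and updates a small NumPy array. The trunk diameters are made-up values, not figures from the tree census.

```python
import numpy as np

# Hypothetical tree trunk diameters in centimeters.
diameters = np.array([30.5, 12.0, 45.2, 8.3, 27.1])

# Sort: np.sort returns a sorted copy and leaves the original untouched.
sorted_diameters = np.sort(diameters)

# Filter: a boolean mask selects the elements where the condition holds.
large = diameters[diameters > 25]

# Update: np.where builds a new array, replacing values below 10 with 10.
floored = np.where(diameters < 10, 10.0, diameters)

# Vectorized math: cross-sectional area of every trunk, with no Python loop.
areas = np.pi * (diameters / 2) ** 2

print(sorted_diameters)
print(large)
print(floored)
```

Each of these operations works on the whole array at once, which is where much of NumPy's speed advantage over plain Python loops comes from.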
## Hands-On Learning with Real Datasets
Practice your new skills on real-world data, including:
* New York City's tree census
* Customer purchase data
* Stock market prices
* Online review datasets
By working with authentic datasets, you'll gain practical experience that directly translates to your own data manipulation projects.
## Advance Your Data Science Journey
Whether you're an aspiring data scientist or an experienced analyst looking to upgrade your skills, mastering data manipulation is essential. The techniques you'll learn in this Track serve as the foundation for machine learning, data visualization, and statistical analysis. By honing your data manipulation skills, you'll be prepared to tackle advanced data science concepts and real-world challenges.
## Start Manipulating Data with Confidence
Take the first step toward becoming a data manipulation expert. This Track combines interactive exercises, real-world datasets, and hands-on projects, so you finish with the confidence and skills to wrangle unfamiliar datasets and uncover valuable insights. Start your journey to mastering data manipulation today!
## Syllabus
- Data Manipulation with pandas
  - Learn how to import and clean data, calculate statistics, and create visualizations with pandas.
- Getting a Good Night's Sleep
- Reshaping Data with pandas
  - Reshape DataFrames from wide to long format, stack and unstack rows and columns, and wrangle multi-index DataFrames.
- Joining Data with pandas
  - Learn to combine data from multiple tables using pandas joins.
- Introduction to NumPy
  - Build your NumPy skills by learning how to create, sort, filter, and update arrays using NYC’s tree census.
## Taught by
Richie Cotton, Maggie Matsui, Aaren Stubberfield, Maria Eugenia Inzaugarat, and Izzy Weber