Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

DataCamp

Data Manipulation in Python

via DataCamp

Overview

## Master Data Manipulation with Python's Most Powerful Libraries Unlock the full potential of your data with Python's essential data manipulation libraries: pandas and NumPy. In this Track, you'll learn how to efficiently clean, reshape, and analyze real-world datasets using the most popular tools in the Python data ecosystem. By the end of the Track, you'll have the skills to tackle any data manipulation challenge that comes your way. ## Become a pandas Power User Dive deep into the pandas library and learn how to: * Import and clean data from various sources like CSV, Excel, and SQL databases * Calculate statistics and create informative visualizations * Reshape data from wide to long format for efficient analysis * Combine multiple datasets using advanced joining and merging techniques * Work with hierarchical data using multi-index DataFrames ## Streamline Your Workflow with NumPy Discover the power of NumPy, the foundation of Python's data science stack. You'll learn how to create, sort, filter, and update NumPy arrays while efficiently performing complex mathematical operations on large datasets. Additionally, you'll explore how to integrate NumPy with pandas to create a seamless data manipulation workflow and optimize your code for improved speed and performance. ## Hands-On Learning with Real Datasets Practice your new skills on real-world data, including: * New York City's tree census * Customer purchase data * Stock market prices * Online review datasets By working with authentic datasets, you'll gain practical experience that directly translates to your own data manipulation projects. ## Advance Your Data Science Journey Whether you're an aspiring data scientist or an experienced analyst looking to upgrade your skills, mastering data manipulation is essential. The techniques you'll learn in this Track serve as the foundation for machine learning, data visualization, and statistical analysis. By honing your data manipulation skills, you'll be prepared to tackle advanced data science concepts and real-world challenges. ## Start Manipulating Data with Confidence Take the first step towards becoming a data manipulation expert. With a combination of interactive exercises, real-world datasets, and hands-on projects, this Track provides a comprehensive learning experience. By the end of the Track, you'll have the confidence and skills to wrangle any dataset and uncover valuable insights. Start your journey to mastering data manipulation today!

Syllabus

  • Data Manipulation with pandas
    • Learn how to import and clean data, calculate statistics, and create visualizations with pandas.
  • Getting a Good Night's Sleep
  • Reshaping Data with pandas
    • Reshape DataFrames from a wide to long format, stack and unstack rows and columns, and wrangle multi-index DataFrames.
  • Joining Data with pandas
    • Learn to combine data from multiple tables by joining data together using pandas.
  • Introduction to NumPy
    • Master your skills in NumPy by learning how to create, sort, filter, and update arrays using NYC’s tree census.

Taught by

Richie Cotton, Maggie Matsui, Aaren Stubberfield, Maria Eugenia Inzaugarat, and Izzy Weber

Reviews

Start your review of Data Manipulation in Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.