Overview
Explore big data analytics using Python and Stratosphere in this 20-minute EuroPython Conference talk. Learn about Stratosphere's distributed platform for advanced analytics, featuring rich operators, iterative data flows, efficient runtime, and automatic program optimization. Discover how to leverage Stratosphere's new Python programming interface to easily work with big data. Gain insights into key concepts like operators, data source connectors, data flows, compiler, and iterative algorithms. Understand the project's origins, open-source community, and transition to Apache Flink. Follow along with practical examples, including the "Word Count" demonstration, and learn about working with local files, containers, operations, and distribution. Engage with the speaker during the Q&A session to deepen your understanding of this powerful big data analytics tool.
Syllabus
Introduction
What is Stratosphere
What is Flink used for
Working on local files
Summary
Word Count
Why Python
How Python works
Containers
Operations
Distribution
Questions
Taught by
EuroPython Conference