Dive into the advanced features of Dask, a popular Python library for scaling and parallelizing code, in this comprehensive tutorial from PyCon US. Explore task graph optimization, the worker and scheduler plugin system, and techniques for inspecting a cluster's internal state. Gain a deeper understanding of Dask's internals, learn about its more advanced capabilities, and discover how to effectively apply these features to data-intensive workloads. Suitable for those familiar with Dask's basics, this 2-hour 18-minute session will equip you with the knowledge to extend the PyData ecosystem to larger-than-memory or distributed environments and parallelize custom algorithms and workflows.
Overview
Syllabus
TUTORIAL / James Bourbeau, Julia Signell / Hacking Dask: Diving Into Dask;s Internals
Taught by
PyCon US