About the workshop¶
We will take a look at the basic concepts of data pipelines as well as practical use cases using Python and libraries such as pandas, matplotlib, and TensorFlow.
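As a taste of what is to come, here is a minimal sketch of a pipeline step with pandas: load some raw records, clean them, then aggregate. The data and column names here are made up for illustration; the workshop's actual examples will differ.

```python
import pandas as pd

# Hypothetical raw readings; one record is missing its value
raw = pd.DataFrame({
    "city": ["Manchester", "Leeds", "Manchester", "Leeds"],
    "temp_c": [12.5, 9.0, 14.0, None],
})

# Transform: drop incomplete records, then aggregate per city
clean = raw.dropna(subset=["temp_c"])
summary = clean.groupby("city", as_index=False)["temp_c"].mean()
```

Even this tiny example has the extract → transform → aggregate shape that larger pipelines build on.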
About you:¶
- Some experience using the command line
- Intermediate Python knowledge / use
- Able to apply what we learn and adapt it to your own use cases
- Interested in data and systems
- Aspiring or current data engineer
- Some knowledge about systems and databases (enough to be dangerous)
Our focus for the day¶
- Greater understanding of how to build data pipelines using Python and libraries in the Python scientific ecosystem
- Focus on concepts (rather than complex implementations)
- Practical knowledge application
- Create the building blocks needed for your day-to-day work
Keeping on track¶
You will find 🚦 markers across the tutorial examples. We will use these to check how folks are doing during the workshop (if following along in person). They indicate practical or hands-on portions of the tutorial.
Additional tutorial (PyCon US)¶
For another (much longer) tutorial integrating MySQL and Twitter stream data, check out https://github.com/trallard/airflow-tutorial
I also have the following planned for the upcoming months:
- Deploying Airflow in Kubernetes (AKS)
- In-depth programmatic report generation with Airflow and papermill
- Airflow + dagster https://github.com/dagster-io/dagster
- Airflow + R?