As Data Engineers, we regularly schedule and scale workflows.

But have you ever asked yourself: can I scale my scheduling?

It turns out that you can! But doing so raises a number of issues that need to be addressed.

In this talk we’ll be:

  • Recapping Asset-aware scheduling in Apache Airflow
  • Discussing different ways to scale up our scheduling
  • Solving the issue of keeping our Airflow Assets synchronized between instances
  • Comparing our professional push-based solution with the built-in solution from AIP-82, and weighing the pros and cons of each

I hope you will enjoy it!

Sébastien Crocquevieille

Senior Data Engineer @ Numberly