On this dialogue, I goal to discover the evolving traits in information orchestration and information modelling, highlighting the developments in instruments and their core advantages for information engineers. Whereas Airflow has been the dominant participant since 2014, the info engineering panorama has considerably reworked, now addressing extra subtle use instances and necessities, together with assist for a number of programming languages, integrations, and enhanced scalability. I’ll study modern and maybe unconventional instruments that streamline my information engineering processes, enabling me to effortlessly create, handle, and orchestrate sturdy, sturdy, and scalable information pipelines.
Over the past decade we witnessed a “Cambrian explosion” of assorted ETL frameworks for information extraction, transformation and orchestration. It’s not a shock that lots of them are open-source and are Python-based.
The preferred ones:
- Airflow, 2014
- Luigi, 2014
- Prefect,2018
- Temporal, 2019
- Flyte, 2020
- Dagster, 2020
- Mage, 2021
- Orchestra, 2023