Prefect vs. Dagster (from 2/2022) #2485
Closed
bendnorman
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Note: This discussion contains information about our comparison between prefect and dagster that originally lived in a google doc. This comparison was created in February 2022 so some information, stats, links might be out of date.
Prefect
Prefect’s main goal is to solve “negative engineering” which is when engineers write defensive code to make sure the positive code runs. Positive code is code to achieve an objective, for example, write an ETL to clean energy data. Examples of negative engineering from Prefect :
Prefect will be releasing Prefect Orion soon, their 2.0 release. While their guiding philosophy is to tackle negative engineering, they add a new hypothesis to the mix: DAGs create complexity.
It seems like Prefect wants to be as flexible as possible and make it easy for users to add Prefect features to pieces of existing “workflow” code. Do we want a super flexible orchestrator that doesn’t require DAGs? I can see how this philosophy would be helpful for Airflow users with large existing workflows who want a more flexible tool.
Dagster
Dagster is focused on solving the issue of “ Big Complexity ”. They don’t actually define the term, but I think this is the sentiment:
They believe orchestration is the layer in the data stack to tie data tools together:
What sets Dagster apart from other orchestration tools is its focus on developing tools to process and document data assets. It seems like delivering high-quality data is more important to Dagster than solving software infrastructure problems.
It’s interesting how Prefect and Dagster recently changed their philosophy on DAGs. Prefect thinks DAGs are too restrictive for complex data workflows. Dagster believes data asset DAGs are more important than computation DAGs.
Community
Both of these tools are supported by open-core startups, with most of the contributions being made by employees of the VC backed companies. We run the risk of the companies folding or close-sourcing the tools. Worst case scenario, there isn't enough community support to keep the project going, and we'd need to refactor our code base again. I gathered some statistics about each project (I included Airflow for comparison).
Features
I created a matrix of how Prefect and Dagster address Catalyst’s orchestration needs.
Beta Was this translation helpful? Give feedback.
All reactions