Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add support for DaskExecutor #816

Closed
wants to merge 6 commits into from

Conversation

cherusk
Copy link

@cherusk cherusk commented Jan 6, 2024

What issues does your PR fix?

What does your PR do?

This PR brings about a new Airflow executor backend on Dask/Distributed cluster.

Checklist

For all Pull Requests

@cherusk cherusk marked this pull request as draft January 6, 2024 15:47
@cherusk cherusk changed the title Feature/dask executor {DRAFT] Feature/dask executor Jan 6, 2024
@cherusk cherusk changed the title {DRAFT] Feature/dask executor [DRAFT] Feature/dask executor Jan 6, 2024
@cherusk cherusk force-pushed the feature/dask_executor branch 11 times, most recently from 94892d5 to 9a115cf Compare January 6, 2024 17:30
@cherusk cherusk marked this pull request as ready for review January 6, 2024 17:30
@cherusk cherusk requested a review from gsemet as a code owner January 6, 2024 17:30
@cherusk
Copy link
Author

cherusk commented Jan 6, 2024

@thesuperzapper this should be it. Please give it a gander for now.

I've not tested this outside of this PR yet.

@cherusk cherusk changed the title [DRAFT] Feature/dask executor Feature/dask executor Jan 6, 2024
Define dask executor as component in values and reflect chart dependency
in chart config.

Signed-off-by: Matthias Tafelmeier <[email protected]>
Reflect new executor type DaskExecutor.

Signed-off-by: Matthias Tafelmeier <[email protected]>
Airflow does need the dask scheduler endpoint configured. That is done
as for the other config vars via env vars.

We also autodetect the dask sub chart release to automate generating the
connection string from airflow to the dask cluster.

Signed-off-by: Matthias Tafelmeier <[email protected]>
Necessary otherwise we get a dependencies out of sync error.

Exact error:
"the lock file (Chart.lock) is out of sync with the dependencies file (Chart.yaml). Please update the dependencies"

Signed-off-by: Matthias Tafelmeier <[email protected]>
@cherusk cherusk force-pushed the feature/dask_executor branch from 9a115cf to 84a1520 Compare January 6, 2024 20:18
Signed-off-by: Matthias Tafelmeier <[email protected]>
@cherusk
Copy link
Author

cherusk commented Jan 9, 2024

@thesuperzapper @gsemet could you please prioritize this up? Thanks!

Copy link

stale bot commented Mar 17, 2024

This issue has been automatically marked as stale because it has not had activity in 60 days.
It will be closed in 7 days if no further activity occurs.

Thank you for your contributions.


Issues never become stale if any of the following is true:

  1. they are added to a Project
  2. they are added to a Milestone
  3. they have the lifecycle/frozen label

@stale stale bot added the lifecycle/stale lifecycle - this is stale label Mar 17, 2024
@thesuperzapper thesuperzapper added this to the airflow-8.10.0 milestone Apr 24, 2024
@stale stale bot removed lifecycle/stale lifecycle - this is stale labels Apr 24, 2024
@thesuperzapper thesuperzapper changed the title Feature/dask executor feat: add support for DaskExecutor Apr 30, 2024
@thesuperzapper
Copy link
Member

@cherusk thanks for the PR, but it looks like as of Airflow 2.9.0, the upstream project has decided to stop supporting DaskExecutor:

I understand that will be frustrating!

@thesuperzapper thesuperzapper removed this from the airflow-8.10.0 milestone Apr 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Feature - add dask executor support
2 participants