Update airflow resources and split out dbt tests and alerts #433

chowbao · 2024-07-16T15:16:42Z

PR Checklist

PR Structure

This PR has reasonably narrow scope (if not, break it down into smaller PRs).
This PR avoids mixing refactoring changes with feature changes (split into two PRs otherwise).
This PR's title starts with the jira ticket associated with the PR.

Thoroughness

This PR adds tests for the most critical parts of the new functionality or fixes.
I've updated the README with the added features, breaking changes, new instructions on how to use the repository.

What

Reduce the Airflow resource requests to cut costs, and split the dbt tests out of the elementary alert DAG.

Why

Airflow resources were originally sized for running captive core. Now that we have moved to CDP and observed the actual resources needed, we can greatly reduce the resources requested.
The elementary alert DAG has a dependency on a dbt test. If this test fails, the elementary alerts won't actually fire. Separating them out should avoid this.
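
To illustrate the failure mode described above, here is a minimal pure-Python model of Airflow's default `all_success` trigger rule. It is not Airflow code: `run_tasks`, `failing_test`, and `send_alerts` are hypothetical stand-ins, and the task names are chosen only to mirror the ones in this PR.

```python
from typing import Callable, Dict, List, Tuple

# Each task is (name, upstream task names, callable returning True on success).
Task = Tuple[str, List[str], Callable[[], bool]]


def run_tasks(tasks: List[Task]) -> Dict[str, str]:
    """Run tasks in listed order, mimicking the default all_success rule:
    a task only runs if every upstream task succeeded."""
    states: Dict[str, str] = {}
    for name, upstream, fn in tasks:
        if any(states[u] != "success" for u in upstream):
            states[name] = "upstream_failed"  # the task body never executes
        else:
            states[name] = "success" if fn() else "failed"
    return states


def failing_test() -> bool:
    return False  # stands in for a dbt test that fails


def send_alerts() -> bool:
    return True  # stands in for the elementary alert task


# Before: alerts sit downstream of the tests.
chained = run_tasks([
    ("singular_tests", [], failing_test),
    ("singular_tests_elementary_alerts", ["singular_tests"], send_alerts),
])

# After: tests and alerts do not depend on each other.
independent = run_tasks([
    ("singular_tests", [], failing_test),
    ("elementary_alerts", [], send_alerts),
])

print(chained)
print(independent)
```

In the chained case the alert task ends up as `upstream_failed` and never runs, which is the situation this PR is trying to avoid. In the independent case the alert task still runs when the test fails.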

Known limitations

N/A

chowbao · 2024-07-16T15:17:29Z

airflow_variables_dev.json

    },
-    "state": {
+    "stellar-etl": {


Added new stellar-etl and dbt resources. This makes it easy to adjust resources per service without editing the resources for every DAG/task.
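
As a rough sketch of how named resource groups could be looked up from a task, assuming a JSON layout similar to the one in this diff. The `get_requests` helper, the fallback to `default`, and all values below are assumptions made for illustration, not the repository's actual code.

```python
import json
from typing import Dict

# Hypothetical excerpt of a resources entry in airflow_variables_*.json.
# Key names follow this PR ("stellaretl" is the spelling adopted in a later
# commit because a dash is invalid); the values are illustrative only.
RESOURCES_JSON = """
{
  "default":    {"requests": {"cpu": "0.5", "ephemeral-storage": "1Gi", "memory": "1Gi"}},
  "dbt":        {"requests": {"cpu": "1",   "ephemeral-storage": "1Gi", "memory": "2Gi"}},
  "stellaretl": {"requests": {"cpu": "0.5", "ephemeral-storage": "1Gi", "memory": "1Gi"}}
}
"""


def get_requests(resources: Dict[str, dict], service: str) -> Dict[str, str]:
    """Return the resource requests for a service.

    Falling back to "default" for unknown names is an assumption of this
    sketch, not behavior confirmed by the PR.
    """
    return resources.get(service, resources["default"])["requests"]


resources = json.loads(RESOURCES_JSON)
dbt_requests = get_requests(resources, "dbt")
fallback_requests = get_requests(resources, "some_other_service")
print(dbt_requests)
print(fallback_requests)
```

With a layout like this, changing the `dbt` entry affects every dbt task at once and leaves the other services untouched.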

chowbao · 2024-07-16T15:19:01Z

dags/dbt_singular_tests_dag.py

Technically this could have been added to the existing dbt eho/state DAG. I opted to separate it out because it exists purely to run anomaly tests and other tests, which felt different enough to warrant a standalone DAG.

dags/history_tables_dag.py

amishas157 · 2024-07-16T18:10:05Z

dags/dbt_data_quality_alerts_dag.py


-    # DAG task graph
-    start_tests >> singular_tests >> singular_tests_elementary_alerts


This looks good, but I'm curious: why were we using an empty operator?

~~Also, different DAG is okay, but we can create independent tasks in same DAG as well, correct?~~
Explained in #433 (comment)

elementary_alerts singular_tests_elementary_alerts

amishas157 · 2024-07-16T18:13:12Z

airflow_variables_prod.json

+        "cpu": "0.5",
        "ephemeral-storage": "1Gi",
-        "memory": "5Gi"
+        "memory": "1Gi"


For my own knowledge, are the historical resource usage metrics tracked somewhere?

I don't believe they are. In theory you could find them in the GCP logs.

amishas157

Looks good to me. Left some questions inline for my understanding.

sydneynotthecity · 2024-07-16T18:28:19Z

dags/dbt_data_quality_alerts_dag.py

@@ -18,7 +18,7 @@
    default_args=get_default_dag_args(),
    start_date=datetime(2024, 6, 25, 0, 0),
    description="This DAG runs dbt tests and Elementary alerts at a half-hourly cadence",
-    schedule="*/15,*/45 * * * *",  # Runs every 15th minute and every 45th minute
+    schedule="15,45 * * * *",  # Runs every 15th minute and every 45th minute


Is there a reason we're running at the 15th and 45th minutes instead of */30 * * * *?

Yes. The other dbt DAGs run on */30 with task run times of around 10 mins. If we ran the quality alerts at the same time, there would be a slightly larger gap before we are notified. Running at a 15 min offset from those runs should hopefully alert faster.

Alternatively I don't see the harm in running every 15 mins like it has been lol

Edit: Updated to just run every 15 mins
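
For reference, the minute fields discussed in this thread can be compared with a small helper. This is a simplified sketch, not a full cron parser, and `expand_minute_field` is a hypothetical name introduced here for illustration.

```python
from typing import List


def expand_minute_field(field: str) -> List[int]:
    """Expand a cron minute field into the sorted minutes it matches.

    Supports '*', single values, 'a-b' ranges, and '/step' suffixes,
    combined with commas.
    """
    minutes = set()
    for part in field.split(","):
        base, _, step_text = part.partition("/")
        step = int(step_text) if step_text else 1
        if base == "*":
            start, end = 0, 59
        elif "-" in base:
            low, high = base.split("-")
            start, end = int(low), int(high)
        else:
            start = end = int(base)
        minutes.update(range(start, end + 1, step))
    return sorted(minutes)


print(expand_minute_field("*/15,*/45"))  # [0, 15, 30, 45]
print(expand_minute_field("15,45"))      # [15, 45]
print(expand_minute_field("*/30"))       # [0, 30]
print(expand_minute_field("*/15"))       # [0, 15, 30, 45]
```

Under this reading, the original `*/15,*/45` field already fired every 15 minutes, which matches the remark above about "running every 15 mins like it has been". `15,45` gives the half-hourly runs offset from the `*/30` DAGs.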

sydneynotthecity · 2024-07-16T18:28:23Z

airflow_variables_prod.json

If we're no longer separating resources out between dbt, default and stellaretl, is it worth us storing configs beyond the default?

I think it's worth it. I was going to delete all the resources and just make everything run with the default settings, but I have a theory that dbt and stellar-etl have different resource requirements. So if we do need to bump resources up or down for one service, we wouldn't necessarily OOM or over-allocate the other service.

👌 sounds good to me. Your theory sounds plausible

Update airflow resources and split out dbt tests and alerts

10fbfbe

chowbao requested a review from a team as a code owner July 16, 2024 15:16

chowbao commented Jul 16, 2024

chowbao added 4 commits July 16, 2024 11:21

lint

c2c75ae

linting

2b93ac0

lint

ea7ee6a

change stellar-etl resource name to stellaretl because dash is invalid

b9b3307

amishas157 reviewed Jul 16, 2024

amishas157 reviewed Jul 16, 2024

amishas157 approved these changes Jul 16, 2024

sydneynotthecity reviewed Jul 16, 2024

chowbao added 3 commits July 16, 2024 15:07

update schedule to every 15 mins

a32cf69

Update images

11284e1

Update resources

96392ad

chowbao merged commit ceec401 into master Jul 18, 2024
4 checks passed

amishas157 deleted the update-airflow-resources branch July 26, 2024 18:48
