You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have tested with the :latest image tag (i.e. quay.io/argoproj/workflow-controller:latest) and can confirm the issue still exists on :latest. If not, I have explained why, in detail, in my description below.
I have searched existing issues and could not find a match for this bug
After upgrade 3.6.0 (and latest) from 3.5.8, http://argo-server/workflows/?&limit=50 takes response time too long.
I've found there is log for argo-server
Query: SELECT count(*) as total FROM "argo_archived_workflows" WHERE (("clustername" = $1 AND "instanceid" = $2) AND not exists (select 1 from argo_archived_workflows_labels where clustername = argo_archived_workflows.clustername and uid = argo_archived_workflows.uid and name = 'workflows.argoproj.io/controller-instanceid'))
Arguments: []interface {}{"default", ""}
Error: upper: slow query
Time taken: 363.58917s
<...>
time="2024-11-27T10:52:09.899Z" level=info duration=6m3.617456665s method=GET path=/api/v1/workflows/ size=33993 status=0
There seems to be an issue with the count query fetching archived_workflow after the upgrade.
You can see memory of database has significant increase.
There are 2 million archived workflows in the database:
# SELECT count(*) as total FROM "argo_archived_workflows" WHERE (("clustername" = 'default' AND "instanceid" = '') AND not exists (select 1 from argo_archived_workflows_labels where clustername = argo_archived_workflows.clustername and uid = argo_archived_workflows.uid and name = 'workflows.argoproj.io/controller-instanceid'))
postgres-# ;
total
---------
2040568
(1 row)
In helm values.yaml, we need to set workflows ttl 400days.
Pre-requisites
:latest
image tag (i.e.quay.io/argoproj/workflow-controller:latest
) and can confirm the issue still exists on:latest
. If not, I have explained why, in detail, in my description below.What happened? What did you expect to happen?
After upgrade 3.6.0 (and latest) from 3.5.8,
http://argo-server/workflows/?&limit=50
takes response time too long.I've found there is log for
argo-server
There seems to be an issue with the count query fetching archived_workflow after the upgrade.
You can see memory of database has significant increase.
There are 2 million archived workflows in the database:
In helm values.yaml, we need to set workflows ttl 400days.
Version(s)
a1f67794be68c2bcdfb8900e20ce18b0ab9115ebfe0adf1f1f5100eb6bd7604c
Paste a minimal workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
.
Logs from the workflow controller
Logs from in your workflow's wait container
The text was updated successfully, but these errors were encountered: