Rewrite deploy to minimize API requests #38

tianon · 2024-04-02T21:49:53Z

This replaces our `crane cp`/`crane index append` based deploy workflow with more custom code that can generate the full index/manifest list for each tag in a fully deterministic, offline way (based on the data we've already got locally in `builds.json`), plus code that can take that result and push it directly with minimal API requests, especially in the no-op case. Where `crane index append` would make something like $2 \times (N+1)$ API requests to no-op, this new code should make only $1$ total per tag in the no-op case. That stays true even if we later add a pre-flight HEAD request, which this code currently skips in favor of doing the PUT directly and handling errors as a miss.
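To make the "PUT first, treat an error as a miss" idea concrete, here is a minimal sketch. It is not the code in this PR: `manifest`, `registry`, `fakeRegistry`, `ensure`, and `errMissing` are all hypothetical names, and the in-memory fake assumes a registry that rejects a manifest whose children are absent.

```go
package main

import "errors"

// errMissing is a hypothetical sentinel: the registry refused a manifest
// because something it references is not there yet.
var errMissing = errors.New("referenced content missing")

// manifest is a hypothetical, stripped-down index/manifest: a digest plus
// the child manifests it references.
type manifest struct {
	digest   string
	children []*manifest
}

// registry is a hypothetical one-method view of a registry; every call
// stands in for exactly one API request.
type registry interface {
	PutManifest(m *manifest) error
}

// fakeRegistry is an in-memory stand-in that counts requests. It assumes
// the registry validates that all children exist before accepting a PUT.
type fakeRegistry struct {
	have     map[string]bool
	requests int
}

func (r *fakeRegistry) PutManifest(m *manifest) error {
	r.requests++
	for _, c := range m.children {
		if !r.have[c.digest] {
			return errMissing
		}
	}
	r.have[m.digest] = true
	return nil
}

// ensure tries the PUT directly, with no pre-flight HEAD. Only when the PUT
// fails as a miss does it push the children and retry the parent once.
func ensure(reg registry, m *manifest) error {
	err := reg.PutManifest(m)
	if err == nil {
		return nil // no-op (or simple) case: a single request
	}
	if !errors.Is(err, errMissing) {
		return err // anything other than a miss is a real failure
	}
	for _, c := range m.children {
		if err := ensure(reg, c); err != nil {
			return err
		}
	}
	return reg.PutManifest(m)
}
```

With this fake, calling `ensure` for an index whose two children are already present costs one request, while the same call against an empty registry costs four (one miss, two child pushes, one retry).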

This also includes new integration tests and even some new jq tests to cover the new code. As noted in a comment in the code, in theory we can use ocimem in the future to add more unit tests to this code, but it's harder to successfully/correctly test the edge cases that way (since we'd then have to synthesize content in-process instead of relying on pre-synthesized content pre-existing), so I felt pretty good about leaving that as a TODO for now.

(I have been using this successfully for deploying my own personal images, FWIW. 👍)

Closes #22

tianon · 2024-04-02T21:50:07Z

This is real big. For now, I've left it in "many WIP commits" mode, which might make it easier or more interesting to review, but I don't think we should merge all these commits (and I was planning to squash them).

tianon · 2024-04-02T23:14:26Z

For purposes of comparison, the Go coverage percentage on main is currently 77.1% total, and this PR is at 77.9% total 😄

tianon · 2024-04-02T23:52:27Z

Jenkinsfile.deploy

+	rateLimitBuilds([
+		count: 1,
+		durationName: 'hour',
+		userBoost: true,
+	]),
 	pipelineTriggers([
-		// TODO https://github.com/docker-library/meta-scripts/issues/22
-		//upstream(threshold: 'UNSTABLE', upstreamProjects: 'meta'),
-		cron('H H/2 * * *'),
-		// (we've dropped to only running this periodically to avoid it clogging the whole queue for a no-op, which also gives build+meta more time to cycle and get deps so they have a higher chance to all go out at once -- see the above linked issue)
+		upstream('meta'),
+		cron('H H/6 * * *'), // run every few hours whether we "need" it or not


This effectively sets us to run deploy every time meta succeeds, but at most once per hour (unless we run it manually). I don't personally think this is right, but it's maybe close enough for a first pass (it triggers relatively frequently, but not so frequently that we cause problems for anyone else).

I'd also add the reminder that this job is only a blocker for users seeing new builds, not for dependent/child builds, so a 1-2 hour delay before new builds show up is pretty reasonable (and we can always run it manually if we know the builds are done and need to get a deploy out faster than that).

whalelines

Looks good, a couple questions

.test/oci-sort-platforms/test.jq

whalelines · 2024-04-03T21:16:14Z

cmd/deploy/input.go

+		// panic instead of error because this should've already been handled/normalized above (so this is a coding error, not a runtime error)
+	}
+
+	return normal, nil


That is quite a function.

Yeahhhhhhh -- I actually wrote this as part of main() itself, and when I stepped back from the implementation I almost died and thus forced myself to re-write it in such a way that I could load it up with unit tests so I could have higher confidence that all the edge cases in every branch of this behemoth are actually covered properly.

tianon added 19 commits March 26, 2024 17:04

WIP: first pass at deploy

66f28a3

WIP: second pass, now with more cowbell

ebfe34b

WIP: refactor coverage handling (cleaner, more consistent, no longer …

e390732

…has to clobber top-level bin/)

WIP: the start of more "jq" tests

61fb74c

WIP: add a few TODOs

33ed5de

Add a benchmark for om.OrderedMap.Set

7acd821

I was testing a minor memory-usage improvement to `Set`, but it turns out it doesn't actually matter (and this helped me determine that, so I might as well keep it).

Add explicit Reference.StringWithKnownDigest unit test

ec3149d

WIP: refactor EnsureManifest loop with more correct handling of child…

28856d0

… manifests vs blobs

Update to use the new ociregistry.HTTPError for more consistent/cor…

1046d55

…rect HTTP error handling

WIP: remove TODO that was implemented elsewhere (and fix error messag…

2e16d56

…e text / comment text)

WIP: also normalize descriptor field ordering

824c566

WIP: assume pre-normalized platform (no reason to normalize more than…

c5612aa

… once)

WIP: initial "deploy" data munging helpers plus tests

234f627

WIP: update Jenkinsfile.deploy to use new deploy code

31036dc

WIP: remove example-commands symlink so Git detects rename better

6906397

WIP: add delay for racy registry startup

c63cf58

WIP: remove trap once it's no longer necessary

faaffd5

WIP: typo

3d919cd

WIP: remove unnecessary TODOs

20240c7

tianon commented Apr 2, 2024

whalelines approved these changes Apr 3, 2024

yosifkit approved these changes Apr 18, 2024

yosifkit marked this pull request as ready for review April 18, 2024 18:52

yosifkit merged commit ab38f95 into docker-library:main Apr 18, 2024
1 check passed

yosifkit deleted the deploy branch April 18, 2024 18:53
