Introduce profile-guided optimization to APM Server #13859

1pkg · 2024-08-09T23:10:54Z

Context

The Go compiler has supported profile-guided optimization (PGO) since Go 1.20, which makes it possible to build further-optimized Go binaries using CPU profiles. Depending on the workload, the optimizations applied, and the profile quality, one can expect performance gains of roughly 2-14% according to go.dev/blog/pgo, which is quite meaningful. Furthermore, for CPU-bound services in general, the results should fall toward the upper end of that range.

Requested Changes

We should capture this performance improvement in APM Server.

We should consider how we will collect the profiles. Initially, we could leverage the existing benchmarks workflow to collect CPU profiles, which would then be included in future PGO builds.

We should consider how we will store the profiles and include them in the builds. The simplest and most convenient approach is to commit them directly to the source repository for easy distribution and repeatable builds.

Additional Links

go.dev/doc/pgo
go.dev/blog/pgo

1pkg · 2024-09-11T21:44:28Z

The existing benchmarks in APM Server are tightly coupled to the Elasticsearch (ES) stack, so ES performance is over-represented in the benchmark results. This can lead to a large spread between individual benchmark runs, up to 20%, which makes the results hard to measure and reason about. This is a particular problem for PGO, since PGO adds only an incremental performance gain. In my synthetic local test, benchmarking an isolated instance of APM Server showed an average 5% increase in throughput. Gains of this size are really hard to observe when the existing benchmarks bury them under ES performance variance.

The solution to this problem would be to use a second, separate benchmark workflow that targets an HTTP API stub instead of a real ES instance. This way we should be able to sufficiently isolate APM Server performance from the underlying ES performance.

1pkg added the enhancement label Aug 9, 2024

1pkg added this to the 8.16 milestone Aug 9, 2024

1pkg self-assigned this Aug 9, 2024

1pkg mentioned this issue Aug 15, 2024

PGO: update the existing benchmarks workflow to enable PGO builds #13884

Merged

2 tasks

1pkg mentioned this issue Sep 18, 2024

Add Moxy tool to apm-perf elastic/apm-perf#158

Merged

mergify bot mentioned this issue Oct 3, 2024

[8.x] PGO: update the existing benchmarks workflow to enable PGO builds (backport #13884) #14245

Merged

2 tasks

1pkg closed this as completed Oct 4, 2024

rubvs mentioned this issue Oct 28, 2024

Update APM Server throughput number for 8.16.0 elastic/observability-docs#4441

Open

10 tasks

1pkg mentioned this issue Nov 19, 2024

PGO: Improve the existing PR process #14672

Open
