Spark triggering massive amounts of "ThreadDump" safepoints, causing stutter/lag #458

pietro-lopes · 2024-09-19T04:50:56Z

Description

Some players at ATM10 are having some lag spikes and I asked them to turn on safepoint and GC logs to see what is going on and turn out this is happening:

With spark https://mclo.gs/q9v2q3W

Without spark https://mclo.gs/XpTnbfu

Reproduction Steps

Happens just by having spark (maybe the background profiler?)

Looks like it is happening to very few people, I can't reproduce it at Linux (PopOS) or Windows 10.

Expected Behaviour

Don't know, is it suffering from safepoint bias (at least for Windows)?

Platform Information

Minecraft Version: 1.21.1
Platform Type: client
Platform Brand: Neoforge
Platform Version: Neo 21.1.47

Spark Version

1.10.97

Logs and Configs

No response

Extra Details

here is some random spark from that player if you need to grab some PC/config specs
https://spark.lucko.me/fPQnwEqJ2K

SirYwell · 2024-09-19T08:15:10Z

It seems like the Reaching safepoint time is pretty high every now and then. It might be related to GC (I'm also seeing allocation stalls, that might indicate that memory just isn't sufficient). Does that also happen with either other GCs or more memory assigned?

The way spark takes thread dumps without async-profiler requires threads to be at a safepoint, but safepoint bias is more about less precise measurements than performance overhead/lag spikes.

pietro-lopes · 2024-09-19T10:02:21Z

Another person
https://spark.lucko.me/gghY5nDptL (for spec references)

With spark (at this time didn't asked to use the gc debug option, only safepoint)
https://mclo.gs/NX3UTPO

(nearly ~21s of pause only for ThreadDump, on an aplication running for 232s)

No spark
https://mclo.gs/gETRTlg
(now a total of ~2s of pause for app running for ~236s)

pietro-lopes · 2024-09-19T22:18:20Z

And now just another player had same issue and fixed by disabling background profiler.
We will ship that config disabled by default for now.

pietro-lopes added the bug Something isn't working label Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spark triggering massive amounts of "ThreadDump" safepoints, causing stutter/lag #458

Spark triggering massive amounts of "ThreadDump" safepoints, causing stutter/lag #458

pietro-lopes commented Sep 19, 2024

SirYwell commented Sep 19, 2024

pietro-lopes commented Sep 19, 2024

pietro-lopes commented Sep 19, 2024

Spark triggering massive amounts of "ThreadDump" safepoints, causing stutter/lag #458

Spark triggering massive amounts of "ThreadDump" safepoints, causing stutter/lag #458

Comments

pietro-lopes commented Sep 19, 2024

Description

Reproduction Steps

Expected Behaviour

Platform Information

Spark Version

Logs and Configs

Extra Details

SirYwell commented Sep 19, 2024

pietro-lopes commented Sep 19, 2024

pietro-lopes commented Sep 19, 2024