
Faster testing using pytest-xdist #83

Open. Wants to merge 2 commits into master.
Conversation

@tmillenaar (Contributor) commented Oct 23, 2024

This is just a nice-to-have. It speeds up the tests on my machine from four minutes to one by running them in parallel. The --dist worksteal option matters here because some of the tests take significantly longer than others: with work stealing enabled, while one worker is busy with a slow test, the other workers can pick up the remaining tests in the meantime.
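For reference, the behaviour described above could be made the default via a pytest config file (a sketch; the file location and section name follow pytest's standard `pytest.ini` convention, and `pytest-xdist` must be installed for the `-n`/`--dist` options to exist):

```ini
# pytest.ini: run the suite in parallel by default,
# letting idle workers steal queued tests from busy ones
[pytest]
addopts = -n logical --dist worksteal
```

The same options can of course be passed on the command line instead: `pytest -n logical --dist worksteal`.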

@tmillenaar (Contributor, Author) commented Oct 23, 2024

It looks like the CI pytest duration went down from about 4 minutes to 2.5 minutes. I did have to use -n=logical instead of -n=auto. While still a nice gain, the speedup is smaller than on my own machine; I imagine the virtual CPU we get assigned in the GitHub CI does not have that many threads available. As I understand it, the difference is that -n=logical counts logical (hyperthreaded) cores, giving us more workers, whereas -n=auto counts physical cores.
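The logical-versus-physical distinction can be illustrated with a small sketch (the actual worker counts are decided by pytest-xdist itself; psutil is an optional dependency it uses for the physical count):

```python
import os

# os.cpu_count() reports *logical* CPUs, i.e. hyperthreads included.
# This is roughly the worker count that `-n logical` targets.
logical = os.cpu_count()

try:
    # psutil can also report *physical* cores, which `-n auto`
    # prefers when psutil is available.
    import psutil
    physical = psutil.cpu_count(logical=False)
except ImportError:
    physical = None  # without psutil, only the logical count is known

print(logical, physical)
```

On a hyperthreaded machine `logical` is typically twice `physical`, which would explain the extra workers (and the extra gain) from `-n logical`.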

@suvayu (Collaborator) commented Oct 29, 2024

I would like to hold off on changes like this until we actually attempt to solve the slow-test issue by addressing the root cause: unnecessarily converting to an inefficient format. That conversion was there for backwards compatibility, but it appears to be unnecessary now.

@HannoSpreeuw (Collaborator) commented

> I would like to hold off on changes like this until we actually attempt to solve the slow test issue by addressing the root cause, converting to an inefficient format unnecessarily. This was there for backwards compatibility, but that appears to be unnecessary now.

Right. There may be a considerable speedup, at least for the TraP unit tests that process parameters from a large number of sources, once we replace the current container for the source measurements (a containers.ExtractionResult instance, essentially a list of extract.Detection instances) with a data format that maps well onto RAM.
Most of the work will be adapting the PySE unit tests and TraP to accept this new format for the source measurements.

@suvayu will be running TraP in debug mode to figure out exactly which source parameters TraP "eats".
My guess is that it will be the 17 floats and 1 boolean that extract.Detection.serialize produces, but that remains to be verified.
That seems to be the case, since we have

```python
serialized = [r.serialize(ew_sys_err, ns_sys_err) for r in results]
```

in tkp/steps/source_extraction.py

If confirmed, that would mean that on the PySE side our work would consist of collecting these 18 quantities in a single array, with one row per source.
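A minimal sketch of what such an array could look like, assuming each source yields 17 floats plus one boolean (the rows here are made up for illustration; in the real code they would come from extract.Detection.serialize):

```python
import numpy as np

# Hypothetical serialized output: 17 floats + 1 boolean per source.
fake_rows = [
    [float(i) for i in range(17)] + [True],
    [float(i) * 0.5 for i in range(17)] + [False],
    [float(i) * 2.0 for i in range(17)] + [True],
]

# Collect all measurements in one contiguous 2D array:
# one row per source, one column per quantity. The boolean is
# stored as 0.0/1.0 so the array keeps a single float dtype.
measurements = np.asarray(fake_rows, dtype=np.float64)
print(measurements.shape)  # (number of sources, 18)
```

A single contiguous float array like this is cache-friendly and cheap to hand over to downstream consumers such as TraP, compared to a Python list of Detection objects.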

@tmillenaar (Contributor, Author) commented

I can adapt to whatever structure you choose; if you would like to use a 2D numpy array, then I can work with that :)

3 participants