Full Scale Performance: Single Process, Sharrow On, Explicit Chunking #19
Comments
Run with full sample, sharrow on, and single process (1 TB memory, Intel Xeon Gold 6342 @ 2.8GHz machine).
Chunking is set to
For these models, the argument
Here's what the memory profile looks like:
[memory profile image: explicit chunking run]
Compare the above memory profile to the one with the same settings, except no chunking:
[memory profile image: no chunking run]
Here's a comparison table of max memory usage for each step, with a negative difference meaning the chunking run uses less memory:
[comparison table not recovered]
Out of the models with explicit chunking turned on, I am not seeing any real difference between chunking and no chunking in trip_destination, but:
Small differences in other model steps may be expected, but some others show bigger differences than I expected. I should note that the no-chunking run was from two weeks ago and did not set the numba and OpenBLAS environment variables to 0 (the chunking run did), which may explain part of the difference.
Performed again on an RSG machine and got similar results (compare the memory profile below to #6 (comment), which was run on the same machine immediately before). Set
It looks like some investigation of the trip destination spikes is warranted, as the peak did not drop much there. Overall run time increased from 1267 mins without chunking to 1291 mins with chunking, or 21.1 to 21.5 hrs. Peak memory went from 442 GB to 321 GB.
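To put the trade-off reported above in relative terms, here is a minimal sketch that computes the percentage change in run time and peak memory from the four figures quoted in this comment. The function and variable names are illustrative only and are not part of ActivitySim.

```python
def pct_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Figures quoted in the comment above (RSG machine runs).
runtime_no_chunk_min = 1267   # minutes, no chunking
runtime_chunk_min = 1291      # minutes, explicit chunking
peak_no_chunk_gb = 442        # GB, no chunking
peak_chunk_gb = 321           # GB, explicit chunking

runtime_pct = pct_change(runtime_no_chunk_min, runtime_chunk_min)
memory_pct = pct_change(peak_no_chunk_gb, peak_chunk_gb)

print(f"run time: {runtime_no_chunk_min / 60:.1f} h -> "
      f"{runtime_chunk_min / 60:.1f} h ({runtime_pct:+.1f}%)")
print(f"peak memory: {peak_no_chunk_gb} GB -> "
      f"{peak_chunk_gb} GB ({memory_pct:+.1f}%)")
```

On these numbers, explicit chunking costs roughly 2% more run time in exchange for a peak memory reduction of roughly 27%.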
This is the issue to report on memory usage and runtime performance...
data_dir: "data-full" (full scale skims, 24333 MAZs)
households_sample_size: 0 (full scale 100% sample of households)
sharrow: "require"
multiprocess: false (single process)
chunk_training_mode: explicit