construct the hashtables iteratively to save memory up to `B` times #2

xu3kev · 2023-01-11T04:10:09Z

Each hashtable is constructed once and can be discarded after the union-find step, so we can rearrange the for loop to construct the B hashtables one by one instead of all at once. Each hashtable consumes a large amount of memory because it holds a (document idx, hash) entry for the entire dataset. Keeping only one hashtable alive at a time can therefore reduce the memory used by the hashtables by up to a factor of B.
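A minimal sketch of the rearranged loop, for illustration only. It is not the code in this PR: `UnionFind`, `cluster_one_band_at_a_time`, `signatures`, `num_bands` and `rows_per_band` are hypothetical names, and it assumes each document already has a signature of `num_bands * rows_per_band` hash values.

```python
from collections import defaultdict


class UnionFind:
    """Hypothetical minimal union-find over document indices 0..n-1."""

    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            # Keep the smaller index as the root so labels are deterministic.
            self.parent[max(ra, rb)] = min(ra, rb)


def cluster_one_band_at_a_time(signatures, num_bands, rows_per_band):
    """Build one hashtable per band, union its buckets, then drop it.

    Assumption: signatures[i] is a sequence of num_bands * rows_per_band
    hash values for document i.
    """
    uf = UnionFind(len(signatures))
    for band in range(num_bands):
        start = band * rows_per_band
        table = defaultdict(list)  # band key -> list of document indices
        for doc_idx, sig in enumerate(signatures):
            key = tuple(sig[start:start + rows_per_band])
            table[key].append(doc_idx)
        # Union every document that shares a bucket in this band.
        for bucket in table.values():
            for doc_idx in bucket[1:]:
                uf.union(bucket[0], doc_idx)
        del table  # only one hashtable is alive at any time
    return [uf.find(i) for i in range(len(signatures))]


signatures = [[1, 2, 3, 4], [1, 2, 9, 9], [7, 7, 3, 4], [5, 5, 5, 5]]
labels = cluster_one_band_at_a_time(signatures, num_bands=2, rows_per_band=2)
```

Only the union-find state persists across bands, so the peak cost is one hashtable rather than B. The saving is "up to" B times because the signatures and the union-find array still have to be held in memory.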

construct the hashtables iteratively to save memory usage

9242631

xu3kev force-pushed the reduce_mem branch from 5fdbf73 to 9242631 on January 11, 2023, 04:40

xu3kev changed the title ~~construct the hashtables iteratively to save memory by B times~~ construct the hashtables iteratively to save memory up to B times Jan 11, 2023
