You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when add 1 billion values to the bloom filters, it cost nearly 4 hours on my server, I think it's possible to calculate hashes for batches of values in multithreads.
The text was updated successfully, but these errors were encountered:
My strategy was to create a lot of filters in parallel (lets say you have billion values; then you can create 1000 filters adding values to them in parallel) and then merge them using the .union() function.
when add 1 billion values to the bloom filters, it cost nearly 4 hours on my server, I think it's possible to calculate hashes for batches of values in multithreads.
The text was updated successfully, but these errors were encountered: