Deduplication: Replacing AVL tree with cache friendly data structure like B-tree #11606
vrajendra-datacore
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
B-tree has better memory locality and (CPU) caching. Multiple comparisons could be done once cache line is loaded (as each node has multiple values).
Once the incoming buffer is checksum'ed, it has to be looked up into dedup_table (AVL tree) to see if calculated checksum is already present or not. The intention is to reduce the look up time with B-tree.
Any thoughts/comments if we can see any improvement in performance?
Beta Was this translation helpful? Give feedback.
All reactions