Add an `ensureSize` function #6

alberic89 · 2024-08-12T20:35:17Z

A suggestion for the ensureSize function.

There is a lot of code duplication with `resize`, but I don't see how I can avoid it without losing optimisations.
Otherwise we would need three additional resize functions, one for each "group" of resize.

alberic89 · 2024-08-13T10:33:14Z

Is it a good idea to execute `ensureSize` during each `scoreImpl`?

fjebaker · 2024-08-13T12:32:56Z

I am reluctant to include an `ensureSize` member, as I am not convinced of its benefits. `resize` made sense, as it allows the user to shrink or expand the buffers as needed, but the user shouldn't need to be constantly managing memory here, and I would prefer to encourage over-allocating rather than constantly checking and re-allocating.

I would instead consider something like `fn hasSize(haystack_len: usize, needle_len: usize) bool`, which can be used to check if resizing is needed. Then, if a user really does want to dynamically allocate the sizes on the fly, they can easily write a three-line function that checks `hasSize`, calls `resize` if false, and then calls `score`.
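For illustration, here is a minimal sketch of what such a user-side wrapper might look like. The `Finder` struct is a hypothetical stand-in and not the library's actual type: its fields, the `resize` signature, and the placeholder `score` (an exact substring search rather than a fuzzy score) are all assumptions made so the example is self-contained. Only the `hasSize` signature is taken from the suggestion above.

```zig
const std = @import("std");

// Hypothetical stand-in for the library's finder. Only the buffer
// bookkeeping relevant to this discussion is modelled.
const Finder = struct {
    allocator: std.mem.Allocator,
    buffer: []i32,
    max_haystack: usize,
    max_needle: usize,

    fn init(allocator: std.mem.Allocator, max_haystack: usize, max_needle: usize) !Finder {
        return Finder{
            .allocator = allocator,
            .buffer = try allocator.alloc(i32, max_haystack * max_needle),
            .max_haystack = max_haystack,
            .max_needle = max_needle,
        };
    }

    fn deinit(self: *Finder) void {
        self.allocator.free(self.buffer);
    }

    // Proposed check: true if the current buffers can hold the inputs.
    fn hasSize(self: *const Finder, haystack_len: usize, needle_len: usize) bool {
        return haystack_len <= self.max_haystack and needle_len <= self.max_needle;
    }

    // Assumed signature: reallocates the buffer for the new maximum sizes.
    fn resize(self: *Finder, max_haystack: usize, max_needle: usize) !void {
        self.buffer = try self.allocator.realloc(self.buffer, max_haystack * max_needle);
        self.max_haystack = max_haystack;
        self.max_needle = max_needle;
    }

    // Placeholder for the real scoring: it never allocates, so it cannot
    // fail. It returns the index of the first exact match, if any.
    fn score(self: *const Finder, haystack: []const u8, needle: []const u8) ?usize {
        std.debug.assert(self.hasSize(haystack.len, needle.len));
        return std.mem.indexOf(u8, haystack, needle);
    }
};

// The user-side wrapper: check, grow if needed, then score.
fn scoreGrowing(finder: *Finder, haystack: []const u8, needle: []const u8) !?usize {
    if (!finder.hasSize(haystack.len, needle.len)) {
        // Only ever grow, so an earlier, larger allocation is kept.
        try finder.resize(
            @max(finder.max_haystack, haystack.len),
            @max(finder.max_needle, needle.len),
        );
    }
    return finder.score(haystack, needle);
}
```

With this shape, all allocation failures surface in the wrapper through `try`, while `score` itself stays free of memory operations.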

The optimization of only resizing rows or cols respectively is a very small optimization, and it's not in the hot path, so it makes very little difference. The largest memory operation is (re)allocating the matrices, which will have to be done regardless of whether the rows or cols grow. Given that it's also a little awkward to fit into the current code, I am inclined to reject it.

Would you agree?

> Is it a good idea to execute ensureSize during each scoreImpl ?

No. `scoreImpl` should not allocate or free memory, so that it cannot raise errors and its execution behaviour stays consistent. The idea is very much that `init` sets up the finder (raising errors on failing preconditions), and then `score` uses it.

alberic89 · 2024-08-13T16:41:51Z

Having thought about it, I understand what you mean. I come from the Python world, so many principles of memory management are not familiar to me, and my poor English doesn't help. To me, it seemed better to hide the complexity of memory allocation as much as possible. But that contradicts the Zig principles.

So yes, I recognize that it was a bad idea, and we can reject it.

I will add a `hasSize` function from a clean base in another PR, so we can close this one.

alberic89 added 2 commits August 12, 2024 22:33

feat: add an ensureSize function

1fbdcce

feat: avoid code duplication for resize

00e901c

alberic89 closed this Aug 13, 2024
