Integrating fast_float to optionally replace strtod #1260

parthpatel · 2024-11-04T17:42:24Z

Fast_float is a header-only C++ library that parses doubles using SIMD instructions. The goal is to speed up sorted sets and other commands that parse doubles. A single-file copy of fast_float is included in this repo. This introduces an optional dependency on a C++ compiler.

The use of fast_float is enabled at compile time using the make variable USE_FAST_FLOAT=yes. It is disabled by default.
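A minimal sketch of what such a compile-time switch can look like on the C side. The helper name `parse_double` is hypothetical, the signature of `fast_float_strtod` is assumed to match `strtod`, and the assumption that the make variable ends up as a `USE_FAST_FLOAT` preprocessor define is not confirmed by this description.

```c
#include <stdlib.h>

/* Hypothetical sketch: assumes the build turns USE_FAST_FLOAT=yes into a
 * -DUSE_FAST_FLOAT compiler flag. */
#ifdef USE_FAST_FLOAT
/* Assumed signature for the C wrapper around fast_float (implemented in C++). */
double fast_float_strtod(const char *str, char **endptr);

static inline double parse_double(const char *str, char **endptr) {
    return fast_float_strtod(str, endptr);
}
#else
/* Default build: fall back to the C library, no C++ compiler needed. */
static inline double parse_double(const char *str, char **endptr) {
    return strtod(str, endptr);
}
#endif
```

With a wrapper like this, call sites stay identical in both configurations and only the build flag decides which parser is linked in.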

Fixes #1069.

madolson · 2024-11-04T20:39:42Z

I suppose I don't really understand the benefit of a submodule vs inlining. Since we aren't tightly controlling the versioning between the main repo and releases, it becomes much harder to know if there is a security issue impacting a specific release. At the very least we should be pinning a specific version of fast_float that we are pulling.

parthpatel · 2024-11-04T21:16:56Z

> Since we aren't tightly controlling the versioning between the main repo and releases, it becomes much harder to know if there is a security issue impacting a specific release. At the very least we should be pinning a specific version of fast_float that we are pulling.

We track the fast_float commit ID in the valkey repository with this change; see the line copied from my commit below. For every valkey commit, we can always look up the exact version of fast_float this way. Pulling a new fast_float version is as simple as a git pull on the submodule.

Submodule fast_float added at e800ca

madolson · 2024-11-04T21:27:53Z

Does it get pulled automatically as part of the release into the release artifacts?

parthpatel · 2024-11-04T21:44:57Z

> Does it get pulled automatically as part of the release into the release artifacts?

There is a "git submodule update --init" command in the Makefile to initialize it automatically. So yes, it will automatically check out the same commit every time during the build.

madolson · 2024-11-04T22:01:23Z

> There is a "git submodule update --init" command in Makefile to initialize it automatically. So yes, It will automatically checkout the same commit every time during build.

So we are adding a new dependency to the release process, since you need to be able to fetch the code from github. I think we should consider figuring out how to pull the code in when we do a release so that folks don't need to do git submodule update --init.

parthpatel · 2024-11-04T22:52:41Z

> > There is a "git submodule update --init" command in Makefile to initialize it automatically. So yes, It will automatically checkout the same commit every time during build.
>
> So we are adding a new dependency to the release process, since you need to be able to fetch the code from github. I think we should consider figuring out how to pull the code in when we do a release so that folks don't need to do git submodule update --init.

The other approach would be to just check in the whole git repo as a folder under valkey, which should work. What is the issue with a git dependency in GitHub release workflows? I can add a post-checkout hook that always initializes submodules on checkout, as long as checkout happens outside of the release process.

codecov · 2024-11-05T00:09:07Z

Codecov Report

Attention: Patch coverage is 83.33333% with 3 lines in your changes missing coverage. Please review.

Project coverage is 70.76%. Comparing base (86f33ea) to head (3f7cee8).
Report is 25 commits behind head on unstable.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/valkey-cli.c | 0.00% | 3 Missing ⚠️ |

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #1260      +/-   ##
============================================
+ Coverage     70.55%   70.76%   +0.20%     
============================================
  Files           115      117       +2     
  Lines         63158    63305     +147     
============================================
+ Hits          44561    44797     +236     
+ Misses        18597    18508      -89

| Files with missing lines | Coverage Δ | |
|---|---|---|
| src/debug.c | `53.17% <ø> (+0.12%)` | ⬆️ |
| src/resp_parser.c | `98.47% <100.00%> (ø)` | |
| src/sort.c | `94.82% <100.00%> (+0.01%)` | ⬆️ |
| src/t_zset.c | `95.65% <100.00%> (ø)` | |
| src/util.c | `71.47% <100.00%> (+0.04%)` | ⬆️ |
| src/valkey_strtod.h | `100.00% <100.00%> (ø)` | |
| src/valkey-cli.c | `55.53% <0.00%> (+1.69%)` | ⬆️ |

... and 25 files with indirect coverage changes

parthpatel · 2024-11-05T17:10:50Z

I am dropping the submodule idea for now, as it requires a larger discussion about the release process. I don't have access to @swaingotnochill's repository or PR, so I pulled his commit into this PR to preserve his authorship of the code he wrote. I integrated it with Valkey and fixed the Makefile. It should build now and will be ready to push.

I will work on benchmarking this separately. I can also turn off fast_float by default if folks have concerns and test it on my branch.

swaingotnochill

Feel free to modify. I am on vacation, so it will be difficult for me to work on this until I am back. Cheers

parthpatel · 2024-11-05T17:34:22Z

@zuiderkwast or @madolson, any pointers on how to solve the AlmaLinux issue with the missing g++?

cd fast_float && make
make[3]: Entering directory '/__w/valkey/valkey/deps/fast_float'
g++ -std=c++11 -O3 -fPIC -c fast_float_strtod.cpp -o fast_float_strtod.o
make[3]: g++: Command not found
make[3]: *** [Makefile:9: fast_float_strtod.o] Error 127
make[3]: Leaving directory '/__w/valkey/valkey/deps/fast_float'
make[2]: *** [Makefile:80: fast_float] Error 2
make[2]: *** Waiting for unfinished jobs....

zuiderkwast · 2024-11-05T20:30:49Z

I'm skeptical of submodules too. Offline builds get complicated. Source releases get complicated. So far, we've used either vendored dependencies or system-installed ones (like OpenSSL).

I prefer that we vendor this one. We could copy only one or a few files, as we've done with crc64 and some other small libraries?

madolson

Looks really close now, I'm going to just apply some of these comments right away.

.github/workflows/ci.yml

deps/fast_float_c_interface/fast_float_strtod.cpp

src/valkey_strtod.h

src/unit/test_valkey_strtod.c

src/valkey_strtod.h

src/unit/test_valkey_strtod.c

src/valkey_strtod.h

.github/workflows/ci.yml

madolson · 2024-11-22T04:59:21Z

Running ASAN, https://github.com/valkey-io/valkey/actions/runs/11966743688.

src/Makefile

madolson

LGTM. @eifrah-aws, can you help follow up with the CMake changes like we discussed offline?

zuiderkwast

Thanks for pushing this through! Just one nit.

deps/fast_float_c_interface/fast_float_strtod.cpp

swaingotnochill · 2024-11-23T13:15:53Z

should we do a benchmark on different architectures for this change?

zuiderkwast · 2024-11-23T16:24:37Z

> should we do a benchmark on different architectures for this change?

@swaingotnochill That never hurts! I think it only affects commands that parse floats, like INCRBYFLOAT and sorted sets, so I would only benchmark things like that.
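As a rough starting point, the parsing function can also be timed in isolation before benchmarking whole commands. The sketch below is an assumption-laden illustration, not part of this PR: `parse_fn` and `time_parser` are hypothetical names, and it measures only CPU time spent in the parser, not end-to-end command latency.

```c
#include <stdlib.h>
#include <time.h>

/* Any parser with a strtod-compatible signature can be plugged in. */
typedef double (*parse_fn)(const char *, char **);

/* Parses every input `rounds` times and returns the elapsed CPU seconds.
 * The running sum is stored in *checksum so the loop cannot be optimized away. */
static double time_parser(parse_fn parse, const char **inputs, size_t n,
                          int rounds, double *checksum) {
    double sum = 0.0;
    clock_t start = clock();
    for (int r = 0; r < rounds; r++) {
        for (size_t i = 0; i < n; i++) sum += parse(inputs[i], NULL);
    }
    clock_t stop = clock();
    *checksum = sum;
    return (double)(stop - start) / CLOCKS_PER_SEC;
}
```

Calling `time_parser(strtod, ...)` gives the baseline; a fast_float wrapper could be passed the same way, assuming it has the same signature. Running the same inputs on each target architecture keeps the comparison fair.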

parthpatel mentioned this pull request Nov 4, 2024

feat: Integration with fast_float #1170

Closed

1 task

parthpatel force-pushed the unstable branch from f24f45a to 3fb6671 Compare November 4, 2024 18:08

parthpatel added the pending-refinement This issue/request is still a high level idea that needs to be further refined label Nov 4, 2024

parthpatel marked this pull request as draft November 4, 2024 21:19

parthpatel force-pushed the unstable branch from 5d0c176 to 6aec117 Compare November 4, 2024 23:54

parthpatel force-pushed the unstable branch from f8c9b48 to e9f0b3e Compare November 5, 2024 17:05

swaingotnochill reviewed Nov 5, 2024

parthpatel requested a review from madolson November 5, 2024 17:35

parthpatel changed the title ~~Integrating fast_float as a git submodule with Valkey to replace strtod invocation~~ Integrating fast_float with Valkey to replace strtod invocation Nov 5, 2024

parthpatel marked this pull request as ready for review November 6, 2024 21:48

swaingotnochill and others added 10 commits November 6, 2024 22:01

feat (deps): adds fast_float library and a c wrapper for from_chars for replacing strtod

42c1d0c

Signed-off-by: Parth Patel <[email protected]>

Changes to complete the integration of fast_float into Redis codebase.

d25e29b

* Simplified the interface to remove if branches.
* Simplified Makefile to be more readable.
* Integrating fast_float with the redis code base in resp_parser.c file.

Signed-off-by: Parth Patel <[email protected]>

Fixing code-style issue due to empty space

939a766

Signed-off-by: Parth Patel <[email protected]>

Adding 32-bit c++ libraries to library path to fix 32-bit compilation issues.

c2d6ff6

Signed-off-by: Parth Patel <[email protected]>

Trying another approach to fix the CI workflows by installing required packages.

597af83

Signed-off-by: Parth Patel <[email protected]>

Adding -v to linker for more output to debug issues.

8d07687

Signed-off-by: Parth Patel <[email protected]>

32-bit stdc++ libraries are in a different folder, so needed to specify it explicitly in Makefile to fix 32-bit compilation issues.

4f03150

Signed-off-by: Parth Patel <[email protected]>

Also fixing the library to be non-cross version of libstdc++

e9e5209

Signed-off-by: Parth Patel <[email protected]>

This attempt uses the cross-compile libraries provided by ubuntu for i386 compilation

d7c72cf

Signed-off-by: Parth Patel <[email protected]>

Forwarding CFLAGS and LDFLAGS to fast_float Makefile to honor 32 bit compilations

d69ec21

Signed-off-by: Parth Patel <[email protected]>

parthpatel added 3 commits November 16, 2024 05:00

Applying yamlfmt suggestions

a69568f

Signed-off-by: Parth Patel <[email protected]>

Fixing 32-bit build by setting appropriate variables in fast_float_c_interface Makefile

2cb45b5

Signed-off-by: Parth Patel <[email protected]>

Fixing Makefile related issues failing CI

129fdbf

Signed-off-by: Parth Patel <[email protected]>

parthpatel requested review from madolson and zuiderkwast November 16, 2024 10:23

madolson reviewed Nov 18, 2024

Remove some unnecessary imports and use SPDX for license

43696b5

Signed-off-by: Madelyn Olson <[email protected]>

zuiderkwast reviewed Nov 18, 2024

src/valkey_strtod.h (outdated, resolved)

parthpatel added 2 commits November 19, 2024 02:32

Revising as per feedback

c8c3571

Signed-off-by: Parth Patel <[email protected]>

Fixing clang-format errors

97ef78c

Signed-off-by: Parth Patel <[email protected]>

parthpatel requested review from madolson and zuiderkwast November 19, 2024 20:29

Madelyn's opinion about workflows

91edab2

Signed-off-by: Madelyn Olson <[email protected]>

madolson reviewed Nov 22, 2024

.github/workflows/ci.yml (outdated, resolved)

madolson added 4 commits November 21, 2024 21:12

Fixed whitespace

dab5913

Signed-off-by: Madelyn Olson <[email protected]>

Remove exceptions to allow building with out c++ stdlib

41250e9

Signed-off-by: Madelyn Olson <[email protected]>

Remove extra build step

bc235ce

Signed-off-by: Madelyn Olson <[email protected]>

Remove fast float from daily, seems to need more work to build

0cfcc7f

Signed-off-by: Madelyn Olson <[email protected]>

zuiderkwast reviewed Nov 22, 2024

src/Makefile (outdated, resolved)

madolson approved these changes Nov 22, 2024

madolson requested a review from zuiderkwast November 22, 2024 21:51

zuiderkwast approved these changes Nov 23, 2024

deps/fast_float_c_interface/fast_float_strtod.cpp (outdated, resolved)

swaingotnochill approved these changes Nov 23, 2024

Don't set errno = 0 in fast_float_strtod

3f7cee8

Errno is never set to zero by any C standard library function. This function mimics strtod.

Signed-off-by: Viktor Söderqvist <[email protected]>
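The calling convention this commit relies on can be sketched as follows: because strtod leaves errno untouched on success, the caller clears it before the call and inspects it afterwards. The helper name `parse_double_strict` is hypothetical and uses plain `strtod`; a strtod-compatible replacement is assumed to be used the same way.

```c
#include <errno.h>
#include <stdlib.h>

/* Hypothetical helper: returns 1 and stores the value in *out on success,
 * returns 0 if the input is not a complete, in-range number. */
static int parse_double_strict(const char *s, double *out) {
    char *end;
    errno = 0; /* The caller resets errno; strtod never sets it to zero. */
    double v = strtod(s, &end);
    if (end == s || *end != '\0') return 0; /* nothing parsed, or trailing characters */
    if (errno == ERANGE) return 0;          /* out-of-range result reported by strtod */
    *out = v;
    return 1;
}
```

If the parsing function cleared errno itself, it would silently erase an error recorded by an earlier call, which is why mimicking strtod here means leaving errno alone on success.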

zuiderkwast changed the title ~~Integrating fast_float with Valkey to replace strtod invocation~~ Integrating fast_float to optionally replace strtod Nov 25, 2024

zuiderkwast merged commit c4920bc into valkey-io:unstable Nov 25, 2024
53 of 57 checks passed

zuiderkwast added the release-notes This issue should get a line item in the release notes label Nov 25, 2024
