[BUG] Data sent to the store immediately after the store startup/initialization can be lost #387
Open
1 of 11 tasks
Labels
bug
Something isn't working
Willingness to contribute
Yes. I can contribute a fix for this bug independently.
Venice version
latest dev as of May 1st 2023
System information
Describe the problem
https://venicedb.slack.com/archives/C03SLQWRSLF/p1682719789550259
I am using VeniceSystemProducer to send the data like
What I found out is that unless I wait (like Thread.sleep(30000)) after store init/empty push I get no errors on write but I also don't get the data. After I added the sleep the data sent to the store shows up in get requests.
The store is in ONLINE state, I tried waiting like this and it passes the check immediately:
Testcontainers starts venice components then the test runs "--new-store"/"--update-store "/"--empty-push" using admin tool. This should be enough to start submitting the data.
Expected:
producer.put() to actually put the data into the store or fail.
Ideally a way to check the store for readiness.
Tracking information
No response
Code to reproduce bug
Will be available in PulsarVeniceSinkTest once the PR is ready for submission
What component(s) does this bug affect?
Controller
: This is the control-plane for Venice. Used to create/update/query stores and their metadata.Router
: This is the stateless query-routing layer for serving read requests.Server
: This is the component that persists all the store data.VenicePushJob
: This is the component that pushes derived data from Hadoop to Venice backend.Thin Client
: This is a stateless client users use to query Venice Router for reading store data.Fast Client
: This is a stateful client users use to query Venice Server for reading store data.Da Vinci Client
: This is an embedded, stateful client that materializes store data locally.Alpini
: This is the framework that fast-client and routers use to route requests to the storage nodes that have the data.Samza
: This is the library users use to make nearline updates to store data.Admin Tool
: This is the stand-alone client used for ad-hoc operations on Venice.Scripts
: These are the various ops scripts in the repo.The text was updated successfully, but these errors were encountered: