Interleaved Document Addition and Document Deletion makes the auto-batching inefficient. #554
-
Suggested by Brent Dillingham.
Perhaps we could send deletions as part of the partial update:
[
  {
    "id": 123,
    "title": "Shazam ⚡️"
  },
  {
    "id": 123,
    "_delete": true
  },
  {
    "id": 123,
    "color": "green"
  }
]
Final state:
{
  "id": 123,
  "color": "green"
}
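To make the intended semantics concrete, here is a minimal Python sketch that replays such a payload in order. It only models the proposal: `_delete` is the hypothetical marker suggested in this thread, not an attribute Meilisearch is known to support, and `apply_partial_updates` is an illustrative helper name.

```python
from typing import Any


def apply_partial_updates(
    updates: list[dict[str, Any]], primary_key: str = "id"
) -> dict[Any, dict[str, Any]]:
    """Replay partial updates in order and return the resulting documents.

    ASSUMPTION: `_delete: true` is the hypothetical marker proposed in this
    discussion; it is not a documented Meilisearch feature.
    """
    index: dict[Any, dict[str, Any]] = {}
    for update in updates:
        doc_id = update[primary_key]
        if update.get("_delete") is True:
            # A deletion drops everything accumulated so far for this id.
            index.pop(doc_id, None)
            continue
        # A partial update merges its fields into the current document.
        index.setdefault(doc_id, {}).update(update)
    return index


updates = [
    {"id": 123, "title": "Shazam ⚡️"},
    {"id": 123, "_delete": True},
    {"id": 123, "color": "green"},
]
final_state = apply_partial_updates(updates)
print(final_state)  # {123: {'id': 123, 'color': 'green'}}
```

Because the deletion sits between the two updates, the title is discarded and only the fields sent after the deletion survive.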
-
Could using a boolean field on the document, combined with a filter that removes it from the search results, be a decent workaround instead of deleting it?
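A rough sketch of what that workaround might look like, written as plain Python that only builds the request bodies. The field name `is_deleted` and both helper functions are assumptions made for illustration; the field would also have to be declared as a filterable attribute, and every document would need to carry it explicitly for the `= false` filter to match.

```python
import json
from typing import Any


def soft_delete_payload(ids: list[int]) -> list[dict[str, Any]]:
    """Partial-update documents that flag records as deleted.

    ASSUMPTION: `is_deleted` is a hypothetical field name chosen for this example.
    """
    return [{"id": doc_id, "is_deleted": True} for doc_id in ids]


def search_body(query: str) -> dict[str, Any]:
    """Search request body that hides soft-deleted documents via a filter."""
    return {"q": query, "filter": "is_deleted = false"}


# These bodies could be sent as ordinary document updates, so they stay in
# the same kind of task as every other update.
payload = soft_delete_payload([123, 456])
body = search_body("shazam")
print(json.dumps(payload))
print(json.dumps(body))
```

The flagged documents remain in the index, so they would still need a periodic real deletion if index size matters.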
-
Hello @brentd 👋 Following up on this subject: before considering a dedicated API endpoint for it, we will explore a way to batch additions and deletions together. See meilisearch/meilisearch#3440. Thank you!
-
Hello, everyone following this discussion 👋
We have just released the first RC (release candidate) of Meilisearch containing the batching of interleaved document addition and deletion! You can test it by running:
docker run -it --rm -p 7700:7700 -v $(pwd)/meili_data:/meili_data getmeili/meilisearch:v1.1.0-rc.0
You are welcome to share any feedback about this new implementation in this discussion. If you encounter any bugs, please report them here.
🎉 The official stable release containing this change will be available on 3rd April 2023.
-
Hey folks 👋 It has been released with v1.1 ✨
-
Asked on the Meilisearch Slack community by Brent Dillingham.
With auto batching, I’ve noticed that document updates and deletes are batched separately, in the order in which the tasks entered the queue, which for our workload makes auto batching fairly ineffective since we have deletes interleaved with our document updates.
In other words, each batch is much smaller than it could otherwise have been. We have ~2M documents and are updating the index several times per minute, so we’re fairly reliant on auto batching to use Meilisearch effectively.
Is there any workaround that would allow the auto batcher to group updates and deletes together? For example, is there a _delete: true special attribute I could use when performing a bulk update?
Use case:
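The effect described in the question can be illustrated with a toy model. This is a simplified sketch, assuming only that consecutive tasks of the same kind are merged into one batch in queue order; it is not Meilisearch's actual scheduler, and the task shape (`uid`, `kind`) is made up for the example.

```python
from itertools import groupby
from typing import Any


def batch_by_kind(tasks: list[dict[str, Any]]) -> list[tuple[str, list[int]]]:
    """Group consecutive tasks of the same kind, preserving queue order.

    ASSUMPTION: a simplified stand-in for the behaviour described in the
    question, not the real auto-batching implementation.
    """
    return [
        (kind, [task["uid"] for task in group])
        for kind, group in groupby(tasks, key=lambda task: task["kind"])
    ]


# Deletes interleaved with updates, as in the workload described above.
tasks = [
    {"uid": 0, "kind": "update"},
    {"uid": 1, "kind": "update"},
    {"uid": 2, "kind": "delete"},
    {"uid": 3, "kind": "update"},
    {"uid": 4, "kind": "delete"},
    {"uid": 5, "kind": "update"},
]
batches = batch_by_kind(tasks)
for kind, uids in batches:
    print(kind, uids)
```

In this model six tasks produce five batches, because every delete cuts the run of updates short. If updates and deletes could share a batch, the same queue would collapse into one.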