[Feature Request] Implements fingerprint ingest processor #13612
Labels
enhancement
Enhancement or improvement to existing feature or request
ingest-pipeline
v2.16.0
Issues and PRs related to version 2.16.0
v3.0.0
Issues and PRs related to version 3.0.0
Is your feature request related to a problem? Please describe
Currently we have the community_id ingest processor, which generates a community ID flow hash for network flow tuples based on the community ID hash algorithm. For common data such as application logs or e-commerce data, we could also introduce a new type of ingest processor that generates a hash value from some or all of the fields in a document, much like a content hash. The fingerprint of each document can then be used to deduplicate documents and to collapse search results.
The new fingerprint ingest processor could be used to hash either a chosen subset of fields or all fields in a document. After the processor runs, a new field named fingerprint is added to each document. Users can then use the value of that field to deduplicate documents. Another use case is collapsing search results on that field.
Describe the solution you'd like
Add a new ingest processor that generates a fingerprint for each incoming document.
Related component
Indexing
Describe alternatives you've considered
Generate the fingerprint on the client side, which is not user-friendly.
Additional context
No response