Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about dictionary option #7

Open
ethanresnick opened this issue Mar 8, 2023 · 0 comments
Open

Question about dictionary option #7

ethanresnick opened this issue Mar 8, 2023 · 0 comments

Comments

@ethanresnick
Copy link

Hi, thanks for this awesome package!

My understanding of zstd is that messages compressed with a dictionary can only be decompressed if that dictionary is available at decompression time, and that the dictionary is not embedded in the compressed data, so it must be provided out-of-band (with the options this package supports). Is that right?

If so, I'm very confused how a dictionary would work with Kafka, because my understanding is that the broker needs to decompress the batch of compressed messages it receives to (e.g.) find the boundaries between messages, verify the offsets, and (in the case of Confluent Schema Registry) validate the messages against the topic's schema.

Am I misunderstanding the extent to which the broker decompresses/processes/validates the incoming messages? If not, how does this all work with dictionaries + zstd?

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant