
Implement schema checksums for quick incompatibility detection #19

Open
zah opened this issue Feb 22, 2020 · 5 comments

@zah
Contributor

zah commented Feb 22, 2020

Starting from a designated list of root types, you can use enumAllSerializedFields and Nim's signatureHash to automatically generate a "checksum" that uniquely identifies the version of the type schemas used in a particular build of a project (please note that this will also include all transitively reached types that may appear as record fields).

This checksum can be used in file formats and network protocols to quickly detect situations where you are dealing with an older incompatible version of your software (for formats such as SSZ which don't provide backwards compatibility).
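The idea above could be sketched roughly as follows. This is a hypothetical implementation, not code from this repo: the `schemaHash` name, the `!&`/`!$` mixing from `std/hashes`, and the symbols injected by `enumAllSerializedFields` (`fieldName`, `FieldType`) are all assumptions to be checked against the actual helper's definition.

```nim
import std/hashes
import serialization/object_serialization  # assumed location of enumAllSerializedFields

# Hypothetical sketch: fold the name and type of every serialized field
# into a single hash, recursing into nested object types so that all
# transitively reached schemas contribute to the checksum.
proc schemaHash*(T: type): Hash {.compileTime.} =
  result = result !& hash($T)
  T.enumAllSerializedFields:
    result = result !& hash(fieldName)       # injected symbol (assumed name)
    when FieldType is object:
      result = result !& schemaHash(FieldType)  # recurse into nested records
    else:
      result = result !& hash($FieldType)
  result = !$result
```

Because the proc runs at compile time, the checksum can be baked into a `const` and written into a file header or a protocol handshake, e.g. `const mySchemaVersion = schemaHash(MyRootType)` (with `MyRootType` standing in for one of the designated root types).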

@zah zah added the bounty label Feb 22, 2020
@disruptek

That's an awesome idea. How and where does it need to be implemented?

@jangko
Contributor

jangko commented Mar 1, 2020

where

Here in this repo, along with an adequate amount of tests to prove it generates a unique ID for each distinct type. Since it will be used to identify things, make sure it can be executed at compile time (e.g. assigned to a `const`).

how

You'll need to create a new public API, and you'll need to inspect Nim types via macros and compile-time procs. From there you turn those types into hashable/checksumable values, then produce a final "checksum" value.

Update the readme.md and you're done. The hardest part may be that you'll need to fight with the Nim compiler itself, identify its weaknesses/limitations regarding this voodoo-black-magic feature, and then report them along with any gotchas you've found.
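The kind of tests asked for above could look like the following sketch, assuming a compile-time `schemaHash(T)` API as described in this issue (the name is hypothetical; `V1`/`V2` are made-up example types): distinct schemas must yield distinct checksums, the result must be deterministic, and it must be usable in a `const`.

```nim
type
  V1 = object
    slot: uint64
  V2 = object
    slot: uint64
    extra: string    # schema change: the checksum must change too

# Computable at compile time, as required.
const
  h1 = schemaHash(V1)   # hypothetical API from this issue
  h2 = schemaHash(V2)

static:
  doAssert h1 != h2              # different schemas, different IDs
  doAssert h1 == schemaHash(V1)  # deterministic across evaluations
```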

@disruptek

The hope for incremental compilation is that, ultimately, it will be always-on. Whether the types are serialized to sqlite or some other format will be immaterial. We can use this today without any change to the compiler inputs, which I think is particularly attractive. Alternatively, we can reproduce the same functionality that the compiler already has.

I have a feeling you will want to go the latter route. That would, of course, allow you to loosen type validation to, say, validate a type of varchar(20) when stored to a varchar(30) field. I don't know if you want this; it's just the first enhancement that came to mind.

@jangko
Contributor

jangko commented Mar 5, 2020

The medium or destination of the bytes produced by our serializer will not have any impact on our deserializer. The serializer and deserializer each analyze their output/input type respectively.

This feature is also format-independent: whether it is a JSON serializer, protobuf serializer, msgpack serializer, or SSZ, the usage of this feature will be the same.

@zah
Contributor Author

zah commented Mar 5, 2020

The hope for incremental compilation is that, ultimately, it will be always-on. Whether the types are serialized to sqlite or some other format will be immaterial. We can use this today without any change to the compiler inputs, which I think is particularly attractive. Alternatively, we can reproduce the same functionality that the compiler already has.

I'm not sure I understand the reference to incremental compilation here, but if you imply that this form of signature checksum already exists in the form of signatureHash, please note that the main difference here is that we care only for the serialized fields (these may be a subset of all the fields and they can appear in a different order).

You need to check out the definition and the usages of enumAllSerializedFields. This is the helper mechanism that you can use to process the list of all serialized fields. The schemaHash API will just recursively use this helper to compute the final hash value.
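The helper's usage might look roughly like this. Illustrative only: the injected symbol names (`fieldName`, `FieldType`) and the `{.dontSerialize.}` field pragma are assumptions based on this library's typical conventions; check the actual definition of `enumAllSerializedFields` in the repo.

```nim
import serialization/object_serialization  # assumed module path

type Example = object
  internal {.dontSerialize.}: int  # excluded from serialization
  slot: uint64
  data: string

# The template unrolls its body once per *serialized* field at compile
# time, so `internal` should not appear here:
Example.enumAllSerializedFields:
  echo fieldName, ": ", $FieldType
```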
