Skip to content
forked from mendsley/bsdiff

bsdiff and bspatch are libraries for building and applying patches to binary files.

License

Notifications You must be signed in to change notification settings

drake127/bsdiff

 
 

Repository files navigation

bsdiff/bspatch

Build & Test License
bsdiff and bspatch are libraries for building and applying patches to binary files.

The original algorithm and implementation was developed by Colin Percival.
The algorithm is detailed in his paper, Naïve Differences of Executable Code. For more information, visit his website at http://www.daemonology.net/bsdiff/.

Contact

@drake127
https://github.com/drake127/bsdiff

@MatthewEndsley maintained this project separately from Colin's work, with the goal of making the core functionality easily embeddable in existing projects. Since Matthew no longer maintains his project, I took over with the original goal.

License

Copyright 2003-2005 Colin Percival
Copyright 2012-2018 Matthew Endsley
Copyright 2018-2020 Emanuel Komínek

This project is governed by the BSD 2-clause license. For details, see the file titled LICENSE in the project root folder.

Overview

There are two separate libraries in the project: bsdiff and bspatch. Each are self contained in bsdiff.c and bspatch.c. The easiest way to integrate is to simply copy the c file to your source folder and build it but static library is also provided.

The overarching goal was to modify the original bsdiff/bspatch code from Colin and eliminate external dependencies and provide a simple interface to the core functionality.

I've exposed relevant functions via the _stream classes. The only external dependency not exposed is memcmp in bsdiff.

This executable generates patches that are not compatible with the original bsdiff tool. The incompatibilities were motivated by the patching needs for the game AirMech https://www.carbongames.com and the following requirements:

  • Eliminate/minimize any seek operations when applying patches
  • Eliminate any required disk I/O and support embedded streams
  • Ability to easily embed the routines as a library instead of an external binary
  • Compile+run on all platforms we use to build the game (Windows, Linux, NaCl, OSX)

These incompatibilities also have adverse effect on patch size. Fortunately it's possible to use this project in both modes when used as a library and I plan to expose both formats in the future.

Compiling

The project uses CMake build tools and compiles under any moderately recent version of GCC, Clang or MSVC. If your compiler does not provide an implementation of <stdint.h> you can remove the header from the bsdiff/bspatch files and provide your own typedefs for the following symbols: uint8_t and int64_t.

The library itself can be built without any dependencies but to create proper patch files you need BZip2 library (libbz2-dev) as demonstrated by projects executables.

Examples

Each project has an optional main function that serves as an example for using the library. Simply define BSDIFF_EXECUTABLE or BSPATCH_EXECUTABLE to enable building the standalone tools.

Reference

bsdiff

enum bsdiff_stream_type
{
	BSDIFF_WRITECONTROL,
	BSDIFF_WRITEDIFF,
	BSDIFF_WRITEEXTRA
};

struct bsdiff_stream
{
	void * opaque;

	void * (* malloc)(size_t size);
	void (* free)(void * ptr);
	int (* write)(struct bsdiff_stream * stream, const void * buffer, size_t size,
	              enum bsdiff_stream_type type);
};

int bsdiff(const uint8_t * source, int64_t sourcesize, const uint8_t * target,
           int64_t targetsize, struct bsdiff_stream * stream);

In order to use bsdiff, you need to define functions for allocating memory and writing binary data. This behavior is controlled by the stream parameter passed to to bsdiff(...).

The opaque field is never read or modified from within the bsdiff function. The caller can use this field to store custom state data needed for the callback functions.

The malloc and free members should point to functions that behave like the standard malloc and free C functions.

The write function is called by bsdiff to write a block of binary data to the stream. The return value for write should be 0 on success and non-zero if the callback failed to write all data. In the default example, bzip2 is used to compress output data.

bsdiff returns 0 on success and -1 on failure.

bspatch

enum bspatch_stream_type
{
	BSDIFF_READCONTROL,
	BSDIFF_READDIFF,
	BSDIFF_READEXTRA,
};

struct bspatch_stream
{
	void * opaque;
	int (* read)(const struct bspatch_stream * stream, void * buffer, size_t length,
	             enum bspatch_stream_type type);
};

int bspatch(const uint8_t * source, const int64_t sourcesize, uint8_t * target,
            const int64_t targetsize, struct bspatch_stream * stream);

The bspatch function transforms the data for a file using data generated from bsdiff. The caller takes care of loading the old file and allocating space for new file data. The stream parameter controls the process for reading binary patch data.

The opaque field is never read or modified from within the bspatch function. The caller can use this field to store custom state data needed for the read function.

The read function is called by bspatch to read a block of binary data from the stream. The return value for read should be 0 on success and non-zero if the callback failed to read the requested amount of data. In the default example, bzip2 is used to decompress input data.

bspatch returns 0 on success and -1 on failure. On success, new contains the data for the patched file.

About

bsdiff and bspatch are libraries for building and applying patches to binary files.

Topics

Resources

License

Stars

Watchers

Forks

Languages

  • C 93.9%
  • CMake 6.1%