Distributed key-value store in Go with support for rebalancing and deletion of volume servers.
The store consists of a master server and n volume servers. The master server uses BadgerDB to store the "metakeys" - the keys that are saved in the volume servers, and which volume server they are stored in. The volume servers store the data as files.
The store uses jump consistent hash to quickly and efficiently calculate the correct bucket (volume server) in the range [0, n) to store the key.
Jump consistent hash is an extremely efficient algorithm that shows significant performance improvements over traditional hashing key distribution algorithms that we know. This projects aims to use it in practice.
When the master server is started, it checks if volume servers have been added. If so, it rebalances some keys by moving them to other volume servers in order to get a balanced distribution using jump consistent hash.
So, if you wish to add a volume server, you need to change the master's config yaml file accordingly, make sure all the volume servers are running, and restart the master server.
NOTE: When adding volume servers, make sure to add them to the bottom of the list in the master's config yaml file since the store works with ascending indices.
When deleting a volume server, the store will move all its keys to a new volume server chosen by the jump consistent hash alogirthm. It will then re-balance the rest of the cluster.
In order to delete a volume server from the cluster, you need to follow these steps:
- Make sure the master's config yaml file contains all the volume servers including the one you wish to delete
- Make sure all the volume servers are up, including the one you wish to delete
- Shut down the master server and run with
./tdkvs master -config=<config file> -delete=<index>
whereindex
is the index of the volume server in the yaml file (starting from 0) - Remove the volume server from the master's config yaml file and shut the volume server down
- Re run the master server as usual
Download the source code and build.
./tdkvs master -config=<config file>
The config yaml file for the master server should be as follows:
port: 3000
volumes:
- http://10.0.0.1:3001
- http://10.0.0.2:3001
- http://10.0.0.3:3001
./tdkvs volume -config=<config file>
The config yaml file for the volume server should be as follows:
port: 3001
path: /storage_directory/
Endpoint | Method | Description |
---|---|---|
/get/<key> | GET | Retrieve a value |
/set/<key> | PUT | Add or set the value of a key |
/delete/<key> | DELEET | Delete a key-value pair |