Skip to content

Latest commit

 

History

History
109 lines (80 loc) · 4.29 KB

README.md

File metadata and controls

109 lines (80 loc) · 4.29 KB

Kafka backup: Restore Kafka Consumer Offsets from S3

An application allowing to read the consumer groups offsets stored in S3 and apply them to the target Kafka cluster. The S3 sink stores the information under the following path: bucket[/prefix]/group/topic/partition, with the content being the offset (8 bytes array).

Configuration

The application requires the configuration file. The configuration file in HOCON format supports the following options:

Configuration Option Description
Kafka connection settings All Kafka settings should be prefixed with kafka.. For example: kafka.bootstrap.servers = "localhost:9092".
S3 location - aws.bucket: The name of the S3 bucket where consumer group offsets are stored.
- aws.prefix (Optional): The prefix of objects within the bucket.
groups An optional, comma-separated list of consumer groups to restore. If not specified, all groups stored in S3 will be restored. For example: groups = group1, group2.
AWS connection mode - aws.mode: Set to credentials to use provided credentials or default for AWS default credentials provider chain.
AWS Access Key - aws.access.key: AWS access key ID (only when aws.mode is set to credentials).
AWS Secret Key - aws.secret.key: AWS secret access key (only when aws.mode is credentials).
AWS Region - aws.region: AWS region (only when aws.mode is credentials).
AWS HTTP Retries - aws.http.retries: How many times a failed request is attempted. Default is 5
AWS HTTP Retry interval - aws.http.retry.inteval: The time in milliseconds to wait before an HTTP operation is retried. Default is 50.

Examples

Example 1
kafka {
  bootstrap.servers = "localhost:9092"
  # Add other Kafka settings here
}

aws {
  bucket = "your-s3-bucket"
  prefix = "optional-prefix"
}

# Optional: Specify the consumer groups to restore
groups = "group1,group2"

aws {
  mode = "credentials" # or "default"
  access.key = "your-access-key"
  secret.key = "your-secret-key"
  region = "your-aws-region"
}
Example 2
kafka.bootstrap.servers = "localhost:9092"

aws.bucket = "your-s3-bucket"
aws.prefix = "optional-prefix"

groups = "group1,group2"

aws.mode = "credentials" # or "default"
aws.access.key = "your-access-key"
aws.secret.key = "your-secret-key"
aws.region = "your-aws-region"

Running the application

It requires at least Java 8 to run.

To run the application, use the following command:

java -jar restore-consumer-groups-offset.jar --config <path-to-config-file> [--preview]

To build the application, run:

mvn clean install

This creates a tar.gz file in the target directory. When unpacked it contains the application jar and the dependencies:

--bin
|--- restore.sh
--lib
|--- restore-consumer-groups-offset-1.0.0.jar
|--- ...

Using the shell to run the application is recommended, as it sets the classpath and the Java options:

./restore.sh --config <path-to-config-file> [--preview]

To format the code run:

  mvn com.coveo:fmt-maven-plugin:format

To add license header, run:

mvn license:format