Skip to content

A simple, robust and flexible interface for streaming document events out of MongoDB.

License

Notifications You must be signed in to change notification settings

jgdef/mongo-follower

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

59 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mongo Follower

A simple, robust and flexible interface for streaming document events out of MongoDB.

Maven

    <dependency>
      <groupId>com.traackr</groupId>
      <artifactId>mongo-follower</artifactId>
      <version>1.0.0</version>
    </dependency>

Key features

  • Resumable. An oplog timestamp is maintained which allows for restarting MongoFollower to resume processing. The timestamp can be configured with:
    • updateInterval: control how often the file is updated.
    • updateDelay: subtracts some amount of time from the oplog so that oplog events aren't skipped during a restart.
  • Initial export. By starting the process with an initial export all documents can be exported effortlessly.
  • Runner harness. The Runner utilities make setting up a follower a breeze.

How it works

MongoFollower is a two step process to efficiently export data from a collection then keep it synchronized.

  1. An initial export gets the bulk of your historic documents out of the collection. This can be disabled by setting initialExport to false.
  2. A MongoDB oplog tailing process is started which keeps processing events as they occur.

The interface is driven by extending the MongoEventListener and providing an instance of the listener to the Runner utility.

    FollowerConfig tc = FollowerConfig.builder()
        .listener(new MyListener())
        .initialExport(false)
        .mongoConnectionString(connectionString)
        .mongoDatabase(database)
        .mongoCollection(collection)
        .oplogFile("/tmp/testapp/oplogfile")
        .build();

    Runner.run(tc);

Alternatively a property file can be used to make it easier to switch between development and production environments:

    Properties props = new Properties();
    InputStream steam = ...; // open file or resource
    props.load(stream);

    Runner.run(props, new MyListener());

Options

property name builderOption / type default value description
n/a listener / MongoEventListener --- A class extending the MongoEventListener interface to process events.
initial-export initialExport / Boolean false Enabling this flag will cause MongoFollower to start the process by exporting all existing documents from MongoDB into the event listener.
oplog-file oplogFile / String --- The absolute path to the oplog file, this needs to be accessible for reading and writing by the user running MongoFollower.
mongo.oplog-delay mongoConnectionString / String --- Standard MongoDB connection string.
mongo.database mongoDatabase / String --- Database containing the collection to be followed.
mongo.collection mongoCollection / String --- Collection being followed.
mongo.oplog-delay oplogDelayMinutes / int 10 Number of minutes to lag behind the oplog. By delaying the oplog you can restart your process without missing any events. Note that this expects that it is ok to send the same event multiple times as long as they are sent in order.
mongo.oplog-interval oplogUpdateIntervalMinutes / int 10 The number of minutes to wait between updating the oplog timestamp file.
queue-size queue / BlockingQueue ArrayBlockingQueue<>(4000) Optionally override the queue implementation with something custom or with a different capacity.

Use cases

  • Event monitoring / auditing: setting a certain field indicates a document merge for your business
  • Synchronizing databases: Re-index to elasticsearch each time a document changes
  • Migration / Backup: map documents to Postgres, MySQL, Cassandra, CSV
  • Performance auditing: Detect inefficient usage patterns, such as many small frequent updates to a document instead of one larger update.

Example

Here is a short, self contained, correct (compilable), example which will process all documents of a given collection specified on the command line. To process all documents from a given instance, database, collection:

java -jar TestApp.jar mongodb://localhost:27017 test_database test_collection
import com.traackr.mongo.follower.exceptions.FailedToStartException;
import com.traackr.mongo.follower.interfaces.MongoEventListener;
import com.traackr.mongo.follower.model.Command;
import com.traackr.mongo.follower.model.Delete;
import com.traackr.mongo.follower.model.Insert;
import com.traackr.mongo.follower.model.FollowerConfig;
import com.traackr.mongo.follower.model.Update;
import com.traackr.mongo.follower.service.Runner;

import org.bson.Document;

import java.io.IOException;

/**
 * @author wwinder
 * Created on: 7/20/17
 */
public class TestApp implements MongoEventListener {
  public static void main(String[] args)
      throws IOException, FailedToStartException, InterruptedException {
    if (args.length != 3) {
      throw new IllegalArgumentException("Arguments: <connection string> <database> <collection>");
    }

    String connectionString = args[0];
    String database = args[1];
    String collection = args[2];

    FollowerConfig tc = FollowerConfig.builder()
        .listener(new TestApp())
        .dryRun(false)
        .initialExport(false)
        .mongoConnectionString(connectionString)
        .mongoDatabase(database)
        .mongoCollection(collection)
        .oplogFile("/tmp/testapp/oplogfile")
        .build();

    Runner.run(tc);

    while (true) {
      Thread.sleep(1000000);
    }
  }

  @Override
  public void exportDocument(Document doc) {
    System.out.println("Import: " + doc.toString());
  }

  @Override
  public void delete(Delete entry) {
    System.out.println("Delete: " + entry.getId());
  }

  @Override
  public void insert(Insert entry) {
    System.out.println("Delete: " + entry.getDocument().toString());
  }

  @Override
  public void update(Update entry) {
    System.out.println("Update: " + entry.getDocument().toString());
  }

  @Override
  public void command(Command entry) {
    System.out.println("Command: " + entry.toString());
  }
}

About

A simple, robust and flexible interface for streaming document events out of MongoDB.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages