Data Subscriptions: How to Consume a Data Subscription


Subscriptions are consumed by processing batches of documents received from the server. A SubscriptionWorker object manages document processing and the communication between the client and the server according to a set of configurations received upon its creation. We've introduced several ways to create and configure a SubscriptionWorker, from simply passing a subscription name to supplying a detailed configuration object, SubscriptionWorkerOptions.


SubscriptionWorker lifecycle

A SubscriptionWorker object starts its life when it is created through the document store's subscriptions() method:

subscriptionWorker = store.subscriptions().getSubscriptionWorker(Order.class, subscriptionName);

At this point, the worker only holds its configuration. No connection is opened and no processing takes place yet. To start processing, call the run method, which receives the batch processing logic to perform:

subscriptionRuntimeTask = subscriptionWorker.run(batch -> {
    // your logic here
});

From this point on, the subscription worker will start processing batches. If processing is aborted for any reason, the returned task (subscriptionRuntimeTask) will finish with an exception.
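As a rough illustration of the last point, the sketch below shows one way to find out how the returned task ended. It assumes the task behaves like a standard java.util.concurrent.CompletableFuture<Void>; the SubscriptionTaskObserver class and the describeOutcome method are hypothetical names introduced for this example and are not part of the client API.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;

// Hypothetical helper: no RavenDB types are used, only a plain CompletableFuture.
final class SubscriptionTaskObserver {

    // Blocks until the task finishes and reports how it ended.
    static String describeOutcome(CompletableFuture<Void> subscriptionRuntimeTask) {
        try {
            subscriptionRuntimeTask.join();
            return "completed";
        } catch (CompletionException e) {
            // join() wraps the original failure; the cause is the error that aborted processing.
            Throwable cause = e.getCause() != null ? e.getCause() : e;
            return "aborted: " + cause.getClass().getSimpleName();
        }
    }
}
```

In application code, the future passed in would be the subscriptionRuntimeTask returned by run. Any CompletableFuture<Void> works here, which keeps the sketch independent of a running server.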

Error handling

There are two categories of errors that may occur during subscription processing:

Internal mechanism errors

These errors occur during the normal server-client communication between the worker and the server.
If an unexpected error occurs, the worker will try to reconnect to the server. Under the following conditions, however, the worker will cease its operation and will not try to reconnect:

  • The subscription does not exist or was deleted
  • Another worker took over the subscription (see connection strategy)
  • The worker could not connect to any of the servers
  • The worker could not receive the node in charge of the task (this can happen when there is no leader)
  • Authorization exception
  • Exception during connection establishment

Unhandled exceptions in the user's batch processing logic

Example:

workerWBatch.run(x -> {
    throw new RuntimeException();
});

If an exception is thrown, the worker will abort processing of the current batch. A worker can be configured to handle the thrown exception in either of the following two ways:

  • By default, the worker will wrap the thrown exception in a SubscriberErrorException and rethrow it,
    terminating the subscription execution without acknowledging progress or retrying. The task returned by the run method will finish in an erroneous state, throwing a SubscriberErrorException.

  • If the ignoreSubscriberErrors value of SubscriptionWorkerOptions is set to true, the erroneous batch will be acknowledged without retrying, and processing will continue with the next batches.
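The two behaviors above can be modeled with a small, self-contained sketch. It is an illustrative simulation only and does not reproduce the client's internals: batches are plain strings, and BatchErrorHandlingSketch, SubscriberError, and process are hypothetical names, with SubscriberError standing in for SubscriberErrorException.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Illustrative model only: it mimics the two documented behaviors, not the client's internals.
final class BatchErrorHandlingSketch {

    // Hypothetical stand-in for the SubscriberErrorException wrapper.
    static final class SubscriberError extends RuntimeException {
        SubscriberError(Throwable cause) {
            super("Subscriber threw while processing a batch", cause);
        }
    }

    // Runs the handler over each batch and returns the batches that were acknowledged.
    static List<String> process(List<String> batches,
                                Consumer<String> handler,
                                boolean ignoreSubscriberErrors) {
        List<String> acknowledged = new ArrayList<>();
        for (String batch : batches) {
            try {
                handler.accept(batch);
            } catch (RuntimeException e) {
                if (!ignoreSubscriberErrors) {
                    // Default: wrap and rethrow, with no acknowledgement and no retry.
                    throw new SubscriberError(e);
                }
                // ignoreSubscriberErrors is true: fall through and acknowledge the failed batch.
            }
            acknowledged.add(batch);
        }
        return acknowledged;
    }
}
```

With a handler that fails on one batch, the default mode stops at that batch with the wrapped exception, while the ignoring mode acknowledges every batch, including the one that failed. The second mode therefore trades safety for progress: a failed batch is not delivered again.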

Reconnecting

The sections above describe situations in which a worker will try to reconnect to the server. Two key SubscriptionWorkerOptions fields control this behavior:

  • timeToWaitBeforeConnectionRetry - The time that the worker will 'sleep' before trying to reconnect.
  • maxErroneousPeriod - The maximum time the worker is allowed to remain in an erroneous state. After that time passes, the worker will stop trying to reconnect.
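How the two options interact can be sketched as a simple retry loop. This is a simplified model under stated assumptions, not the client's actual reconnection code: time is simulated rather than slept, the erroneous period is counted as the sum of the waits, and ReconnectSketch and attemptsUntilConnected are hypothetical names.

```java
import java.time.Duration;
import java.util.function.BooleanSupplier;

// Illustrative model of the two options; time is simulated so the example runs instantly.
final class ReconnectSketch {

    // Returns the number of attempts needed to connect, or -1 if the worker gave up.
    static int attemptsUntilConnected(BooleanSupplier tryConnect,
                                      Duration timeToWaitBeforeConnectionRetry,
                                      Duration maxErroneousPeriod) {
        if (timeToWaitBeforeConnectionRetry.isZero() || timeToWaitBeforeConnectionRetry.isNegative()) {
            throw new IllegalArgumentException("retry wait must be positive");
        }
        Duration erroneousFor = Duration.ZERO; // simulated time spent in the erroneous state
        int attempts = 0;
        while (true) {
            attempts++;
            if (tryConnect.getAsBoolean()) {
                return attempts;
            }
            // A real worker would sleep here; the sketch only advances simulated time.
            erroneousFor = erroneousFor.plus(timeToWaitBeforeConnectionRetry);
            if (erroneousFor.compareTo(maxErroneousPeriod) > 0) {
                return -1; // erroneous for longer than allowed: stop reconnecting
            }
        }
    }
}
```

In this model, a shorter wait means more attempts fit inside the same erroneous period, while a longer maximum period lets the worker ride out longer outages before giving up.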

Workers interplay

Only one subscription worker can be active on a subscription at a time. Nevertheless, there are scenarios in which an existing subscription worker and one that tries to connect need to interact. This interplay is configured by the strategy field of SubscriptionWorkerOptions.
The strategy field is an enum with the following values:
  • OPEN_IF_FREE - the server will allow the worker to connect only if no other worker is currently connected.
    If there is an existing connection, the incoming worker will throw a SubscriptionInUseException.
  • WAIT_FOR_FREE - if the client currently cannot open the subscription because it is in use by another client, it will wait for the previous client to disconnect and only then connect.
    This is useful in client failover scenarios where there is one active client and another one already waiting to take its place.
  • TAKE_OVER - the server will allow an incoming connection to replace an existing one. The outcome depends on the strategy of the existing connection:
    • The existing connection has a strategy that is not TAKE_OVER. In this case, the incoming connection will take over, causing the existing connection to throw a SubscriptionInUseException.
    • The existing connection has a strategy that is TAKE_OVER. In this case, the incoming connection will throw a SubscriptionInUseException.
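
The rules above can be summarized as a small decision function. The sketch below is only a model of the described outcomes and makes no claim about how the server implements them; OpeningStrategySketch, its nested Strategy and Outcome enums, and resolve are hypothetical names.

```java
// Illustrative decision table for the strategies described above.
final class OpeningStrategySketch {

    enum Strategy { OPEN_IF_FREE, WAIT_FOR_FREE, TAKE_OVER }

    enum Outcome { CONNECTS, REJECTED_IN_USE, WAITS_FOR_FREE, REPLACES_EXISTING }

    // existing == null means no worker is currently connected to the subscription.
    static Outcome resolve(Strategy existing, Strategy incoming) {
        if (existing == null) {
            return Outcome.CONNECTS;
        }
        switch (incoming) {
            case OPEN_IF_FREE:
                // Incoming worker fails with SubscriptionInUseException.
                return Outcome.REJECTED_IN_USE;
            case WAIT_FOR_FREE:
                // Incoming worker waits until the existing one disconnects.
                return Outcome.WAITS_FOR_FREE;
            case TAKE_OVER:
                // A TAKE_OVER connection cannot be displaced by another TAKE_OVER connection.
                return existing == Strategy.TAKE_OVER
                        ? Outcome.REJECTED_IN_USE
                        : Outcome.REPLACES_EXISTING;
            default:
                throw new IllegalArgumentException("Unknown strategy: " + incoming);
        }
    }
}
```

For example, an incoming TAKE_OVER worker replaces an existing WAIT_FOR_FREE or OPEN_IF_FREE worker, but is rejected when the existing worker also connected with TAKE_OVER.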