Sharding: Document Extensions

Document extensions are data entities that are associated with documents. They currently include Counters, Attachments, Time Series, and Revisions.
From a user's point of view, document extensions behave similarly under sharded and non-sharded databases and are handled using the same API commands and Studio views.
Document extensions are identified by the ID of their parent document, and are always stored in the same bucket as the document.

When a document is resharded, its document extensions are transferred along with it to the new shard.
In this page:
- Document Extensions and Resharding
- Precautions and Recommendations
  - Precautions
  - Recommendations

Document Extensions and Resharding

When RavenDB runs resharding to balance the data load between shards, it copies documents from one shard to another and then removes their original copy.

If a change occurs in the original data after some of it was copied to its new location (e.g. a time series has updated entries in the original bucket after its parent document was copied), RavenDB will not remove the original document and its extensions from their original location until the new/modified data is relocated as well.

In all other respects, we handle the original document as if it had already moved, including reading from and writing to only the new document in its new bucket and shard.

Precautions and Recommendations

Precautions

The main contribution of sharded databases is their ability to manage huge volumes of data efficiently by serving it from multiple shards.
We should take extra care, then, to help the database maintain its ability to divide the data between shards.
The following points relate to this issue.

Time Series
Some time series can get very large. As they reside in a single bucket with their parent document, they cannot be spread between shards and may become hard to manage and use.
We recommend keeping the number of time series added to each document fairly small, and using practices such as rollup and retention.
Revisions
Revisions may accumulate in a large database, especially in an environment of rapid document modification. We can, however, create a revisions configuration that would take this into account, limit revisions quantity by number and age, and automatically remove those that are no longer needed.
Attachments
Remain aware of the size and amount of attachments in your database as well, and try to avoid adding many or oversized attachments to the same document, especially as a recurring method.
Counters
Counters are tiny entities, that weigh much less on the system than time series, revisions, or attachments. It is, however, recommended to keep an eye on them as well, to make sure they are not used in quantities that do pose a problem.

Recommendations

While planning our data model, we should prefer a larger amount of smaller documents over a smaller amount of heavier documents that are harder to relocate and balance.
As explained above, we should limit the size and amount of document extensions and spread them among many documents. Where possible, we can use wise features like time series rollups to summarize a large amount of data using a tiny amount of space.
It takes longer to retrieve related documents (and their extensions) when they are stored on different shards. To accelerate such operations, we can store related documents in the same bucket in advance.

see on GitHub

RavenDB

RavenDB Cloud

Try

Experience interactive demos and playground server

RavenDB Docs

RavenDB Cloud Docs

Documentation Guide

Download

Features

Performance

Comparison

What’s New

Demo

Bootcamp

Webinars

Workshops

Inside RavenDB Book

GitHub

StackOverflow

Articles

Whitepapers

Events

Promotional Materials

Unlock your business potential

Use Cases

Articles

Whitepapers

Press Releases

Industry Reports

Performance

Comparison

Proof of Concept Program

Academic Program

Events

What’s New

Roadmap

On-premise Pricing

Cloud Pricing

Support

Proof of Concept Program

Academic Program

Sharding: Document Extensions

Document Extensions and Resharding

Precautions and Recommendations

Precautions

Recommendations

Related Articles

Counters

Attachments

Time Series

Revisions

RavenDB

RavenDB Cloud

Try

RavenDB Docs

RavenDB Cloud Docs

Documentation Guide

Download

Features

Performance

Comparison

What’s New

Demo

Bootcamp

Webinars

Workshops

Inside RavenDB Book

GitHub

StackOverflow

Articles

Whitepapers

Events

Promotional Materials

Use Cases

Articles

Whitepapers

Press Releases

Industry Reports

Performance

Comparison