Non Relational Databases Today Cost Less in Time, Money, and Headache

With one simple change you can reduce developer time on your database, reduce cost to your cloud platform, and reduce latency in your applications.

Impedance mismatch is when there is a nontrivial difference between the way that your data is modeled and the way the data is stored.

That’s why the right data model is a huge time saver for your release cycle and DevOps. Using a relational database to store your data against an application using objects to manage it is like working with a square key and a round keyhole.

Your application code will need a lot of sandpaper and duct tape to get your systems to work together.

You may ask yourself, can’t I just round the edges by creating classes with properties and methods that smooth the data from the database to the app and back?

The challenge is not just in the application objects, but in the data queries.

Let’s start with a simple class and some basic data about fathers:

    
    class User {
        string Name;
        Int Age;
    }

Now, in my SQL database, I make my table:

    
    Create table Users (Name string, Age int);

Here is my data table:

Name	Age
Peter	35
Mike	43

Now, let’s add just one more property to make it interesting. How about, array Children[] to track their kids?

    
    class User {
        string Name;
        Int Age;
        string[] Children;
    }

Now, how can we store an array into a table of rows and columns?

Here is one approach:

Name	Age	Children
Peter	35	Meg, Chris, Brian, Stewie
Mike	43	Greg, Marcia, Jan, Peter, Bobby, Cindy

But this is not an array inside the database. It is a string.

Now I have to redo my class code to adapt:

    
    class User {
        string Name;
        Int Age;
        string[] Children => DB_Children.Split(‘,’);
    }

This is an excellent example of an impedance mismatch. I have the model of my object in my class, and I have how it’s stored in the database.

This is an elementary example, and already you have to dedicate the majority of your resources to manage the mismatch. Adjusting your object to fit the data is not a simple job.

What happens when our examples get hairy and start to resemble the real world?

How about querying my database for all users that have a child named Greg?

Here is the query:

    
    select * from Users where Child.Name = ‘Greg’

but the way it’s worded will be:

    
    select * from Users where Children like ‘%greg,%’

This is because the data is modeled to make all the children one big string array with elements separated by commas. It’s going to take longer to process and return a results set.

You will also get a match for names like:

“Gregory”

But if a User has a single child named Greg, there wouldn’t be a match because there will be no ‘,’ comma after the name Greg. This will also not match “Greg” the Youngest.

In one query, something as simple as finding users with a child with a specific name becomes an enormous task. You now have to optimize things on the database end as well. Queries using LIKE in this manner also not use indexes, so will be forced to do a full scan, slowing things down when you have a lot of data.

See how the data is stored is different from how the information is represented in memory. The database vs. the application’s “version” of the data creates the impedance mismatch.

The challenge for developers using relational databases is that it is too easy to create queries that don’t deliver the exact data sets you are trying to capture with your queries. This requires more time to create different queries and to test them to confirm their users are always getting exactly what they are asking for, putting stress on your release cycle and DevOps process.

This forces developers to spend more time on their database and less time on their applications. It produces lots of potholes when developing how your application works with your data.

Non Relational Databases Today Can Avoid This Nightmare

RavenDB lets me get to the business of writing code and not mess around with schema, migrations, and all the misery that comes with a relational database.

What if we made 2 tables, one for parents and one for kids with the key-value being the parent ID?

Here is what is stored in the relational database:

Name	Age	Parent ID	Child ID
Peter	35	3
Mike	43	2
Greg	18	2	1
Marcia	17	2	2
Peter	14	2	3
Jan	12	2	4
Bobby	8	2	5
Cindy	6	2	6
Meg	12	3	7
Chris	14	3	8
Brian	5	3	9
Stewie	0.9	3	10

Table 1. People Table

Child ID	Parent ID
1	2
2	2
3	2
4	2
5	2
6	2
7	3
8	3
9	3
10	3

Table 2. Children Table

Let’s try a new query:

Give me all the parents who have children under the age of five.

I have an object with properties, and I want to query that. Here is the question:

  
  select * from Users where Children.Age <= 5

In RavenDB, a document oriented non relational database, this is your query:

  
  from Users where Children[].Age <= 5

using today’s databases in SQL, this is your query:

  
  select * from Users as parent
  join FilialTies as ft on parent.Id = ft.ParentId
  join Users as child on ft.ChildId = child.Id
  where child.Age <= 5

That’s a lot of work for your system!

What are the results set? It should be Peter (parent ID = 3).

The RQL query is straightforward. Just grab the document property Parent where there is a child under the age of five.

The SQL asks you to do some more work.

Let’s grab the table again. Here we have the parents and the children with their parent Ids.

Name	Age	Parent
Peter	35
Mike	43
Greg	18	2
Marcia	17	2
Peter	14	2
Jan	12	2
Bobby	8	2
Cindy	6	2
Meg	12	3
Chris	14	3
Brian	5	3
Stewie	0.9	3

We need to join them where children are indicated.

Note that we are talking here about the simplest possible scenario. A real-world schema could drown you in the details, especially when you build something that scales out.

child.Name	child.Age	parent.Name	parent.Id
Greg	18	Mike	2
Marcia	17	Mike	2
Peter	14	Mike	2
Jan	12	Mike	2
Bobby	8	Mike	2
Cindy	6	Mike	2
Meg	12	Peter	3
Chris	14	Peter	3
Brian	5	Peter	3
Stewie	0.9	Peter	3

That’s a query within a query just to put the dataset of potential matches together.

So even if you rearrange your classes to take in relational data more efficiently, the queries themselves are inefficient, resulting in more calls to the database, computing costs, and performance lags.

As the query stands, we are taking every parent.Id with a child under the age of five. But Peter, parent.Id 3, has two children under or equal to age 5.

Wouldn’t his name appear twice in the results set? Your query is returning a Cartesian Product, or redundant data.

Based on a simple query, we see the added workload using SQL versus non relational databases. While there are ways around all of these issues, they add friction to the process and require additional developer time and resources to solve.

Here are the main costs that come with the relational model:

You need to divert more developer time to create, deploy, and test queries until you are confident those queries return the most accurate results to your users.
The amount of work your database has to do can be exponentially greater by answering multiple questions to return results for a single query. Using the document model, databases today can serve a single query with a single query.
You run the risk of getting multiple data processed in your results. This requires further developer effort on your database and added memory and computing costs from your cloud platform. You get hit with added expenses in time and money.

RavenDB

RavenDB Cloud

Try

Experience interactive demos and playground server

Download

RavenDB Cloud

Features

Performance

Comparison

What’s New

RavenDB Docs

RavenDB Cloud Docs

Documentation Guide

Demo

Bootcamp

Webinars

Workshops

Inside RavenDB Book

GitHub

StackOverflow

Articles

Whitepapers

Events

Promotional Materials

Unlock your business potential

Use Cases

Articles

Whitepapers

Press Releases

Industry Reports

Performance

Comparison

Proof of Concept Program

Academic Program

Events

What’s New

Roadmap

On-premise Pricing

Cloud Pricing

Proof of Concept Program

Academic Program

Non Relational Databases Today Cost Less in Time, Money, and Headache

Non Relational Databases Today Can Avoid This Nightmare

Woah, already finished? 🤯

Related Articles

Cloned Dictionary vs. Immutable Dictionary vs. Frozen Dictionary in high traffic systems

RavenDB News: June, 2024

Recording: Building a Database Engine in C# & .NET

Watch Live Demo

RavenDB

RavenDB Cloud

Try

Download

RavenDB Cloud

Features

Performance

Comparison

What’s New

RavenDB Docs

RavenDB Cloud Docs

Documentation Guide

Demo

Bootcamp

Webinars

Workshops

Inside RavenDB Book

GitHub

StackOverflow

Articles

Whitepapers

Events

Promotional Materials

Use Cases

Articles

Whitepapers

Press Releases

Industry Reports

Performance

Comparison

Proof of Concept Program