The Distributed SQL Blog

Thoughts on distributed databases, open source, and cloud native

Distributed SQL Sharding: How Many Tablets, and at What Size?

The first answer to this question is the usual “it depends“. The second answer, thanks to YugabyteDB’s auto-splitting feature and distributed SQL sharding principles, is “don’t worry, this is managed automatically.” 

However, it’s still important to understand how sharding works, how to handle corner cases correctly, and how to split tablets to save resources.

Read More

Distributed SQL Deep Dive: Inside YugabyteDB’s Two-Layer Architecture

YugabyteDB is a 100% open source, distributed SQL database system. This single phrase expresses two distinct notions: a SQL database system, and a distributed database system. Historically, these notions were mutually exclusive. But current technology allows a single system to implement both notions. YugabyteDB does this with its two-layer architecture: an extensible query processing layer and a distributed document store.

Read More

YugabyteDB Migration: What About Those 19 Oracle Features I Thought I Would Miss?

In 2019, I gave a presentation with the title, “19 features you will miss if you leave Oracle.” The talk focused on features used every day with Oracle—and those that may not be available in other databases. However, the goal of this presentation was not to influence any decision or outcome. 

Migrating from a commercial database is not about covering a full set of features.

Read More

Multi-Cloud Distributed SQL: Avoiding Region Failure with YugabyteDB

As Amazon CTO Werner Vogels often reminds us: ”Everything fails, all the time”. This means any component of the infrastructure will fail at some point. However, with more and more software being deployed in the cloud, today’s infrastructure must provide redundancy. In addition, the applications that sit on top of it must provide continuity over failures.

This involves database replication when the failure concerns data in motion.

Read More

Distributed SQL Essentials: Sharding and Partitioning in YugabyteDB

A distributed SQL database provides a service where you can query the global database without knowing where the rows are. You connect to any node, without having to know the cluster topology. You query your tables, and the database will determine the best access to your data, whether it’s close to your client or geographically distant.

The organization of data,

Read More

How a Distributed SQL Database Boosts Secondary Index Queries with Index Only Scan

A distributed SQL database reads from remote nodes, which increases the need for optimal data access. From my 20 years of experience as a database consultant, here is the most overlooked optimization technique for any database that supports it: performance-critical queries should find their columns from the index structure, without making extra hops to the table.

Many myths against SQL,

Read More

Connecting to YugabyteDB with Arctype, a Collaborative SQL Client

YugabyteDB is PostgreSQL compatible. It uses the same protocol and SQL layer as PostgreSQL. This makes YugabyteDB compatible with a majority of database tools, because PostgreSQL is the most popular open source database on the planet. Here is an example with Arctype, a user-friendly collaborative SQL client to query and visualize data.

Setting up a database connection

1.

Read More