The Distributed SQL Blog

Thoughts on distributed databases, open source and cloud native

Distributed SQL Summit Schedule Now Live!

In two weeks, thought leaders, database builders, and application developers are coming together for a free online conference to push the boundaries of cloud native RDBMS forward. Distributed SQL (Virtual) Summit, now in its second year, is taking place September 15-17.

We’re excited to announce that the 2020 Distributed SQL Summit schedule is now live!

Read More

Row Counts of Tables in a SQL Schema & Database – PostgreSQL and YugabyteDB

Getting total row counts of data in tables across various dimensions (per-table, per-schema, and in a given database) is a useful technique to have in one’s tool belt of SQL tricks. While there are a number of use cases for this, my scenario was to get the per-table row counts of all tables in PostgreSQL and YugabyteDB as a first sanity check after migrating an application with the pre-existing data from PostgreSQL to YugabyteDB.

Read More

TPC-C Benchmark: 10,000 Warehouses on YugabyteDB

We are excited to announce that the TPC-C benchmark implementation for YugabyteDB is now open source and ready to use! While this implementation is not officially ratified by the TPC organization, it closely follows the TPC-C v5.11.0 specification.

For those new to TPC-C, the aim of the benchmark is to test how a database performs when handling transactions generated by a real-world OLTP application.

Read More

100% Committed to Open Source, YugabyteDB Community Update – June 11, 2020

While it is still early days, it is exciting to see the YugabyteDB community hit some cool milestones! We wanted to share them with you, plus update you some additional community related news.

Our Commitment to Social Justice

First things first. Recent events that resulted in the death of George Floyd at the hands of police have been traumatic and painful for many of us,

Read More

Bringing Truth to Competitive Benchmark Claims – YugabyteDB vs CockroachDB, Part 2

In part 1 of this blog series, we highlighted multiple factual errors in the Cockroach Labs analysis of YugabyteDB. In this second post we provide the next layer of detail behind YugabyteDB’s architecture, with an emphasis on comparing it to that of CockroachDB’s.

Contents of this post

TLDR

Yugabyte SQL is based on a reuse of PostgreSQL’s native query layer.

Read More

Bringing Truth to Competitive Benchmark Claims – YugabyteDB vs CockroachDB, Part 1

This is the first in a two part blog series which highlights factual errors in the Cockroach Labs analysis of YugabyteDB. In the second post in this series, we provide the next layer of detail behind YugabyteDB’s architecture, with an emphasis on comparing it to that of CockroachDB’s.

Contents of this post

Introduction

At Yugabyte,

Read More

5 Query Pushdowns for Distributed SQL and How They Differ from a Traditional RDBMS

A pushdown is an optimization to improve the performance of a SQL query by moving its processing as close to the data as possible. Pushdowns can drastically reduce SQL statement processing time by filtering data before transferring it over the network, filtering data before loading it into memory, or pruning out entire files or blocks that  do not need to be read.

Read More

Achieving 10x Better Distributed SQL Performance in YugabyteDB 2.1

When starting the YugabyteDB project, our founding thesis was to build a high-performance distributed SQL database for the cloud native era. Achieving high performance will always remain an ongoing initiative, especially when additional optimizations are required to support new features and new use cases. We are excited that the current YugabyteDB 2.1 release has a number of improvements that make Yugabyte SQL’s performance 10x better on average than the previous 2.0 release (from September 2019).

Read More

Four Data Sharding Strategies We Analyzed in Building a Distributed SQL Database

A distributed SQL database needs to automatically partition the data in a table and distribute it across nodes. This is known as data sharding and it can be achieved through different strategies, each with its own tradeoffs. In this post, we will examine various data sharding strategies for a distributed SQL database, analyze the tradeoffs, explain the rationale for which of these strategies YugabyteDB supports and what we picked as the default sharding strategy.

Read More