The Distributed SQL Blog

Thoughts on distributed databases, open source and cloud native

Part 1: Deploying a Distributed SQL Backend for Apache Airflow on Google Cloud

Apache Airflow is a popular platform for programmatically authoring, scheduling, and monitoring workflows. Airflow has been deployed by companies like Adobe, Airbnb, Etsy, Instacart, and Square. The advantage of defining workflows as code is that they become more maintainable, versionable, testable, and collaborative. Airflow is used to author these workflows as directed acyclic graphs (DAGs) of tasks.

Read More

Follow-up: Postgres Conference Webinar – Introduction to YugabyteDB

I delivered the webinar “YugabyteDB: a distributed PostgreSQL database” last week. It was hosted by Postgres Conference in a series that they’d arranged so that the speakers who’d been selected to speak at their big annual conference in New York City in March could reach their audience. COVID-19 brought that event’s cancellation. This brief post tells you where you can find the edited videorecording and the PDF version of the slides that I used.

Read More

Oracle vs PostgreSQL: First Glance – Testing YugabyteDB’s Compatibility

Roland Takacs wrote an interesting blog post titled Oracle vs PostgreSQL: First Glance earlier this month. The genesis for his blog post was that he was in the middle of migrating his current Oracle tech stack to Python, parquet files, and PostgreSQL. As such, Roland thought it might be a good exercise to document the various Oracle features he was accustomed to and figure out what the equivalent functionality was in PostgreSQL.

Read More

Boosting Performance for Small Distributed SQL Data Sets with Colocated Tables

Editor’s note – This post was updated July 2020 with new release information

An Introduction to Colocated Tables

In YugabyteDB v2.1, we released a new feature in beta: colocated tables. And we were excited to announce the general availability of colocated tables, along with many other exciting new features,

Read More

YugabyteDB Engineering Update – May 6, 2020

YugabyteDB Release Updates

We are excited to announce that YugabyteDB 2.1.5 is GA! The 2.1.5 release shipped with over 40 new enhancements and fixes. Also, yesterday we hit a milestone that we are very proud of–we welcomed our 1000th member to the YugabyteDB community Slack channel! Congrats to Rob Sami for being the 1000th member,

Read More

Bringing Truth to Competitive Benchmark Claims – YugabyteDB vs CockroachDB, Part 2

In part 1 of this blog series, we highlighted multiple factual errors in the Cockroach Labs analysis of YugabyteDB. In this second post we provide the next layer of detail behind YugabyteDB’s architecture, with an emphasis on comparing it to that of CockroachDB’s.

Contents of this post

TLDR

Yugabyte SQL is based on a reuse of PostgreSQL’s native query layer.

Read More

Bringing Truth to Competitive Benchmark Claims – YugabyteDB vs CockroachDB, Part 1

This is the first in a two part blog series which highlights factual errors in the Cockroach Labs analysis of YugabyteDB. In the second post in this series, we provide the next layer of detail behind YugabyteDB’s architecture, with an emphasis on comparing it to that of CockroachDB’s.

Contents of this post

Introduction

At Yugabyte,

Read More

Why We Built YugabyteDB by Reusing the PostgreSQL Query Layer

Reusing PostgreSQL’s native query layer instead of writing a new PostgresQL-compatible query layer ground up has been one of the best design decisions we have made in YugabyteDB. As outlined in the challenges we faced building a distributed SQL database, we have battle scars to prove this insight – we started writing a PostgreSQL-compatible query layer from scratch before realizing that we simply cannot build the world’s best cloud native RDBMS in a timely manner if we persist down this path.

Read More