The Distributed SQL Blog

Thoughts on distributed databases, open source and cloud native

Announcing YugaByte DB 1.1 and Company Update

The team at YugaByte is excited to announce that YugaByte DB 1.1 is officially GA! You can download the latest version from our Quick Start page. New in this release:

YugaByte DB Open Source


Ravi Murthy: Starting a New Chapter with YugaByte

In case you missed the announcement, Ravi Murthy has joined YugaByte as our VP of Engineering. Read on to learn more about his experiences leading the teams who managed the explosive growth of applications and data at Facebook, plus what’s next at YugaByte!

After almost 7 years at Facebook, I am super-excited to start in my new role as YugaByte’s VP of Engineering.


Jepsen Testing on YugaByte DB

At YugaByte, our mission is to build a robust, reliable, distributed OLTP database. Needless to say, we take the correctness and technical accuracy of our claims very seriously. That is why we love a testing framework like Jepsen, which helps verify correctness, and why we are fans of Kyle Kingsbury’s work!

Here is a summary of what we have done so far with regard to Jepsen:

  • We have performed our own DIY-style Jepsen testing
  • The YugaByte DB Jepsen testing repository is open source
  • For the current suite of Jepsen tests for YugaByte DB that we have tested in a loop,


Building a High Growth Business by Monetizing Open Source Software

Whenever a venture-funded software infrastructure startup takes the open source route to market, a few questions emerge:

  • What open source license and project governance model will it choose?
  • How will it monetize the open source project?
  • What if AWS, Microsoft Azure or Google Cloud offer the startup’s open source project as a managed service?


Understanding How YugabyteDB Runs on Kubernetes

As we reviewed in “Docker, Kubernetes and the Rise of Cloud Native Databases”, Kubernetes has benefited from rapid adoption to become the de facto choice for container orchestration. This has happened in a short span of only 4 years since Google open sourced the project in 2014. YugabyteDB’s automated sharding and strongly consistent replication architecture lends itself extremely well to containerized deployments powered by Kubernetes orchestration.


Benchmarking an 18 Terabyte YugabyteDB Cluster with High Density Data Nodes

For ever-growing data workloads such as time series metrics and IoT sensor events, running a high-density database cluster where each node stores terabytes of data makes perfect sense from a cost efficiency standpoint. If we spin up new data nodes only to get more storage per node, we waste a significant amount of expensive compute resources. However, running multi-terabyte data nodes with Apache Cassandra as well as other Cassandra-compatible databases (such as DataStax Enterprise) is not an option.


How Does the Raft Consensus-Based Replication Protocol Work in YugabyteDB?

As we saw in “How Does Consensus-Based Replication Work in Distributed Databases?”, Raft has become the consensus replication algorithm of choice when it comes to building resilient, strongly consistent systems. YugabyteDB uses Raft for both leader election and data replication. Instead of having a single Raft group for the entire dataset in the cluster, YugabyteDB applies Raft replication at an individual shard level where each shard has a Raft group of its own.
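
To make the per-shard idea concrete, here is a minimal toy sketch in Python. It is not YugabyteDB code and it does not implement Raft itself: the names `RaftGroup`, `shard_for` and `build_groups`, the hash-based key placement, the round-robin replica layout and the replication factor of 3 are all assumptions made for illustration. It only shows that each shard carries its own replica set, leader and log, and that a write commits once a majority of that one group acknowledges it.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class RaftGroup:
    """Hypothetical model of one shard's Raft group (replicas, leader, term, log)."""
    shard_id: int
    replicas: List[str]
    leader: str
    term: int = 1
    log: List[Tuple[int, str, str]] = field(default_factory=list)

    def majority(self) -> int:
        # Smallest number of replicas that forms a quorum in this group.
        return len(self.replicas) // 2 + 1

    def replicate(self, key: str, value: str, acks: Set[str]) -> bool:
        # Commit only if the leader and a majority of THIS group's replicas acked.
        # Acks from nodes outside the group are ignored.
        group_acks = acks & set(self.replicas)
        if self.leader not in group_acks or len(group_acks) < self.majority():
            return False
        self.log.append((self.term, key, value))
        return True


def shard_for(key: str, num_shards: int) -> int:
    # Assumed placement rule: hash the key, then take it modulo the shard count.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def build_groups(nodes: List[str], num_shards: int,
                 replication_factor: int = 3) -> Dict[int, RaftGroup]:
    # Assumed layout: round-robin replicas, first replica starts as leader.
    # Real leader election is not simulated here.
    groups = {}
    for shard_id in range(num_shards):
        replicas = [nodes[(shard_id + i) % len(nodes)]
                    for i in range(replication_factor)]
        groups[shard_id] = RaftGroup(shard_id, replicas, leader=replicas[0])
    return groups


if __name__ == "__main__":
    groups = build_groups(["n1", "n2", "n3", "n4"], num_shards=4)
    group = groups[shard_for("user:42", 4)]
    committed = group.replicate("user:42", "hello", {group.leader, group.replicas[1]})
    print(group.shard_id, group.leader, committed)
```

Because every shard has its own group, leadership in this sketch ends up spread across the nodes, and losing one node affects only the groups that node belongs to.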
