How It Works

Getting Started with PostgreSQL Triggers in a Distributed SQL Database

September 24, 2019

Distributed SQL How It Works How To PostgreSQL

Getting Started with PostgreSQL Triggers in a Distributed SQL Database

Triggers are a basic feature that all monolithic SQL systems like Oracle, SQL Server and PostgreSQL have supported for many years. They are very useful in a variety of scenarios ranging from simple audit logging, to advanced tasks like updating remote databases in a federated cluster. In this blog, we’ll look at examples of INSERT, UPDATE and INSTEAD OF triggers in Yugabyte DB.

What’s Yugabyte DB? It is an open source, high-performance distributed SQL database built on a scalable and fault-tolerant design inspired by Google Spanner.

…

PostgreSQL Compatibility in YugabyteDB 2.0

September 17, 2019

By Jimmy Guerrero

ACID Transactions CockroachDB Distributed SQL How It Works PostgreSQL YugabyteDB

PostgreSQL Compatibility in YugabyteDB 2.0

YugabyteDB 2.0 is now generally available, featuring the production readiness of the PostgreSQL compatible Yugabyte SQL API (YSQL). In addition to Jepsen testing results, new performance benchmarks, and ecosystem integrations, YugabyteDB 2.0 offers a high degree of PostgreSQL compatibility, with plans for future support.

Best Practices and Recommendations for Distributed SQL on Kubernetes

September 11, 2019

By Andrew Nelson

How It Works How To Kubernetes

Best Practices and Recommendations for Distributed SQL on Kubernetes

YugabyteDB and Kubernetes have very complementary design principles because they both rely on an extensible and flexible API layer, as well as a scale-out architecture for performance and availability. In this blog post we’ll look at best practices and recommendations when choosing Kubernetes as the cluster foundation for a distributed SQL system. This will begin with a review of relevant architectural decisions of the YugabyteDB. Then we’ll walk you through how to handle the provisioning,

…

Low Latency Reads in Geo-Distributed SQL with Raft Leader Leases

August 20, 2019

By Karthik Ranganathan

ACID Transactions Distributed SQL How It Works

Low Latency Reads in Geo-Distributed SQL with Raft Leader Leases

Note: This post contains interactive animations that explain how some of these complex algorithms work. Please view this post in a suitable media (at least 1000px by 600px screen resolution) for best results.

In this blog post, we are going to dive deep into the read performance of Raft – why read performance can take a hit and how it can be improved using leader leases. Additionally, we will also look at how to make the correctness guarantees around leader leases stronger.

…

How Data Sharding Works in a Distributed SQL Database

June 6, 2019

By Sid Choudhury

Distributed SQL How It Works YugabyteDB

How Data Sharding Works in a Distributed SQL Database

Enterprises of all sizes are embracing rapid modernization of user-facing applications as part of their broader digital transformation strategy. The relational database (RDBMS) infrastructure that such applications rely on suddenly needs to support much larger data sizes and transaction volumes. However, a monolithic RDBMS tends to quickly get overloaded in such scenarios. One of the most common architectural patterns used to scale an RDBMS is to “shard” the data. In this blog, we will learn what data sharding is and how it can be used to scale a SQL database.

…

How to Handle Runaway Queries in a Distributed SQL Database

May 28, 2019

By Rahul Desirazu

Distributed SQL How It Works

How to Handle Runaway Queries in a Distributed SQL Database

Runaway queries are queries that scan through a large set of data. Such queries consume vast amounts of I/O and CPU resources of the database in the background, even if the results appear as harmless timeouts to the end user or the client application. How do runaway queries get executed in the first place, anyway? Everyone who uses databases has at some point or another entered SELECT * from some_large_table, only to realize they forgot to add a LIMIT n clause.

…

5 Reasons Why Apache Kafka Needs a Distributed SQL Database

May 21, 2019

By Sid Choudhury

Distributed SQL How It Works Kafka

5 Reasons Why Apache Kafka Needs a Distributed SQL Database

Modern enterprise applications must be super-elastic, adaptable, and running 24/7. However, traditional request-driven architectures entail a tight coupling of applications. For example, App 1 asks for some information from App 2 and waits. App 2 then sends the requested information to App 1. This sort of app-to-app coupling hinders development agility and blocks rapid scaling.

In event-driven architectures, applications publish events to a message broker asynchronously. They trust the broker to route the message to the right application,

…

Achieving Fast Failovers After Network Partitions in a Distributed SQL Database

May 15, 2019

By Timur Yusupov

Distributed SQL Google Spanner How It Works Jepsen Tests

Achieving Fast Failovers After Network Partitions in a Distributed SQL Database

In February of this year, Kyle Kingsbury of Jepsen.io was conducting formal testing of YugabyteDB for correctness under extreme and unorthodox conditions. Obviously, simulating all manner of network partitions is part of his testing methodology. As a result, during his testing he spotted the fact that although nodes would reliably come back after a failure, the recovery itself was taking roughly 25 seconds to occur. We certainly didn’t like the sound of that!

…

6 Technical Challenges Developing a Distributed SQL Database

April 26, 2019

By Karthik Ranganathan

Amazon Aurora Distributed SQL Google Spanner How It Works Jepsen Tests

6 Technical Challenges Developing a Distributed SQL Database

You can join the discussion on HackerNews here.

We crossed the three year mark of developing the YugabyteDB database in February of 2019. It has been a thrilling journey thus far, but not without its fair share of technical challenges. There were times when we had to go back to the drawing board and even sift through academic research to find a better solution than what we had at hand.

…

How to Achieve High Availability, Low Latency and GDPR Compliance in a Distributed SQL Database

April 3, 2019

By Sid Choudhury

Distributed SQL How It Works How To MongoDB PostgreSQL

How to Achieve High Availability, Low Latency and GDPR Compliance in a Distributed SQL Database

YugabyteDB is purpose built for geo-distributed applications that require high availability, high performance and regulatory compliance. In this blog, we are going to “look under the hood,” to explore exactly how YugabyteDB distributes data across multiple clouds, regions and availability zones.

How It Works

Explore Distributed SQL and YugabyteDB in Depth