The Distributed SQL Blog

Thoughts on distributed databases, open source, and cloud native

TPC-C Benchmark: Scaling YugabyteDB to 100,000 Warehouses

We’re excited to announce a new milestone in our TPC-C benchmark journey! YugabyteDB can now scale up to 100,000 warehouses with an efficiency of 99.78%. More specifically, this results in 630,000 operations per second on the database cluster, observed over a two-hour period.

For those new to TPC-C, it’s an OLTP system benchmarking tool used to measure performance when handling transactions generated by a real-world OLTP application.

Read More

YugabyteDB and Apache Superset: Explore and Visualize Open Source Data

Enterprises today collect and store vast amounts of data. However, realizing business value from this data requires modern data exploration and visualization tools.

Apache Superset is a widely popular, open source data exploration and visualization platform used and supported by enterprises such as Airbnb, Lyft, and Nielsen. More specifically, Superset enables users to quickly and easily explore and visualize large amounts of data stored in YugabyteDB.

Read More

Linux Performance Tuning: Dealing with Memory and Disk IO

Linux is a general purpose operating system. This means it’s created to generally do what is right, instead of having specific code paths to perform what is right for a single specific task—and potentially be wrong for others.

Linux does not have tunable parameters for reserving memory for caching disk pages (the page cache), like operating systems such as HPUX (dbc_min_pct,

Read More

Securing YugabyteDB: Part 1 – Server-to-Server Encryption in Transit

Encryption in transit is a common requirement for client-to-server communication. It is particularly important for YugabyteDB, a 100% open source, distributed SQL database built to accelerate cloud native agility. YugabyteDB typically stores important user and customer data at an organization.

In this ongoing blog series, we take a look at the different aspects of encryption in transit for YugabyteDB.

Read More

A Matter of Time: Evolving Clock Sync for Distributed Databases

Distributed clock synchronization is critical for many applications, including distributed SQL databases. Clock synchronization needs to keep up with the other demands in our modern infrastructure, such as:

  • Applications that have increasing performance requirements while distributing data across different geographic regions
  • Network infrastructure and computing power that is improving constantly

A distributed SQL database is highly available and resilient to failures when deployed across a cluster of nodes.

Read More

My Yugabyte Journey: From Intern to Full-Time Software Engineer

Hello. My name is Tim Elgersma, and I’m a software engineering intern on the YSQL team at Yugabyte. I have one semester left in my bachelor of Computer Science program at the University of Waterloo. In this blog post, I’d like to talk about my experience interning here over the past several months, and why I’m excited to join the company full time upon graduation.

Read More

PostgreSQL Timestamps and Timezones: How to Navigate the Interval Minefield

This is the second of a two part blog post series about the date-time data types that PostgreSQL, and therefore YSQL, support. The first part dealt with the basic business of representing moments (when things happen). The relevant data types here are time, date, and timestamp—where the latter has a without time zone and a with time zone variant.

Read More

PostgreSQL Timestamps and Timezones: What You Need to Know—and What You Don’t

Anecdotal reports indicate that some PostgreSQL programmers are daunted by the date and time data types, and by how operations that use values of these data types might be affected by the session’s timezone setting. Even experienced developers struggle when they first embark on a critical project that relies on this functionality. YugabyteDB’s YSQL subsystem gives the application developer the same experience as PostgreSQL.

Read More

YugabyteDB Integrates with Arctype SQL Client

PostgreSQL has eaten the world. And so Yugabyte set out to build the most Postgres-compatible, scalable, and resilient database. YugabyteDB is not only wire compatible with PostgreSQL, it is code compatible by reusing the upper half of PostgreSQL. Thanks to this compatibility, YugabyteDB can connect to the vast majority of database tools that integrate with Postgres.

In a previous blog post,

Read More

YugabyteDB Savepoints: Checkpointing Work in Distributed Transactions

Yugabyte brings best-in-class performance, scalability, and availability to YugabyteDB, a fully PostgreSQL-compatible SQL database. Because YugabyteDB’s architecture uses PostgreSQL at the SQL layer, we get a long-tail of PostgreSQL compatibility for free. But implementing savepoints requires deeper integration into YugabyteDB’s distributed persistence and transaction layers. 

In this blog post, we’ll discuss why you should care about savepoints.

Read More