YugaByte DB

The YugaByte Database Blog

Thoughts on open source, cloud native and distributed databases

Benchmarking an 18 Terabyte YugaByte DB Cluster with High Density Data Nodes

For ever-growing data workloads such as time series metrics and IoT sensor events, running a highly dense database cluster where each node stores terabytes of data makes sense from a cost-efficiency standpoint. If we spin up new data nodes only to get more storage per node, we waste a significant amount of expensive compute. However, running multi-terabyte data nodes is not an option with Apache Cassandra or with other Cassandra-compatible databases (such as DataStax Enterprise).
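To make the cost argument concrete, here is a minimal sketch of the node-count arithmetic. It assumes the 18 TB figure is the total on-disk footprint of the cluster, and the per-node densities are hypothetical values chosen for illustration, not numbers from the benchmark.

```python
import math


def nodes_needed(total_tb: float, per_node_tb: float) -> int:
    """Minimum number of nodes needed to hold total_tb at per_node_tb each."""
    if total_tb <= 0 or per_node_tb <= 0:
        raise ValueError("sizes must be positive")
    return math.ceil(total_tb / per_node_tb)


# Hypothetical per-node densities in TB (assumptions, not measured values).
for density_tb in (0.5, 1.0, 4.5):
    count = nodes_needed(18, density_tb)
    print(f"{density_tb} TB per node: {count} nodes")
```

The lower the density each node can sustain, the more nodes (and therefore CPUs and memory) the same dataset requires, even when the workload does not need the extra compute.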

YugaByte Company and Database Update – Aug 3, 2018

$16 Million Funding Round

In case you missed the news earlier this summer, YugaByte raised an additional $16M of funding from Dell Technologies Capital and our previous investor Lightspeed Venture Partners. With the additional funding, we are accelerating investments in engineering, sales, and customer success to scale our support for enterprises building business-critical applications in the cloud.

Yes We Can! Distributed ACID Transactions with High Performance

ACID transactions are a fundamental building block when developing business-critical, user-facing applications. They simplify the complex task of ensuring data integrity while supporting highly concurrent operations. While they are taken for granted in monolithic SQL/relational DBs, distributed NoSQL/non-relational DBs either forsake them completely or support only a highly restrictive single-row flavor (see sections below). This loss of ACID properties is usually justified with a gain in performance (measured in terms of low latency and/or high throughput).

Achieving Sub-ms Latencies on Large Datasets in Public Clouds

YugaByte DB performance on large data sets

One of our users was interested to learn more about YugaByte DB’s behavior for a random read workload where the data set does not fit in RAM and queries need to read data from disk (i.e. an uncached random read workload).

The intent was to verify whether YugaByte DB handles this case with the optimal number of I/Os to the disk subsystem.

Scaling YugaByte DB to Millions of Reads and Writes

Writes are RF=3 with strong consistency; reads are leader-only, strongly consistent reads.

Here at YugaByte, we continuously push the limits of the systems we build. As a part of that, we ran some large cluster benchmarks to scale YugaByte DB to millions of reads and writes per second while retaining low latencies. This post goes into the details of our 50-node cluster benchmark.
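As a rough illustration of what RF=3 means for the writes in this benchmark, the sketch below computes the majority size and the number of replica failures a group can tolerate. It assumes majority-quorum replication, which the excerpt does not spell out, and the function names are hypothetical.

```python
def majority(replication_factor: int) -> int:
    """Smallest number of replicas that forms a strict majority."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return replication_factor // 2 + 1


def tolerated_failures(replication_factor: int) -> int:
    """Replicas that can be lost while a majority is still reachable."""
    return replication_factor - majority(replication_factor)


for rf in (3, 5):
    print(f"RF={rf}: majority={majority(rf)}, tolerates {tolerated_failures(rf)} failure(s)")
```

Under that assumption, an RF=3 write must reach two of the three replicas, and the group stays available if one replica is lost.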

Building a Strongly Consistent Cassandra with Better Performance

In an earlier blog on database consistency, we had a detailed discussion on the risks and challenges applications face in dealing with eventually consistent NoSQL databases. We also dispelled the myth that eventually consistent DBs perform better than strongly consistent DBs. In this blog, we will look more closely into how YugaByte DB provides strong consistency while outperforming an eventually consistent DB like Apache Cassandra.

YugaByte DB Architecture: Diverse Workloads with Operational Simplicity

YugaByte DB is a transactional, high-performance, geo-distributed operational database that converges multiple NoSQL and SQL interfaces into a unified solution. The v0.9 public beta of YugaByte DB is compatible with Apache Cassandra (CQL) and Redis APIs, with PostgreSQL under development. A fundamental design goal for YugaByte DB has been to provide the same transactional,

