The Distributed SQL Blog

Thoughts on distributed databases, open source, and cloud native

An Adventure in Multi-Cloud with Kubernetes and Distributed Data

At this year’s Distributed SQL Summit Asia 2021, Leon Kuperman from CAST AI presented the talk, “An Adventure in Multi-Cloud with Kubernetes and Distributed Data.”

Setting the Scene

CAST AI is an AI-driven cloud optimization platform for Kubernetes that helps companies cut their cloud bill and prevent downtime. In the presentation,

Read More

Creating Standard Patterns to Load Data into YugabyteDB with Apache Airflow

At this year’s Distributed SQL Summit Asia 2021, Viraj Parekh and Pete DeJoy from Astronomer presented the talk, “Creating Standard Patterns to Load Data in YugabyteDB with Apache Airflow.”

Apache Airflow is an open source workflow orchestrator that is used by data teams of all sizes. Airflow is trusted by organizations like Marriott,

Read More

Reimagining the RDBMS for the Cloud

Perspectives from 5 years of building a cloud native database

YugabyteDB just turned 5 years old, and I cannot help but reminisce about our journey in building this database over the last half-decade. I vividly remember the genesis of the YugabyteDB project, which started with Kannan, Mikhail, and myself meeting for lunch at a restaurant to discuss the future of cloud native databases.

Read More

Achieving Compliance with YugabyteDB

This is the first in a series of posts about how different compliance and regulatory frameworks work and how YugabyteDB can be an essential part of a company’s compliance efforts. This installment is an overview of the different international compliance frameworks companies encounter on their journeys with customers.

What’s the Purpose of a Compliance Framework?

A compliance framework is a set of guidelines,

Read More

YugabyteDB Fundamentals Training Q&A – Jan 28, 2021

Yesterday we had almost 200 folks show up for the latest “YugabyteDB Fundamentals and Certification” training session. We had a lot of questions during the session and weren’t able to get to them all! In this blog I have compiled the majority of them and provided answers in a simple Q&A format.

Missed yesterday’s training? You can catch our next training session,

Read More

Presentation Recap: The Art of the State: Serverless Computing and Distributed Data – Joe Hellerstein, UC Berkeley and Trifacta

We were delighted to have Joe Hellerstein, professor of Computer Science at UC Berkeley and co-founder and Chief Strategy Officer at Trifacta, give the day 2 opening keynote at this year’s 2020 Distributed SQL Summit.

If you weren’t able to attend or want a refresher, here’s the summary and playback of his keynote presentation.

Read More

Distributed SQL Summit Recap: Envoy and Service Meshes for Databases

At the Distributed SQL Summit 2020, Christoph Pakulski, software engineer at Tetrate, and Prasad Radhakrishnan, VP of data engineering at Yugabyte presented the talk “Envoy and Service Meshes for Databases: What the Future Holds”. In the talk, they explored the topics of Envoy, service meshes–specifically Istio, and how they intersect with the database world.

Read More

Presentation Recap: Modernizing Application Development with GraphQL and Distributed SQL

At the Distributed Summit 2020 , Allison Kunz, Solutions Engineer at Hasura, presented the talk “Modernizing Application Development with Planet-scale GraphQL and Distributed SQL”. In the talk she covered what is GraphQL, why GraphQL, why Hasura GraphQL Engine, and a checklist of what it takes to have enterprise-grade GraphQL APIs in production, focusing on performance,

Read More

Distributed SQL Summit Recap: Pinterest’s Exploration of Distributed SQL

At the Distributed SQL Summit 2020, Lianghong Xu – Engineering Manager & Tech Lead, Pinterest – presented the talk “Pinterest’s Exploration of Distributed SQL”. In the talk he covered the evolution of storage at Pinterest, the role that the HBase ecosystem plays within the company, current challenges and opportunities for innovation, and finally, their exploration of Distributed SQL as a viable solution to some of these challenges.

Read More