The Distributed Data Show

The Distributed Data Show is your weekly source for the latest news and technical expertise to help you succeed in building large-scale distributed systems. Brought to you by the DataStax Developer Advocate team, we go in-depth with DataStax engineers and special guests from the broader data community. New episodes each Tuesday.

Subscribe for weekly updates! Also available on YouTube, iTunes, and your favorite podcast provider.

Share your feedback.

Check out our other podcasts.

Dec 11, 2018 • By: Cedrick Lunven, Christopher Splinter

The DataStax Apache Kafka™ Connector is one of the new and long-awaited features in DataStax Enterprise 6.7. It allows seamless data integration between Apache Kafka™ and DataStax Enterprise using Kafka Connect. Chris Splinter, product manager, explains what the connector can do and provides some details about how it can be used to help and empower developers.

Dec 04, 2018 • By: Amanda Moran

What’s new in DSE 6.7! Information and discussion about DSE Metric Collector, improvements to Search, improvements to backup and point-in-time restore, and finally the long-awaited DSE Kafka connector!

Nov 27, 2018 • By: Adron Hall

In this episode Adron Hall speaks with Luc Perkins about his work at the CNCF, Kubernetes, where projects are heading, and what projects they're working on. Adron also speaks with Luc about docs, interesting projects he's been seeing, skeleton code for projects, and lots of other topics.

Nov 20, 2018 • By: Jeff Carpenter

Over the past 4 years, Caroline has worked as a solutions engineer to help many customers adopt Apache Cassandra and DataStax Enterprise. In this episode Caroline shares how her conversations with those customers have changed over time, from an initial focus on scaling out databases, to an emphasis on microservices architecture, to a high-level interest in using machine learning to get insights from data.

Nov 13, 2018 • By: Adron Hall

In this episode Adron talks with Travis about starting and working in site reliability at a large retail enterprise. They tackle topics ranging from outages, database sizing, monitoring and observability, and disparate workloads to migrations between cloud providers. Then Travis and Adron head into a discussion about distributed caches and some of the questions we would need to ask to determine the functionality needed. The episode then wraps up with a few outtakes at the end.

Nov 06, 2018 • By: Patrick McFadin

Patrick welcomes Shogo Hoshii to the show to learn why Yahoo Japan is running over 200 clusters of Apache Cassandra and why NoSQL databases are starting to gain momentum in Japan.


Oct 30, 2018 • By: Patrick McFadin

Patrick McFadin talks with “Cassandra Archaeologist” Carlos Rolo of Pythian about best practices for upgrades in Apache Cassandra clusters.

Oct 23, 2018 • By: Patrick McFadin

Dave Bechberger returns to the show to discuss what you should be thinking about when adding graph databases to legacy applications, and to take us to school on being production ready.

Oct 16, 2018 • By: Adron Hall

In this episode Adron speaks with Jim Hatcher, and Jim tells us all about graph frames, fraud detection, and more. We also talk about some additional use cases and other interesting topics around where the industry is moving with regard to graph, analytics, and other business use cases that have really expanded the use of these multi-model databases.

Oct 11, 2018 • By: Adron Hall, Jonathan Lacefield

Jonathan Lacefield is working on efforts with DataStax Enterprise Graph, and today Adron speaks with him about graph while at San Francisco Graph Day 2018! We delve into super nodes and adjacency lists to start with. But eventually we move from what super nodes and adjacency lists give us to what we are doing and what we’re looking for with graph data and graph database solutions.