The Distributed Data Show

The Distributed Data Show is your weekly source for the latest news and technical expertise to help you succeed in building large-scale distributed systems. Brought to you by the DataStax Developer Advocate team, we go in-depth with DataStax engineers and special guests from the broader data community. New episodes each Tuesday.

Subscribe for weekly updates! Also available on YouTube, iTunes, and your favorite podcast provider.

Dec 19, 2017 • By: Patrick McFadin

DuyHai Doan and Patrick McFadin explain the two primary ways of distributing data used in computer science, hash-based partitioning and range-based partitioning, and the implications of each for operations (hint: rebalancing!).
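
For readers new to the two approaches, here is a minimal Python sketch of the difference. It is an illustration only and is not taken from the episode: the function names, the MD5-based hash, the modulo placement, and the split points are all assumptions chosen for brevity. Production systems typically use consistent hashing or token ranges rather than a plain modulo, precisely to limit how much data moves when nodes are added.

```python
import bisect
import hashlib


def hash_partition(key: str, num_nodes: int) -> int:
    """Hash-based placement (hypothetical helper): hash the key, then
    take it modulo the node count. Keys spread evenly, but neighbouring
    keys land on unrelated nodes, so range scans touch every node."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def range_partition(key: str, split_points: list[str]) -> int:
    """Range-based placement (hypothetical helper): split_points is a
    sorted list of boundaries. Keys below the first boundary go to
    node 0, keys from the first boundary up to the second go to node 1,
    and so on. Neighbouring keys stay together, but a skewed key
    distribution creates hot nodes."""
    return bisect.bisect_right(split_points, key)


def moved_fraction(keys: list[str], old_nodes: int, new_nodes: int) -> float:
    """Fraction of keys whose node changes when the cluster is resized
    under the naive modulo scheme above."""
    moved = sum(
        1
        for k in keys
        if hash_partition(k, old_nodes) != hash_partition(k, new_nodes)
    )
    return moved / len(keys)


if __name__ == "__main__":
    keys = [f"user{i}" for i in range(1000)]
    print("range node for 'kiwi':", range_partition("kiwi", ["h", "p"]))
    print("hash node for 'kiwi':", hash_partition("kiwi", 4))
    print("moved when growing 4 -> 5 nodes:", moved_fraction(keys, 4, 5))
```

Running the script shows the rebalancing cost of the naive hash scheme: growing from four to five nodes relocates most keys, whereas a range scheme only needs to split or move the affected ranges.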

Dec 12, 2017 • By: David Gilardi, Patrick McFadin

Patrick McFadin talks with David Gilardi about the new recommendation engine recently added to the KillrVideo reference application using DSE Graph and Java DSLs (domain-specific languages).

Dec 05, 2017 • By: Luke Tillman

Luke Tillman and DuyHai Doan talk about why pre-aggregation is difficult on an eventually consistent database like Apache Cassandra, and debate whether the storage engine in Cassandra should be made pluggable.

Dec 01, 2017 • By: David Gilardi

David Gilardi talks with Kiyu Gabriel and Darla Baker about the DataStax Managed Cloud and gets some details on the advantages of using a managed service for your distributed database.

Nov 21, 2017 • By: Kathryn Erickson, Jeff Carpenter

Kat Erickson announces the availability of official Docker images of DataStax Enterprise and explains why you should use them.

Nov 14, 2017 • By: Jeff Carpenter

DuyHai Doan shares his advice on debugging graph traversals using the Gremlin query language, including how to identify and fix performance bottlenecks and his thoughts on the “supernode” challenge.

Nov 07, 2017 • By: Stephen Mallette, David Gilardi

David Gilardi talks with Stephen Mallette about domain-specific languages for graph databases in Java, when you should use a DSL, and some of the implementation details you’ll want to know to succeed.

Oct 31, 2017 • By: Jeff Carpenter, Luke Tillman, Patrick McFadin

Patrick McFadin, Luke Tillman and Jeff Carpenter sit around the campfire telling Cassandra data modeling horror stories, dad jokes and the occasional spooky noise.

Oct 24, 2017 • By: Wei Deng, Jeff Carpenter, Luke Tillman

Wei Deng talks about the challenges involved in securing distributed databases, the latest security features in DataStax Enterprise, and recommended techniques to help you and your company stay out of the headlines.

Oct 17, 2017 • By: Jeff Carpenter

DuyHai Doan takes us inside of Apache Kudu, a data store designed to support fast access for analytics in the Hadoop ecosystem. We compare Kudu’s architecture with Apache Cassandra and discuss why effective design patterns for distributed systems show up again and again.