DataStax Developer Blog

Get the latest developer news and updates! The DataStax Developer blog is a great resource to keep up to date on the latest!

Subscribe for weekly updates!

Subscribe to RSS feed

E.g., 07/21/2019
E.g., 07/21/2019
Jul 08, 2019 • By: Brian Hess

The first three blog posts in this series dealt with data loading (herehere, and here), and the fourth blog post (here) dealt with data unloading.  This blog will deal with the count mode for dsbulk.

Something new in dsbulk version 1.1.0 is the ability to count the data in a table.  This is a common task that folks do once the load data to ensure that the data was loaded correctly.  It could be that a load failed midway and they would like to see how far it got. It could be that the primary key was not what was desired, and instead of unique inserts it became overwrites.

Jul 08, 2019 • By: Jeff Carpenter

This is the latest installment of a series about building a Python application with Apache Cassandra — specifically a Python implementation of the KillrVideo microservice tier. In previous posts I shared what motivated this project, how I started with infrastructure including GRPC and Etcd, the testing approach, and most recently, how I began implementing data access using Cassandra.

In this post we’ll look at some additional examples of data access using the DataStax Python Driver, ranging from the simple to the complex. (I’ll be making reference some driver / mapper concepts from the previous post so you may want to review that if you haven’t already.)

Jun 07, 2019 • By: Brian Hess

In the previous 3 blog posts (herehere, and here), we covered some loading examples, and covered some of the common options, such as logging and connection details.  In this blog post, we will turn our attention to unloading.

May 31, 2019 • By: Adron Hall

Some notes along with this talk. Which is about ways to mitigate super nodes, partitioning strategies, and related efforts. Jonathan’s talk is vendor neutral, even though he works at DataStax. Albeit that’s not odd to me, since that’s how we roll at DataStax anyway. We take pride in working with DSE but also with knowing the various products out there, as things are, we’re all database nerds after all. (more below video)

 

May 14, 2019 • By: Scott Hendrickson

We recently announced availability of DataStax Distribution of Apache Cassandra on Microsoft Azure aimed at helping developers take advantage of a simpler, more cost-effective way to build and scale applications. We wanted to take our experience with large scale deployments on Azure for over 5 years and deliver an environment with best practices built-in so you can focus on your business. With that in mind, let’s jump into what you need to know to build an app.

May 13, 2019 • By: Russell Spitzer

Apache Spark! Darling of the Big Data world and the easiest entry point into Machine Learning. It's fast, it's cool, and it's hip. So many great things, but should you be using it in your stack?

May 10, 2019 • By: Jeff Carpenter

Over the first few posts of this series, I’ve been sharing about my experience building a Python implementation of the KillrVideo microservice tier. In the previous posts I shared why I started this project, about building GRPC service stubs, advertising the endpoints in etcd, and setting up integration tests to exercise the service APIs.

May 08, 2019 • By: Jeff Carpenter

After 2+ years of hard work from the community, Apache Cassandra 4.0 is in the final stages of testing for official release. As the biggest gathering of Cassandra professionals since the last major release (3.0), DataStax Accelerate represents a great opportunity to get caught up on what’s been happening in the project - straight from the committers and experts implementing these changes and the power users who are pushing the limits of this highly scalable database even further.

Apr 30, 2019 • By: Brian Hess

This is the third blog post about dsbulk.  The first two blog posts (here and here) covered some basic loading examples.  This post will delve into some of the common options to load, unloading, and counting.

Apr 22, 2019 • By: Cristina Veale, Eric Zietlow

As developers, knowledge is the main thing that holds us back.  The phrase “the more you know” has never been truer. From learning new languages and technologies to better understanding the theory around them, we must expand our knowledge base in order to stay relevant in today's industry.  There are many classes we can take and blogs we can read, but nothing ever comes close to real, hands-on experience working with new technologies. Given the overwhelming demand, bootcamps and hands-on instruction are in short supply.