Before Getting Started

To make the most of this course, please download the recommended assets.

Assets

Training materials:
DS320 Virtual Machine Download (Includes Exercises)
DS320 Course Slides

About this Course

Course duration: 12 hours

In this course, you will learn how to solve analytical problems effectively and efficiently with Apache Spark™, Apache Cassandra™, and DataStax Enterprise. You will learn about the Spark API, the Spark-Cassandra Connector, Spark SQL, Spark Streaming, and crucial performance optimization techniques.

Introduction: Data Analytics
Introduction: Spark Architecture
Introduction: Spark Shell
Introduction: Web UI
Essentials: Hello World
Essentials: Resilient Distributed Datasets
Essentials: Three Ways to Create an RDD
Essentials: RDD Transformations
Essentials: RDD Actions
Connecting Spark: Reading Data From Cassandra
Connecting Spark: Processing Cassandra Data
Connecting Spark: Converting Cassandra Data
Connecting Spark: Saving Data Back to Cassandra
Optimization: Broadcast Variables
Optimization: Accumulator Variables
Optimization: RDD Persistence
Key/Value Pairs: Introduction to Pair RDDs
Key/Value Pairs: Aggregation
Key/Value Pairs: Grouping and Sorting
Key/Value Pairs: Joins
Key/Value Pairs: Set Operations
Tuning Partitioning: Understanding Partitioning
Tuning Partitioning: Partitioning Rules
Tuning Partitioning: Controlling Partitioning
Tuning Partitioning: Data Shuffling
Spark/Cassandra Connector: Count
Spark/Cassandra Connector: Group By Key
Spark/Cassandra Connector: Joining Tables
Spark/Cassandra Connector: Cassandra-Aware Partitioning
Spark Streaming: Discretized Stream
Spark Streaming: Architecture
Spark Streaming: First App
Spark Streaming: Stateless Transformations
Spark Streaming: Stateful Transformations
Spark Streaming: Window Transformation
Spark Streaming: Output Operations
Spark Streaming: Checkpointing and Recovery
Spark Streaming: Persistence
Spark Streaming: Controlling Parallelism
Spark SQL: Spark SQL Basics
Spark SQL: Creating DataFrames
Spark SQL: Accessing DataFrame Schema and Rows
Spark SQL: RDD Operations on DataFrames
Spark SQL: Language-Integrated Queries
Spark SQL: Saving DataFrames to Cassandra
Spark SQL: Querying Cassandra with SQL
Spark SQL: Writing Efficient SQL Queries