Distributed Data Show Episode 80: Finding Bad Actors with Max Melnick | Apache Cassandra and DataStax Enterprise - DataStax Academy
Skip to main content
  • Learn
    • Learning Paths
    • Online Courses
    • Short Courses
    • Browse by Topic
    • Classroom Training
    • Public Training
    • Certifications
  • Distributed Data Show
  • Success Segments
  • Developer Blog
  • Meet The Contributors
  • Community
  • DATASTAX.COM HOME
  • ACADEMY HOME
  • DOCS

Download Datastax

Login or Sign Up
Home » Distributed Data Show » Distributed Data Show Episode 80: Finding Bad Actors with Max Melnick

Distributed Data Show Episode 80: Finding Bad Actors with Max Melnick

Jan 08, 2019
Jeff Carpenter
Teaser: 

In this episode Jeff talks with Max Melnick about how he got into analytics consulting with Deloitte (no, he's not an accountant), and how the Mission Graph capability Deloitte has built on top of DataStax Enterprise helps analysts leverage complex networks to detect financial fraud, terrorism, and even supply chain vulnerabilities.

Topics: 
DataStax Enterprise
Apache Cassandra

Highlights

0:15 -You told us you want to hear about use cases, so here we go!

0:58 - Getting to know Max - Virginia native, TensorFlow committer, Analytics at Deloitte

2:35 - The problem space for Mission Graph - identifying high-risk actors in networks aka "bad guys" to help analysts be more effective

3:53 - Mission Graph - solving not only the data problem of high volume, but the operational problems of connecting/correlating across data sets

5:35 - Mission Graph is a self-service platform for interactive network exploration and analysis. With current systems, most analyst time is spent on data engineering tasks. Mission Graph is trying to fix this.

7:58 - The technology stack includes Cassandra, DataStax Search, DataStax Graph. Lessons learned include: 1) avoiding network-based storage and doing regular health checks.

9:48 - 2) When onboarding new team members, give them initial tasks that correspond to their skill level.

11:17 - 3) When using Graph, use the DSE GraphFrames API for large volume data loading. Use Spark GraphFrames to take advantage of algorithms page rank, motif finding, etc.

13:28 - Wrapping up

Contact

DataStax Enterprise is powered by the best distribution of Apache Cassandra™.

©2019 DataStax, All rights reserved. DataStax, Titan, and TitanDB are registered trademark of DataStax, Inc. and its subsidiaries in the United States and/or other countries.

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.

DataStax.com     Docs     Privacy Policy     Terms of Use

text
CREATE NEW ACCOUNT