
Approximate Data for Small, Insightful Analytics (Ben Kornmeier, ProtectWise)

Slides: https://www.slideshare.net/DataStax/using-approximate-data-for-small-insightful-analytics-ben-kornmeier-protectwise-cassandra-summit-2016

Running a Cassandra cluster in AWS that can store petabytes of data can be costly. This talk details a novel approach: using approximate data structures to keep costs low while retaining insightful, up-to-date query results. It explores a number of real-world examples from our environment to demonstrate the power of approximate data: determining how many IP addresses are on a network, ranking IPs by traffic, and determining approximate minimums, maximums, and averages of values. It also covers how this data is laid out in Cassandra so that a query always returns up-to-date data without burdening the compactor.

About the Speaker

Ben Kornmeier, Engineer, ProtectWise

Ben is a Staff Engineer at ProtectWise. When he is not building realtime processing pipelines, he enjoys hiking, biking, and keeping his dog out of trouble.

July 26, 2016