ScalaIO - Piotr Kolaczkowski - Lightning fast cluster computing with Cassandra and Spark
Apache Spark is a fast cluster computing engine, written mainly in Scala. Apache Cassandra is a distributed database system, written mainly in Java. The presentation will show you how we integrated them, using Scala. The connector we've built allows you to query Cassandra from complex, distributed, parallel Spark applications written in Scala. The talk will cover how the whole system works from the user-perspective, its high level architecture and some implementation details, as well as which Scala features were very useful, and which turned out to be problematic and why.
October 23, 2014