Realtime Data Pipeline w Spark & Cassandra + Mesos (Rahul Kumar, Sigmoid)
Slides: https://www.slideshare.net/DataStax/realtime-data-pipeline-with-spark-streaming-and-cassandra-with-mesos-rahul-kumar-sigmoid-c-summit-2016 | Developing an end-to-end big data application right from data ingestion, data enrichment, and visualisation is a very cumbersome task. In this talk, I will demonstrate how to use Apache Mesos, Cassandra, Apache Spark and Docker to build a scalable, fault tolerant, responsive data platform. This talk is a collection of different recipe's that will help the developer to understand Mesos ecosystem projects and Apache Spark.Choosing the right technologies and tools during the development phase has a major impact on the success of the whole project. Apache Mesos provides the best cluster management system, Marathon gives the feature for long-running applications and Cassandra provides fully fault tolerance distributed data storage solution . About the Speaker Rahul Kumar Technical Lead, Sigmoid Rahul Kumar working as a Technical Lead with Sigmoid, He developed various real-time data analytics applications using Apache Hadoop, Mesos ecosystem projects, Akka and Apache Spark.