Go to content

Donald Miner - Hadoop with Python

"Speaker: Donald Miner In this tutorial, students will learn how to use Python with Apache Hadoop to store, process, and analyze incredibly large data sets. Hadoop has become the standard in distributed data processing, but has mostly required Java in the past. Today, there are a numerous open source projects that support Hadoop in Python and this tutorial will show students how to use them. Slides can be found at: https://speakerdeck.com/pycon2015 and https://github.com/PyCon/2015-slides"

April 8, 2015