Johannes Moser - GitHub Archive - Processing and analyzing the billions of events generated by the w
The GitHub Archive project records and archives all the activity on the public GitHub timeline. At Crate.IO we thought this would be an ideal data set to work with in our demos and presentations. Of course, importing such a large and varied set of data wasn't likely to be easy. This presentation will cover: - Strategies for importing and storing big data sets - Strategies for querying big data sets effectively - Basic Data Visualization And of course, we'll spend some time playing with GitHub data to 'finally' solve the arguments about the most popular programming languages and frameworks...
February 29, 2016