KotlinConf 2018 - Building Data Science Workflows with Kotlin by Holger Brandl
Recording brought to you by American Express https://americanexpress.io/kotlin-jobs Kotlin's language design and its great tooling provide a wonderful framework for data science. Still evolving are libraries for convenient and kotlin-esque table manipulation and reporting. In this session I would like to present the design and features of krangl, which is a {K}otlin DSL for data w{rangl}ing. By mimicking well established concepts from pandas and R, it implements a grammar of data manipulation using a modern functional-style API. It allows to filter, transform, aggregate and reshape tabular data. Clearly static types are preferable when using Kotlin, but very often data is fluent and has no immediate type. To bridge this gap, krangl provides means to toggle between typed and untyped data. As an example, we will discuss how to compete at kaggle with workflows written in Kotlin. To facilitate that, we will use Jupyter to convert Kotlin scripts into HTML/notebooks. About the Presenter: Holger Brandl works as a data scientist at the Max Planck Institute of Molecular Cell Biology and Genetics (Dresden, Germany). He holds a Ph.D. degree in machine learning, and has developed new concepts in the field of computational linguistics. More recently he has co-authored publications in high-ranking journals such as Nature and Science. He is actively contributing to the Kotlin community by developing tools and libraries for bioinformatics, high-performance computing and data-science.