/skience

Examples of Kotlin for Data Science usage

Primary LanguageJupyter NotebookMIT LicenseMIT

sKience

sKience is a demonstration project illustrating the utilization of a modern technology stack to facilitate the practice of data science with Kotlin effortlessly.

This repository contains the notebooks I used for my talk (in 🇫🇷) about Data science in Kotlin at Devoxx France 2024

Introduction

introduction notebook contains basic kotlin notebook features such as input/output and dependencies management.

Dataframe

Dataframe notebook show some features of both Dataframe and Kandy library to manipulate data

DuckDB

DuckDB is an analytic database that can be easily use with kotlin Dataframe in a kotlin notebook to act as interface of many data.

DuckDB and dataframe diagram

Duckdb Dataftame notebook show advanced usages of kotlin dataframe with duckDB.

To run this notebook you must run import data notebook. It will create a duckDB database file that contains New York yellow taxi trip for 5 years which represents about 10GB and 400 millions database rows

Docker

To run notebooks with jupyter in docker use docker compose up command from the base directory.