Pinned Repositories
AnomalyDetection
Anomaly Detection with R
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
confluent-spark-avro
Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
discord.py
An API wrapper for Discord written in Python.
druid
Apache Druid: a high-performance real-time analytics database.
edatemplates
R package containing R Markdown templates for exploratory data analyses
Face-and-Image-super-resolution
gcfboardr
A text dataset harvested from Green Climate Fund Board documents
kubeflow
Machine Learning Toolkit for Kubernetes
The-Elements-of-Statistical-Learning-Python-Notebooks
A series of Python Jupyter notebooks that help you better understand the book "The Elements of Statistical Learning"
ljodea's Repositories
ljodea/gcfboardr
A text dataset harvested from Green Climate Fund Board documents
ljodea/AnomalyDetection
Anomaly Detection with R
ljodea/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
ljodea/confluent-spark-avro
Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
ljodea/discord.py
An API wrapper for Discord written in Python.
ljodea/druid
Apache Druid: a high-performance real-time analytics database.
ljodea/edatemplates
R package containing R Markdown templates for exploratory data analyses
ljodea/Face-and-Image-super-resolution
ljodea/kubeflow
Machine Learning Toolkit for Kubernetes
ljodea/The-Elements-of-Statistical-Learning-Python-Notebooks
A series of Python Jupyter notebooks that help you better understand the book "The Elements of Statistical Learning"
ljodea/equilid
Socially-Equitable Language Identification
ljodea/example-rmd-templates
A selection of minimal templates used to highlight R Markdown templates, as referred to in the "R Markdown Definitive Guide"
ljodea/gobook
Exercises from "The Go Programming Language" by Donovan and Kernighan (2016)
ljodea/ljodea.github.io
My website
ljodea/mediapipe
MediaPipe is a cross-platform framework for building multimodal applied machine learning pipelines
ljodea/metorikku
A simplified, lightweight ELT Framework based on Apache Spark
ljodea/neo4j-kubernetes
ljodea/pipelines
Machine Learning Pipelines for Kubeflow
ljodea/proposal
Go Project Design Documents
ljodea/PyPDF2
A utility to read and write PDFs with Python
ljodea/RNeo4j
Neo4j Driver for R.
ljodea/rocker-versioned
Run current & prior versions of R using docker
ljodea/samza
Mirror of Apache Samza
ljodea/spark-corenlp
CoreNLP wrapper for Spark
ljodea/spark-redshift
Redshift data source for Spark
ljodea/youtube8m