epa095's Stars
ggerganov/llama.cpp
LLM inference in C/C++
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
microsoft/SandDance
Visually explore, understand, and present your data.
cpq/bare-metal-programming-guide
A bare metal programming guide (ARM microcontrollers)
tslearn-team/tslearn
The machine learning toolkit for time series analysis in Python
BayesWitnesses/m2cgen
Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
neo4j-labs/neosemantics
Graph+Semantics: Import/Export RDF from Neo4j. SHACL Validation, Model mapping and more.... If you like it, please ★ ⇧
cov-lineages/pangolin
Software package for assigning SARS-CoV-2 genome sequences to global lineages.
MrPowers/mack
Delta Lake helper methods in PySpark
databrickslabs/mosaic
An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.
milesgranger/cramjam
Your go-to for easy access to a plethora of compression algorithms, all neatly bundled in one simple installation.
wisecubeai/graphster
spark-based library that helps construct and query knowledge graphs from unstructured and structured data
aplbrain/grand-cypher
Implementation of the Cypher language for searching NetworkX graphs
mullerpeter/databricks-grafana
Grafana Databricks integration allowing direct connection to Databricks to query and visualize Databricks data in Grafana.
martinju/stromstotte
souvik-databricks/dlt-with-debug
A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT run and Non-DLT interactive notebook run.
cwida/duckdb-pgq
DuckDB is an in-process SQL OLAP Database Management System
DataTreehouse/chrontext
milesgranger/flaco
(PoC) A very memory-efficient way to read data from PostgreSQL
pgdr/ph
ph — the tabular data shell tool
bkkas/elhub-python-sdk
Non official ElHub API sdk for python
milesgranger/gilknocker
Is the GIL seeing someone else? How's about repetitively calling and seeing how long it takes to answer?
equinor/gordo-controller
Kubernetes controller for the Gordo CRD
equinor/gordo-core
Gordo core library
pgdr/cristin
flikka/flikka.github.io
Webpage content for datakunst.no
equinor/gordo-helm
Gordo Helm Charts
pgdr/emacs