changhsinlee's Stars
wagoodman/dive
A tool for exploring each layer in a docker image
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
guidance-ai/guidance
A guidance language for controlling large language models.
rupa/z
z - jump around
google/or-tools
Google's Operations Research tools:
pycaret/pycaret
An open-source, low-code machine learning library in Python
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
TomWright/dasel
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
nicolashery/mac-dev-setup
A beginner's guide to setting up a development environment on macOS
pre-commit/pre-commit-hooks
Some out-of-the-box hooks for pre-commit
pythonprofilers/memory_profiler
Monitor Memory usage of Python code
piskvorky/smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
mljar/mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
HBNetwork/python-decouple
Strict separation of config from code.
tomerfiliba/plumbum
Plumbum: Shell Combinators
modAL-python/modAL
A modular active learning framework for Python
josephburnett/jd
JSON diff and patch
linkedin/greykite
A flexible, intuitive and fast forecasting library
skrub-data/skrub
Prepping tables for machine learning
beejjorgensen/bgnet
Beej's Guide to Network Programming source
koaning/human-learn
Natural Intelligence is still a pretty good idea.
ydataai/ydata-quality
Data Quality assessment with one line of code
NVIDIA/framework-reproducibility
Providing reproducibility in deep learning frameworks
matthewwardrop/formulaic
A high-performance implementation of Wilkinson formulas for Python.
machine-learning-apps/actions-ml-cicd
A Collection of GitHub Actions That Facilitate MLOps
koaning/simsity
Super Simple Similarities Service
davidthaler/Walmart_competition_code
This repo holds the code for the 1st place entry in the Walmart 2014 sales forecasting competition hosted on Kaggle.
dominodatalab/domino-research
Projects developed by Domino's R&D team
jonathandinu/causality-inference
Code and resources from Causal Inference in Data Science LiveLessons