tpulkit's Stars
langchain-ai/opengpts
Paperspace/ml-in-a-box
Machine learning tool-set for Paperspace VMs
npiv/chatblade
A CLI Swiss Army Knife for ChatGPT
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
noveens/distill_cf
[ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.
noveens/infinite_ae_cf
[ NeurIPS '22 ] ∞-AE model's implementation in JAX. Kernel-only method outperforms complicated SoTA models with a closed-form solution and a single hyper-parameter.
pytest-visual/pytest-visual
A visual testing framework for ML with automated change detection
instill-ai/instill-core
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
btaba/sinkhorn_knopp
python implementation of Sinkhorn-Knopp
google/snappy
A fast compressor/decompressor
JarekDuda/AsymmetricNumeralSystemsToolkit
Testing various methods for choosing tANS entropy coding automata
rygorous/ryg_rans
Simple rANS encoder/decoder (arithmetic coding-ish entropy coder).
NVIDIA-Merlin/Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
NVIDIA/cuCollections
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
rapidsai/cuml
cuML - RAPIDS Machine Learning Library
omron-sinicx/neural-astar
Official implementation of "Path Planning using Neural A* Search" (ICML-21)
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
karpathy/llama2.c
Inference Llama 2 in one file of pure C
trholding/llama2.c
Llama 2 Everywhere (L2E)
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
HaohanWang/HFC
Implementation for the paper (CVPR Oral): High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
zmtomorrow/NeLLoC