JeanKaddour's Stars
QuivrHQ/quivr
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
quarylabs/quary
Open-source BI for engineers
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
pretzelai/pretzelai
The modern replacement for Jupyter Notebooks
PySpur-Dev/PySpur
Graph-Based Editor for LLM Workflows
WecoAI/aideml
AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.
RobertTLange/evosax
Evolution Strategies in JAX 🦎
Titan-Systems/titan
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.
bigcode-project/bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
lucidrains/Adan-pytorch
Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch
VikParuchuri/libgen_to_txt
Convert all of libgen to high quality markdown
AnswerDotAI/cold-compress
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
JeanKaddour/NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
microsoft/csuite
CSuite: A Suite of Benchmark Datasets for Causality
luchris429/DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
UmerHA/triton_util
Make triton easier
aengusl/spawrious
JeanKaddour/LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
yihong-chen/ReFactorGNN
Implementation for ReFactor GNNs
FreddyHaas/anonymizedGPT
Open-source AI chatbot app that anonymizes personal information
dongmean/AL_vs_SubsetSelection
vzantedeschi/DAGuerreotype
source code of the paper: DAG LEARNING ON THE PERMUTAHEDRON
bioml-tools/bio-datasets
Bringing bio (molecules and more) to the Hugging Face Datasets library
aengusl/manipulation-chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
oscarkey/composite-tests
Kernel-based statistical tests to check if data is drawn from any distribution in a parametric family
oscarkey/typed-configs
Define configs using Python dataclasses and override them on the CLI