cfoster0's Stars
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
ml-explore/mlx
MLX: An array framework for Apple silicon
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
langroid/langroid
Harness LLMs with Multi-Agent Programming
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
protectai/ai-exploits
A collection of real world AI/ML exploits for responsibly disclosed vulnerabilities
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
EleutherAI/math-lm
VILA-Lab/ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Mozilla-Ocho/Memory-Cache
MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent
srush/annotated-mamba
Annotated version of the Mamba paper
hugoycj/Instant-angelo
Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!
togethercomputer/stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
HazyResearch/zoology
Understand and test language model architectures on synthetic tasks.
berlino/gated_linear_attention
KihoPark/linear_rep_geometry
GAIR-NLP/alignment-for-honesty
white-flame/eurisko
Doug Lenat's EURISKO from SAIL archives circa 1981
dvruette/barrel-rec-pytorch
sustcsonglin/gated_linear_attention_layer
thomasahle/json-gpt
Fast and simple library to get correct JSON output from GPT