Xiaoyang-Wang's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
karpathy/llm.c
LLM training in simple, raw C/CUDA
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
triton-lang/triton
Development repository for the Triton language and compiler
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
mosaicml/composer
Supercharge Your Model Training
alirezadir/Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
openai/transformer-debugger
youssefHosni/Data-Science-Interview-Questions-Answers
Curated list of data science interview questions and answers
karpathy/ng-video-lecture
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
openai/weak-to-strong
davda54/sam
SAM: Sharpness-Aware Minimization (PyTorch)
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
patx/pickledb
pickleDB is an open source key-value store using Python's json module.
volcengine/veScale
A PyTorch Native LLM Training Framework
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
CarperAI/DRLX
Diffusion Reinforcement Learning Library
GFNOrg/gfn-lm-tuning
Leooyii/LCEG
Long Context Extension and Generalization in LLMs
BIT-DA/DUC
[ICLR 2023 Spotlight] Code release for "Dirichlet-based Uncertainty Calibration for Active Domain Adaptation"