yassouali's Stars
comfyanonymous/ComfyUI
The most powerful and modular Stable Diffusion GUI, API, and backend, with a graph/nodes interface.
karpathy/llm.c
LLM training in simple, raw C/CUDA
HigherOrderCO/Bend
A massively parallel, high-level programming language
roboflow/supervision
We write your reusable computer vision tools. 💜
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it using GPT-4 or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama 3 with composable FSDP and PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment, and demo apps showcasing Meta Llama 3 for WhatsApp and Messenger.
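For a sense of the "composable PEFT" part, here is a minimal LoRA setup using Hugging Face PEFT; the checkpoint name, target modules, and hyperparameters are placeholder assumptions, not the repo's defaults.

```python
# Illustrative LoRA fine-tuning setup with Hugging Face PEFT; the checkpoint
# and hyperparameters below are placeholder assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_id = "meta-llama/Meta-Llama-3-8B"    # assumed (gated) checkpoint name
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

lora_config = LoraConfig(
    r=8,                                   # low-rank adapter dimension
    lora_alpha=32,                         # adapter scaling factor
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()         # only adapter weights are trainable
```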
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
intel-analytics/ipex-llm
Accelerate local LLM inference and fine-tuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPUs and GPUs (e.g., a local PC with an iGPU, or discrete GPUs such as Arc, Flex, and Max); integrates seamlessly with llama.cpp, Ollama, Hugging Face, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.
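A rough sketch of the low-bit loading path on an Intel GPU; the import path, the `load_in_4bit` flag, and the `"xpu"` device string follow the project's documented usage as I understand it, so treat them as assumptions.

```python
# Sketch only: load a model through ipex-llm's transformers-style wrapper with
# 4-bit weights and run it on an Intel GPU ("xpu"). Names are assumptions.
from ipex_llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"   # placeholder checkpoint
model = AutoModelForCausalLM.from_pretrained(model_id, load_in_4bit=True)
model = model.to("xpu")                      # Intel GPU device in PyTorch
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("What is an iGPU?", return_tensors="pt").to("xpu")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```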
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
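A minimal sketch of a zero/few-shot run through the harness's Python entry point; the model and task choices are arbitrary, and the `simple_evaluate` arguments are assumed from the documented usage.

```python
# Sketch: evaluate a small Hugging Face model on one task via lm-eval's
# Python API. Model, task, and batch size are arbitrary assumptions.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                      # Hugging Face backend
    model_args="pretrained=EleutherAI/pythia-160m",  # assumed small model
    tasks=["lambada_openai"],
    num_fewshot=0,
    batch_size=8,
)
print(results["results"])  # per-task metrics (accuracy, perplexity, ...)
```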
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
openai/transformer-debugger
pytorch/torchtune
A native PyTorch library for LLM fine-tuning
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
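A sketch of the frontend DSL, following the project's documented style; the endpoint URL and exact signatures are assumptions, and a running SGLang server is assumed.

```python
# Sketch of SGLang's frontend DSL for structured generation; assumes a local
# SGLang server is already running at the (placeholder) endpoint below.
import sglang as sgl

@sgl.function
def qa(s, question):
    s += sgl.user(question)
    s += sgl.assistant(sgl.gen("answer", max_tokens=64))

sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
state = qa.run(question="What is structured generation?")
print(state["answer"])
```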
cohere-ai/cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
sarah-ek/faer-rs
Linear algebra foundation for the Rust programming language
Modos-Labs/Glider
Open-source E-ink monitor. Mirror of https://gitlab.com/zephray/glider
Hirrolot/datatype99
Algebraic data types for C99
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference.
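A sketch of the quantization flow in the style of the project's examples; the model path, output directory, and quant_config values are assumptions.

```python
# Sketch: calibrate and pack a model to 4-bit AWQ with AutoAWQ, then save it.
# Paths and quant_config values are placeholder assumptions.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mistral-7B-v0.1"   # placeholder checkpoint
quant_path = "mistral-7b-awq"              # assumed output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

model.quantize(tokenizer, quant_config=quant_config)  # runs calibration, packs 4-bit weights
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```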
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
srush/Triton-Puzzles
Puzzles for learning Triton
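Not one of the repo's puzzles, just a minimal Triton kernel (vector add) to show the programming model the puzzles exercise; assumes a CUDA GPU and a Triton install.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements               # guard the ragged last block
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.randn(4096, device="cuda")
y = torch.randn(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
assert torch.allclose(out, x + y)
```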
MDK8888/GPTFast
Accelerate your Hugging Face Transformers models by 6-8.5x. Native to Hugging Face and PyTorch.
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
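The repo itself is CUDA; as a point of comparison, here is a plain PyTorch sketch of the same forward-pass math (tiled attention with an online softmax), single head, no masking. The tile size is arbitrary.

```python
import torch

def flash_attn_forward_reference(q, k, v, tile=64):
    # q, k, v: (seq_len, head_dim); single head, no masking, for clarity only.
    seq_len, d = q.shape
    scale = d ** -0.5
    out = torch.zeros_like(q)                      # running (unnormalized) output
    row_max = torch.full((seq_len, 1), float("-inf"))
    row_sum = torch.zeros(seq_len, 1)
    for start in range(0, seq_len, tile):
        kj, vj = k[start:start + tile], v[start:start + tile]
        scores = (q @ kj.T) * scale                # (seq_len, tile)
        new_max = torch.maximum(row_max, scores.max(dim=-1, keepdim=True).values)
        p = torch.exp(scores - new_max)
        correction = torch.exp(row_max - new_max)  # rescale stats from earlier tiles
        row_sum = row_sum * correction + p.sum(dim=-1, keepdim=True)
        out = out * correction + p @ vj
        row_max = new_max
    return out / row_sum

q, k, v = (torch.randn(128, 32) for _ in range(3))
ref = torch.softmax((q @ k.T) * (32 ** -0.5), dim=-1) @ v
assert torch.allclose(flash_attn_forward_reference(q, k, v), ref, atol=1e-5)
```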
IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups at batch sizes of up to 16-32 tokens.
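Not the kernel itself, but a plain PyTorch sketch of what an FP16xINT4 GEMM computes logically: dequantize group-quantized 4-bit weights and multiply. The fused kernel avoids ever materializing the full-precision weight matrix; shapes and group size here are arbitrary.

```python
import torch

def int4_dequant_matmul(x, w_int4, scales, group_size=128):
    # w_int4: (in_features, out_features) with integer values in [0, 15]
    # scales: (in_features // group_size, out_features), one scale per group
    in_features, out_features = w_int4.shape
    w = w_int4.float() - 8.0                               # symmetric zero-point
    w = w.view(in_features // group_size, group_size, out_features)
    w = (w * scales[:, None, :]).view(in_features, out_features)
    return x @ w                                           # the real kernel fuses this

x = torch.randn(16, 1024)                                  # a small batch of tokens
w_int4 = torch.randint(0, 16, (1024, 4096))
scales = torch.rand(1024 // 128, 4096) * 0.01
print(int4_dequant_matmul(x, w_int4, scales).shape)        # torch.Size([16, 4096])
```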
pytorch/ao
Native PyTorch library for quantization and sparsity
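A sketch of applying weight-only int8 quantization with torchao; the `quantize_`/`int8_weight_only` names follow the project's documented API, but treat the exact import path as an assumption.

```python
# Sketch: swap the Linear layers of a toy model to int8 weight-only quantized
# versions in place using torchao. Import path assumed from the docs.
import torch
from torchao.quantization import quantize_, int8_weight_only

model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
)
quantize_(model, int8_weight_only())       # in-place weight-only quantization
print(model(torch.randn(2, 1024)).shape)   # torch.Size([2, 1024])
```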
HazyResearch/aisys-building-blocks
Building blocks for foundation models.
razetime/ngn-k-tutorial
An ngn/k tutorial.