Zymrael's Stars
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
bloomberg/memray
Memray is a memory profiler for Python
arcee-ai/mergekit
Tools for merging pretrained large language models.
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
sumerc/yappi
Yet Another Python Profiler, but this time multithreading, asyncio and gevent aware.
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
EurekaLabsAI/ngram
The n-gram Language Model
DLYuanGod/TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
lilacai/lilac
Curate better data for LLMs
evo-design/evo
Biological foundation modeling from molecular to genome scale
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
stas00/the-art-of-debugging
The Art of Debugging
forhaoliu/ringattention
Transformers with Arbitrarily Large Context
lean-dojo/LeanDojo
Tool for data extraction and interacting with Lean programmatically.
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
pratyushasharma/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
HazyResearch/zoology
Understand and test language model architectures on synthetic tasks.
kabouzeid/turm
TUI for the Slurm Workload Manager
bremen79/parameterfree
Parameter-Free Optimizers for Pytorch
athms/mad-lab
A MAD laboratory to improve AI architecture designs 🧪
wangsiping97/FastGEMV
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
cloneofsimo/min-fsdp
advaitgosai/autocite
simple bibtex generator for any text with \cite{}