Zymrael

Numerics, model architecture, scaling @ Liquid AI

Stanford University

Zymrael's Stars

unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language: Python | 16.1k 110 8261.1k
bloomberg/memray
Memray is a memory profiler for Python
Language: Python | 13.2k 59 194392
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language: Python | 4.6k 50 297404
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language: Python | 3.2k 28 131279
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
Language: Python | 1.8k 30 0103
sumerc/yappi
Yet Another Python Profiler, but this time multithreading, asyncio and gevent aware.
Language: Python | 1.5k 15 7872
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language: Python | 1.4k 13 413108
EurekaLabsAI/ngram
The n-gram Language Model
Language: C | 1.3k 48 092
DLYuanGod/TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Language: Python | 1.2k 19 3375
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language: Cuda | 1.2k 16 105109
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language: Python | 1.1k 39 75107
lilacai/lilac
Curate better data for LLMs
Language: Python | 937 13 29288
evo-design/evo
Biological foundation modeling from molecular to genome scale
Language: Jupyter Notebook | 931 18 50111
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
Language: Python | 888 19 11466
stas00/the-art-of-debugging
The Art of Debugging
Language: C | 798 16 031
forhaoliu/ringattention
Transformers with Arbitrarily Large Context
Language: Python | 619 6 1648
lean-dojo/LeanDojo
Tool for data extraction and interacting with Lean programmatically.
Language: Python | 545 13 6183
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Language: Python | 460 10 421
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Language: Jupyter Notebook | 426 5 2757
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Language: Python | 361 7 1134
pratyushasharma/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Language: Python | 361 22 2226
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Language: Cuda | 345 10 10926
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
Language: Python | 227 3 114
HazyResearch/zoology
Understand and test language model architectures on synthetic tasks.
Language: Python | 157 14 1926
kabouzeid/turm
TUI for the Slurm Workload Manager
Language: Rust | 121 3 134
bremen79/parameterfree
Parameter-Free Optimizers for Pytorch
Language: Python | 105 5 14
athms/mad-lab
A MAD laboratory to improve AI architecture designs 🧪
Language: Python | 85 1 25
wangsiping97/FastGEMV
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
Language: Cuda | 81 6 43
cloneofsimo/min-fsdp
Language: Python | 68 3 24
advaitgosai/autocite
simple bibtex generator for any text with \cite{}
Language: JavaScript | 31 1 02
