dmizr

Research @ Apple

AppleCupertino, CA

dmizr's Stars

meta-llama/llama
Inference code for Llama models
Language:Python56.6k 526 1k9.6k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python37.5k 377 3186k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.3k 2.4k 01.7k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.3k 286 422.3k
ml-explore/mlx
MLX: An array framework for Apple silicon
Language:C++17.5k 149 5661k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook10k 99 667974
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.7k 77 563621
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language:Python7.2k 66 71552
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Language:Python5.4k 34 54330
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4.3k 116 83316
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.7k 31 261341
google/prompt-to-prompt
Language:Jupyter Notebook3.1k 25 84296
google-research/t5x
Language:Python2.7k 36 141309
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Language:Python2.4k 21 364248
apple/axlearn
An Extensible Deep Learning Library
Language:Python1.9k 63 16269
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
Language:Python1.2k 15 1538
mlfoundations/dclm
DataComp for Language Models
Language:HTML1.2k 38 63108
xl0/lovely-tensors
Tensors, for human consumption
Language:Jupyter Notebook1.1k 10 2216
apple/ml-aim
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
Language:Python938 19 548
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Language:Python842 7 3940
google/seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
Language:Python562 15 3158
epfLLM/Megatron-LLM
distributed trainer for LLMs
Language:Python545 18 5977
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Language:Python529 11 2033
google-deepmind/nanodo
Language:Python198 8 110
MatX-inc/seqax
seqax = sequence modeling + JAX
Language:Python134 7 210
graphcore-research/unit-scaling
A library for unit scaling in PyTorch
Language:Jupyter Notebook105 6 117
cloneofsimo/scaling-guide
WIP
Language:Python89 9 01
cloneofsimo/min-fsdp
Language:Python73 4 24
cloneofsimo/ezmup
Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam
Language:Python68 5 33
liuxingbin/dbot
[ICLR2024] Exploring Target Representations for Masked Autoencoders
Language:Python51 4 78

dmizr

dmizr's Stars

meta-llama/llama

karpathy/nanoGPT

karpathy/LLM101n

google-research/tuning_playbook

ml-explore/mlx

salesforce/LAVIS

facebookresearch/xformers

LargeWorldModel/LWM

google-research/arxiv-latex-cleaner

FoundationVision/VAR

rom1504/img2dataset

google/prompt-to-prompt

google-research/t5x

OFA-Sys/OFA

apple/axlearn

kakaobrain/coyo-dataset

mlfoundations/dclm

xl0/lovely-tensors

apple/ml-aim

LTH14/rcg

google/seqio

epfLLM/Megatron-LLM

penghao-wu/vstar

google-deepmind/nanodo

MatX-inc/seqax

graphcore-research/unit-scaling

cloneofsimo/scaling-guide

cloneofsimo/min-fsdp

cloneofsimo/ezmup

liuxingbin/dbot