tyleryzhu's Stars
EleutherAI/nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
MadcowD/ell
A language model programming library.
kaiyuyue/torchshard
Slicing a PyTorch Tensor Into Parallel Shards
kaiyuyue/nxtp
Object Recognition as Next Token Prediction (CVPR 2024)
esfrankel/torchtune
A Native-PyTorch Library for LLM Fine-tuning
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
immich-app/immich
High performance self-hosted photo and video management solution.
shashankvkt/DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video""
AILab-CVC/SEED-Bench
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
OpenGVLab/InternVideo2
young-geng/scalax
A simple library for scaling up JAX programs
keyvanakbary/learning-notes
Notes on books I read, talks I watch, articles I study, and papers I love
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
vikhyat/moondream
tiny vision language model
bair-climate-initiative/xT
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
TRI-ML/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
ruipeterpan/cos598d_sp24
LargeWorldModel/LWM
esfrankel/weak-to-strong
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
TRI-ML/vlm-evaluation
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
a6o/Slurm-Viewer
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
jonbarron/tabilize
Simple code for generating a color-coded latex table from raw data
mosaicml/composer
Supercharge Your Model Training