sudo-Boris's Stars
faif/python-patterns
A collection of design patterns/idioms in Python
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
reflex-dev/reflex
🕸️ Web apps in pure Python 🐍
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
facebookresearch/detr
End-to-End Object Detection with Transformers
state-spaces/mamba
Mamba SSM architecture
mmistakes/minimal-mistakes
:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.
huggingface/trl
Train transformer language models with reinforcement learning.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
humanloop/awesome-chatgpt
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
arifszn/gitprofile
🚀 Create and deploy a dynamic portfolio by just providing your GitHub username.
glotlabs/gdrive
Google Drive CLI Client
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
sangminwoo/awesome-vision-and-language
A curated list of awesome vision and language resources (still under construction... stay tuned!)
composable-models/llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
kvfrans/jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
sming256/OpenTAD
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
gauss5930/LLM-Agora
LLM Agora, debating between open-source LLMs to refine the answers
StanfordVL/atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
KyawHtetWin/chip-huyen-ml-interview-book-solutions
Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT
showlab/mist
timherzig/decomposition_learning
HTCV Project