pikerbright

pikerbright's Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python56.1k 458 1325.8k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python37.3k 375 3185.9k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python26.1k 202 4.2k5.4k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.2k 157 1.5k2.2k
brendangregg/FlameGraph
Stack trace visualizer
Language:Perl17.4k 482 1502k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.6k 103 576883
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python11.1k 128 230807
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9.2k 95 402818
conan-io/conan
Conan - The open-source C and C++ package manager
Language:Python8.3k 133 10.7k981
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python6k 68 270520
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6k 52 605462
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.5k 44 147247
JingyunLiang/SwinIR
SwinIR: Image Restoration Using Swin Transformer (official repository)
Language:Python4.4k 52 150548
jarro2783/cxxopts
Lightweight C++ command line option parser
Language:C++4.2k 58 278592
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
Language:Python3.6k 36 20497
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Language:Python2.6k 58 753467
megvii-research/NAFNet
The state-of-the-art image restoration model without nonlinear activation functions.
Language:Python2.2k 21 145276
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.8k 22 144131
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Language:Python1.8k 54 132158
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
1.5k 46 476
cacay/MemoryPool
An easy to use and efficient memory pool allocator written in C++.
Language:C++1.2k 65 10410
microsoft/FaceSynthetics
822 30 060
kijai/ComfyUI-Florence2
Inference Microsoft Florence2 VLM
Language:Python741 5 8250
PeizeSun/TransTrack
Multiple Object Tracking with Transformer
Language:Python630 21 81109
foivospar/Arc2Face
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
Language:Python597 17 3043
NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Language:Python537 32 1944
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language:Python470 14 2830
AssafSinger94/dino-tracker
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
Language:Python416 12 3440
naver/multi-hmr
Pytorch demo code and models for Multi-HMR
Language:Python209 9 4320
KupynOrest/head_detector
Official repo for VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads.
Language:Python147 8 126