MaybeShewill-CV's Stars
xai-org/grok-1
Grok open release
hiyouga/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
meta-llama/llama3
The official Meta Llama 3 GitHub site
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
skyline75489/what-happens-when-zh_CN
What-happens-when 的中文翻译,原仓库 https://github.com/alex/what-happens-when
LargeWorldModel/LWM
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
mcinglis/c-style
My favorite C programming practices.
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
MzeroMiko/VMamba
VMamba: Visual State Space Models,code is based on mamba
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
verlab/accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
skypjack/meta
Header-only, non-intrusive and macro-free runtime reflection system in C++
google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
taskflow/work-stealing-queue
A fast work-stealing queue template in C++
peterWon/D-LIOM
Tightly-coupled Direct LiDAR-Inertial Odometry and Mapping Based on Cartographer3D.
THUDM/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
yardenfren1996/B-LoRA
Implicit Style-Content Separation using B-LoRA
wkcn/TinyCLIP
[ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance