liujingqwq's Stars
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
liguodongiot/llm-action
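This project shares the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).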
315386775/DeepLearing-Interview-Awesome-2024
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, questions, resources, and projects that come up in work and research.
KwaiVGI/LivePortrait
Bring portraits to life!
mli/paper-reading
Paragraph-by-paragraph close readings of classic and new deep learning papers.
IMJONEZZ/NLP
A vast compendium of Natural Language Processing (NLP) knowledge from both a linguistics and a computer science perspective
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
ShiqiYu/OpenGait
A flexible and extensible framework for gait recognition. With OpenGait, you can focus on designing your own models and easily compare them with state-of-the-art methods.
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
tianweiy/DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
facebook/react
The library for web and native user interfaces.
kohya-ss/sd-scripts
Akegarasu/lora-scripts
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
dylran/crowddiff
VisDrone/VisDrone-Dataset
The dataset for drone based detection and tracking is released, including both image/video, and annotations.
sha2nkt/ipman-r
This is a code repository for training the IPMAN-R model and evaluating performance
pixelite1201/BEDLAM
PerceivingSystems/bedlam_render
BEDLAM (CVPR 2023) render pipeline tools
muelea/shapy
CVPR 2022 - Official code repository for the paper: Accurate 3D Body Shape Regression using Metric and Semantic Attributes.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long-context models
lyogavin/airllm
AirLLM 70B inference with a single 4GB GPU
xuxy09/SMPLer
"SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation", TPAMI 2024
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
liufeng2915/3DInvarReID
Learning Clothing and Pose Invariant 3D Shape Representation for Long-Term Person Re-Identification (ICCV 2023)