liujingqwq

liujingqwq's Stars

facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook13.7k1.4k
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Language:Python17.1k1.2k
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.7k223
NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Language:Python2.8k224
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML13.1k1.5k
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目
1.9k186
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python13.7k1.5k
mli/paper-reading
深度学习经典、新论文逐段精读
27.9k2.5k
IMJONEZZ/NLP
A vast compendium of Natural Language Processing (NLP) knowledge from both a linguistics and a computer science perspective
Language:Jupyter Notebook83
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
Language:Python3248
ShiqiYu/OpenGait
A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.
Language:Python782176
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language:Python13k3.1k
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
Language:Python6k1.3k
tianweiy/DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Language:Python62234
facebook/react
The library for web and native user interfaces.
Language:JavaScript232k47.4k
kohya-ss/sd-scripts
Language:Python5.6k912
Akegarasu/lora-scripts
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
Language:Python4.8k595
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.2k1.6k
dylran/crowddiff
Language:Python10813
VisDrone/VisDrone-Dataset
The dataset for drone based detection and tracking is released, including both image/video, and annotations.
1.4k165
sha2nkt/ipman-r
This is a code repository for training the IPMAN-R model and evaluating performance
Language:Python796
pixelite1201/BEDLAM
Language:Python23121
PerceivingSystems/bedlam_render
BEDLAM (CVPR 2023) render pipeline tools
Language:Python1408
muelea/shapy
CVPR 2022 - Official code repository for the paper: Accurate 3D Body Shape Regression using Metric and Semantic Attributes.
Language:Python32847
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.5k4.6k
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k579
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language:Jupyter Notebook5.6k444
xuxy09/SMPLer
"SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation", TPAMI 2024
Language:Python766
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python11.2k2.4k
liufeng2915/3DInvarReID
Learning Clothing and Pose Invariant 3D Shape Representation for Long-Term Person Re-Identification (ICCV 2023)
Language:Python20