zfxxfeng's Stars
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
meituan/YOLOv6
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
WongKinYiu/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
CompVis/stable-diffusion
A latent text-to-image diffusion model
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
google/magritte
Mediapipe-based library to redact faces from videos and images
DingXiaoH/RepVGG
RepVGG: Making VGG-style ConvNets Great Again
PaddlePaddle/FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
leftthomas/CGD
A PyTorch implementation of CGD based on the paper "Combination of Multiple Global Descriptors for Image Retrieval"
MicPie/clasp
CLASP - Contrastive Language-Aminoacid Sequence Pretraining
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
Alibaba-MIIL/ML_Decoder
Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021)
facebookresearch/swav
PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882
wvangansbeke/Unsupervised-Classification
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
facebookresearch/moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
52CV/CV-Surveys
计算机视觉相关综述。包括目标检测、跟踪........
milvus-io/bootcamp
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
PaddlePaddle/VIMER
视觉预训练基础模型仓库
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
EBazarov/nsfw_data_source_urls
Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier