SnowPye's Stars
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
LeapLabTHU/EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
wzk1015/Arxiv-Assistant
Automatically fetch daily arxiv papers, filter with GPT, and send you an email.
NEU-DataMining/DailyPaper
By crawling the latest papers on arXiv with specified keywords using a web crawler, and then summarizing the content of the papers using chatgpt, we can compile and update the information.通过爬虫每日抓取arXiv上指定关键词的最新论文,然后使用chatgpt总结论文内容,汇总更新。
SwinTransformer/Swin-Transformer-Semantic-Segmentation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
kailashahirwar/cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
ziqi-jin/finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
openai/transformer-debugger
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
SwinTransformer/Feature-Distillation
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
MaybeShewill-CV/segment-anything-u-specify
using clip and sam to segment any instance you specify with text prompt of any instance names
JonathonLuiten/TrackEval
HOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Curated-Awesome-Lists/Awesome-Open-AI-Sora
Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from text. Explore a curated collection of articles, videos, podcasts, and news about Sora's capabilities, advancements, and more.
xiaolincoder/CS-Base
图解计算机网络、操作系统、计算机组成、数据库,共 1000 张图 + 50 万字,破除晦涩难懂的计算机基础知识,让天下没有难懂的八股文!🚀 在线阅读:https://xiaolincoding.com
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
AILab-CVC/M2PT
[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
qianqianwang68/omnimotion
csguide-dabai/Programmer-look-at-China
介绍**各二线以上城市的互联网环境以及生活成本