SnowPye

PhD, CASIA

CASIABeijing, China

SnowPye's Stars

fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python6.4k784
LeapLabTHU/EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
Language:Python1888
wzk1015/Arxiv-Assistant
Automatically fetch daily arxiv papers, filter with GPT, and send you an email.
Language:Python21
NEU-DataMining/DailyPaper
By crawling the latest papers on arXiv with specified keywords using a web crawler, and then summarizing the content of the papers using chatgpt, we can compile and update the information.通过爬虫每日抓取arXiv上指定关键词的最新论文，然后使用chatgpt总结论文内容，汇总更新。
Language:Python176
SwinTransformer/Swin-Transformer-Semantic-Segmentation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
Language:Python1.1k223
SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Language:Python1.8k374
kailashahirwar/cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
15k3.4k
ziqi-jin/finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
Language:Python69351
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
Language:Python1.7k134
openai/transformer-debugger
Language:Python4k232
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python30.7k4.6k
SwinTransformer/Feature-Distillation
Language:Python22911
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python18.8k2.9k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python10.9k973
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Language:Jupyter Notebook2k143
MaybeShewill-CV/segment-anything-u-specify
using clip and sam to segment any instance you specify with text prompt of any instance names
Language:Python1569
JonathonLuiten/TrackEval
HOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).
Language:Python913225
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Language:Jupyter Notebook848418
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
Language:Python1.6k143
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language:Python7.1k669
Curated-Awesome-Lists/Awesome-Open-AI-Sora
Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from text. Explore a curated collection of articles, videos, podcasts, and news about Sora's capabilities, advancements, and more.
20414
xiaolincoder/CS-Base
图解计算机网络、操作系统、计算机组成、数据库，共 1000 张图 + 50 万字，破除晦涩难懂的计算机基础知识，让天下没有难懂的八股文！🚀 在线阅读：https://xiaolincoding.com
13.1k1.7k
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Language:Python2.1k228
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python5.6k592
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook14.1k1.3k
AILab-CVC/M2PT
[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Language:Python834
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language:Python3.1k349
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python136k25.9k
qianqianwang68/omnimotion
Language:Python2.1k121
csguide-dabai/Programmer-look-at-China
介绍**各二线以上城市的互联网环境以及生活成本
Language:Shell3.1k250

SnowPye

SnowPye's Stars

fudan-generative-vision/hallo

LeapLabTHU/EfficientTrain

wzk1015/Arxiv-Assistant

NEU-DataMining/DailyPaper

SwinTransformer/Swin-Transformer-Semantic-Segmentation

SwinTransformer/Swin-Transformer-Object-Detection

kailashahirwar/cheatsheets-ai

ziqi-jin/finetune-anything

autodistill/autodistill

openai/transformer-debugger

huggingface/pytorch-image-models

SwinTransformer/Feature-Distillation

lucidrains/vit-pytorch

PKU-YuanGroup/Open-Sora-Plan

yformer/EfficientSAM

MaybeShewill-CV/segment-anything-u-specify

JonathonLuiten/TrackEval

NirAharon/BoT-SORT

mit-han-lab/efficientvit

CASIA-IVA-Lab/FastSAM

Curated-Awesome-Lists/Awesome-Open-AI-Sora

xiaolincoder/CS-Base

IDEA-Research/DINO

IDEA-Research/GroundingDINO

IDEA-Research/Grounded-Segment-Anything

AILab-CVC/M2PT

CVHub520/X-AnyLabeling

AUTOMATIC1111/stable-diffusion-webui

qianqianwang68/omnimotion

csguide-dabai/Programmer-look-at-China