gityihang's Stars
changlin31/AutoProg-Zero
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
ZHO-ZHO-ZHO/ComfyUI-InstantID
Unofficial implementation of InstantID for ComfyUI
stepfun-ai/Step-Video-T2V
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
genmoai/mochi
The best OSS video generation models
smthemex/ComfyUI_Sonic
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
black-forest-labs/flux
Official inference repo for FLUX.1 models
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
pika/pika
Pure Python RabbitMQ/AMQP 0-9-1 client library
deepseek-ai/DeepSeek-R1
lmbxmu/SuperViT
Official Pytorch implementation of Super Vision Transformer (IJCV)
csguoh/MambaIR
[ECCV2024, CVPR2025] MambaIR and MambaIRv2!
Xiaofeng-life/AwesomeDehazing
A collection of dehazing methods.
YuHengsss/VSSD
Introduce Mamba2 to Vision.
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
nachifur/MulimgViewer
MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
hejh8/CycleRDM
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
wusongbai139/controlnet_train_webUI
利用diffusers编写的训练controlnet模型的项目,计划集成训练各种预训练模型的controlnet模型的方案。
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
ruke1ire/RTF
A State-Space Model with Rational Transfer Function Representation.
G-U-N/PyCIL
PyCIL: A Python Toolbox for Class-Incremental Learning
MetaInsight7/Monitor-DeWatermark
Eliminating sensitive information from monitoring data
MetaInsight7/guet-web
桂电监测校园网脚本,掉线自动重连
MetaInsight7/MaskFaceTool
This project aims to add masks to the facial dataset, which is based on FMA-3D and constructs a effective, easy to operate, and efficient pipeline for facial detection, alignment, and mask wearing.
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
state-spaces/mamba
Mamba SSM architecture