gityihang

gityihang's Stars

changlin31/AutoProg-Zero
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
Language:Python1
jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频
Language:Python8.1k840
ZHO-ZHO-ZHO/ComfyUI-InstantID
Unofficial implementation of InstantID for ComfyUI
Language:Python1.4k80
stepfun-ai/Step-Video-T2V
Language:Python2.5k206
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Language:Python1.4k114
genmoai/mochi
The best OSS video generation models
Language:Python3k311
smthemex/ComfyUI_Sonic
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
Language:Python61752
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Language:Python2.7k409
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python20.5k1.4k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10.8k1k
pika/pika
Pure Python RabbitMQ/AMQP 0-9-1 client library
Language:Python3.7k847
deepseek-ai/DeepSeek-R1
84.2k10.9k
lmbxmu/SuperViT
Official Pytorch implementation of Super Vision Transformer (IJCV)
Language:Python438
csguoh/MambaIR
[ECCV2024, CVPR2025] MambaIR and MambaIRv2!
Language:Python60251
Xiaofeng-life/AwesomeDehazing
A collection of dehazing methods.
12511
YuHengsss/VSSD
Introduce Mamba2 to Vision.
Language:Python1177
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language:Python3.2k219
nachifur/MulimgViewer
MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
Language:Python1.2k107
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。
Language:Python27k2.1k
hejh8/CycleRDM
Language:Python71
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
Language:Python84640
wusongbai139/controlnet_train_webUI
利用diffusers编写的训练controlnet模型的项目，计划集成训练各种预训练模型的controlnet模型的方案。
Language:Python241
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python9.4k1k
ruke1ire/RTF
A State-Space Model with Rational Transfer Function Representation.
Language:Assembly773
G-U-N/PyCIL
PyCIL: A Python Toolbox for Class-Incremental Learning
Language:Python884138
MetaInsight7/Monitor-DeWatermark
Eliminating sensitive information from monitoring data
Language:Python104
MetaInsight7/guet-web
桂电监测校园网脚本，掉线自动重连
Language:Python2
MetaInsight7/MaskFaceTool
This project aims to add masks to the facial dataset, which is based on FMA-3D and constructs a effective, easy to operate, and efficient pipeline for facial detection, alignment, and mask wearing.
Language:Python181
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language:Python11.7k2k
state-spaces/mamba
Mamba SSM architecture
Language:Python14.1k1.2k