Hryxyhe's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Stability-AI/generative-models
Generative Models by Stability AI
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
fonttools/fonttools
A library to manipulate font files from Python.
gjy3035/Awesome-Crowd-Counting
Awesome Crowd Counting
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
luo3300612/Visualizer
assistant tools for attention visualization in deep learning
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
leeyeehoo/CSRNet-pytorch
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
ShihaoZhaoZSH/Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
genforce/freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
LMD0311/PointMamba
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
clovaai/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).
LMD0311/DAPT
[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
pprp/Awesome-LLM-Prune
Awesome list for LLM pruning.
wooyeolBaek/attention-map
🚀 Cross attention map tools for huggingface/diffusers
LiheYoung/FreeMask
[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
zkawfanx/Atlantis
Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion (CVPR2024, Highlight)
dk-liang/CrowdCLIP
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
Jyouhou/Case-Sensitive-Scene-Text-Recognition-Datasets
This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.
Yuliang-Liu/Open-Oracle
AI-assisted Deciphering Oracle Bone Script
DYZhang09/ToC3D
[ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Pengjie-W/HUST-OBC
Oracle Bone Script data collected by VLRLab of HUST
Hryxyhe/MFH
This project is for MFH:Marrying Frequency Domain with Handwritten Mathematical Expression Recognition. We implement our method based on CoMER.