Hryxyhe

Hryxyhe's Stars

CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook68.6k 559 71610.2k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python26.5k 216 4.3k5.5k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.8k 258 3112.7k
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Language:Python23.2k 348 1.5k6.3k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python14.5k 109 1.1k1.2k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python9.6k 124 434904
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.2k 54 639481
fonttools/fonttools
A library to manipulate font files from Python.
Language:Python4.4k 118 1.7k455
gjy3035/Awesome-Crowd-Counting
Awesome Crowd Counting
2.4k 104 54479
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.4k 26 7570
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.4k 20 6856
luo3300612/Visualizer
assistant tools for attention visualization in deep learning
Language:Jupyter Notebook1k 3 2582
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Language:Python810 19 6266
leeyeehoo/CSRNet-pytorch
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Language:Jupyter Notebook657 26 98259
ShihaoZhaoZSH/Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Language:Python613 13 2642
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Language:Python531 11 4619
genforce/freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Language:Python444 26 1614
LMD0311/PointMamba
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
Language:Python369 6 3624
clovaai/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).
Language:Python211 8 1836
LMD0311/DAPT
[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
Language:Python178 2 66
pprp/Awesome-LLM-Prune
Awesome list for LLM pruning.
172 8 18
wooyeolBaek/attention-map
🚀 Cross attention map tools for huggingface/diffusers
Language:Python164 3 99
LiheYoung/FreeMask
[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Language:Python129 2 81
zkawfanx/Atlantis
Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion (CVPR2024, Highlight)
Language:Python78 8 148
dk-liang/CrowdCLIP
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
Language:Jupyter Notebook75 6 247
Jyouhou/Case-Sensitive-Scene-Text-Recognition-Datasets
This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.
49 3 09
Yuliang-Liu/Open-Oracle
AI-assisted Deciphering Oracle Bone Script
38 3 00
DYZhang09/ToC3D
[ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Language:Python37 2 23
Pengjie-W/HUST-OBC
Oracle Bone Script data collected by VLRLab of HUST
Language:Python32 1 31
Hryxyhe/MFH
This project is for MFH:Marrying Frequency Domain with Handwritten Mathematical Expression Recognition. We implement our method based on CoMER.
Language:Python5 1 20