LonglongaaaGo
I am a Ph.D. student at the Ubiquitous Computing and Machine Learning Research Lab (UCML), Memorial University of Newfoundland.
Memorial University of Newfoundland, Canada
LonglongaaaGo's Stars
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
hwalsuklee/awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
zsyOAOA/DifFace
DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)
kongzhecn/OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
3DTopia/3DTopia
Text-to-3D Generation within 5 Minutes
IDKiro/sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
Grzego/handwriting-generation
Implementation of handwriting generation using recurrent neural networks in TensorFlow. Based on Alex Graves's paper (https://arxiv.org/abs/1308.0850).
YingqingHe/ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
HL-hanlin/Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
KU-CVLAB/GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
customdiffusion360/custom-diffusion360
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control
conallwang/MeGA
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
LeonHLJ/FouriScale
Official implementation of FouriScale (ECCV2024)
wooyeolBaek/attention-map
🚀 Cross attention map tools for huggingface/diffusers
Weifeng-Chen/ID-Aligner
Official implementation of ID-Aligner
yanivnik/sinfusion-code
YingqingHe/Shadow-Removal-via-Generative-Priors
[ACM MM 2021 Oral] Unsupervised Portrait Shadow Removal via Generative Priors
GuoLanqing/Self-Cascade
[ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
blackprotoss/GSDM
Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)
Sealical/anywhere-multi-agent
aimagelab/FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
KumapowerLIU/Awesome-LLMs-meet-Multimodal-Generation