LonglongaaaGo

I am a Ph.D. student at Ubiquitous Computing and Machine Learning Research Lab (UCML), Memorial University of Newfoundland.

Memorial University of NewfoundlandCanada

LonglongaaaGo's Stars

clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language:Jupyter Notebook3.7k 84 3901.1k
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
Language:Python3.2k 64 247966
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.8k 26 157270
hwalsuklee/awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
2.5k 151 13513
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Language:Python2.1k 43 363478
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language:Python1.4k 41 69114
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
844 58 1333
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Language:Jupyter Notebook748 7 3142
zsyOAOA/DifFace
DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)
Language:Python632 20 2144
kongzhecn/OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Language:Python621 12 1942
3DTopia/3DTopia
Text-to-3D Generation within 5 Minutes
Language:Python617 12 1242
IDKiro/sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
Language:Python599 26 1821
Grzego/handwriting-generation
Implementation of handwriting generation with use of recurrent neural networks in tensorflow. Based on Alex Graves paper (https://arxiv.org/abs/1308.0850).
Language:Python531 18 32107
YingqingHe/ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
Language:Python487 9 3029
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
Language:Python443 28 2316
HL-hanlin/Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Language:Python381 22 2316
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML313 11 416
KU-CVLAB/GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
Language:Python262 11 5531
customdiffusion360/custom-diffusion360
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Language:Python147 5 16
conallwang/MeGA
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
Language:Python141 10 220
LeonHLJ/FouriScale
Official implementation of FouriScale (ECCV2024)
Language:Python133 11 75
wooyeolBaek/attention-map
🚀 Cross attention map tools for huggingface/diffusers
Language:Python122 3 89
Weifeng-Chen/ID-Aligner
Official implement of ID-Aligner
117 18 12
yanivnik/sinfusion-code
Language:Python96 6 1213
YingqingHe/Shadow-Removal-via-Generative-Priors
[ACM MM 2021 Oral] Unsupervised Portrait Shadow Removal via Generative Priors
Language:Python74 10 711
GuoLanqing/Self-Cascade
[ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
59 8 10
blackprotoss/GSDM
Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)
Language:Python47 7 94
Sealical/anywhere-multi-agent
Language:Jupyter Notebook32 4 32
aimagelab/FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
Language:Python10 3 12
KumapowerLIU/Awesome-LLMs-meet-Multimodal-Generation
1

LonglongaaaGo

LonglongaaaGo's Stars

clovaai/deep-text-recognition-benchmark

Belval/TextRecognitionDataGenerator

xinyu1205/recognize-anything

hwalsuklee/awesome-deep-text-detection-recognition

MhLiao/DB

TencentARC/BrushNet

mayuelala/FollowYourClick

megvii-research/HiDiffusion

zsyOAOA/DifFace

kongzhecn/OMG

3DTopia/3DTopia

IDKiro/sdxs

Grzego/handwriting-generation

YingqingHe/ScaleCrafter

YingqingHe/LVDM

HL-hanlin/Ctrl-Adapter

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

KU-CVLAB/GaussianTalker

customdiffusion360/custom-diffusion360

conallwang/MeGA

LeonHLJ/FouriScale

wooyeolBaek/attention-map

Weifeng-Chen/ID-Aligner

yanivnik/sinfusion-code

YingqingHe/Shadow-Removal-via-Generative-Priors

GuoLanqing/Self-Cascade

blackprotoss/GSDM

Sealical/anywhere-multi-agent

aimagelab/FourBi

KumapowerLIU/Awesome-LLMs-meet-Multimodal-Generation