LonglongaaaGo
I am a Ph.D. student at Ubiquitous Computing and Machine Learning Research Lab (UCML), Memorial University of Newfoundland.
Memorial University of NewfoundlandCanada
LonglongaaaGo's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
lllyasviel/IC-Light
More relighting!
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
meijieru/crnn.pytorch
Convolutional recurrent network in pytorch
sirfz/tesserocr
A Python wrapper for the tesseract-ocr API
ayumiymk/aster.pytorch
ASTER in Pytorch
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
Kevin-thu/DiffMorpher
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
yzhang2016/video-generation-survey
A reading list of video generation
ID-Animator/ID-Animator
yikaiw/Vidu4D
[NeurIPS 2024] Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
ee227c/ee227c.github.io
EE227C (Spring 2018) Course page
ZYM-PKU/UDiffText
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
zqh0253/3DitScene
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
oneThousand1000/Portrait3D
(SIGGRAPH 2024) Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior
yashkant/spad
Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024
h/pytesseract
Python-tesseract is an optical character recognition (OCR) tool for python
UCSB-NLP-Chang/CoPaint
Implementation of paper 'Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models'
UCSC-VLAA/CRATE-alpha
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
Cameltr/TransRef
Code and datasets of paper《TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting》
Cameltr/RGTSI
Code and datasets of ICIP 2022 paper 'Reference-Guided Texture and Structure Inference for Image Inpainting'
oneThousand1000/3DPortraitGAN
(Preprint) 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs from a Single-View Portrait Dataset with Diverse Body Poses
liujianzhi/EchoReel
An innovative method designed to augment the capabilities of existing video diffusion models
jasongzy/EG4D
Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation"
LoYuXr/CalliRewrite
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images Without Supervision (ICRA 2024)
zhenglab/TransCNN-HAE
mavillot/FUNSD-Information-Extraction