bowen-upenn
Ph.D. candidate in Computer and Information Science
GRASP Lab, University of Pennsylvania
Philadelphia, United States
bowen-upenn's Stars
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
msracver/Deformable-ConvNets
Deformable Convolutional Networks
KaihuaTang/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
rstrudel/segmenter
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
4uiiurz1/pytorch-deform-conv-v2
PyTorch implementation of Deformable ConvNets v2 (Modulated Deformable Convolution)
OSU-NLP-Group/SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
wenbowen123/BundleTrack
[IROS 2021] BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models
jnhwkim/ban-vqa
Bilinear attention networks for visual question answering
taeho-kil/Document-Image-Dewarping
Document Image Dewarping
robinreni96/Font_Recognition-DeepFont
An implementation of DeepFont: Identify Your Font from an Image, using Keras
chenhaoxing/DiffUTE
This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).
DVLP-CMATERJU/RectiNet
A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping
waxnkw/IETrans-SGG.pytorch
This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".
mods333/energy-based-scene-graph
Code release for Energy-Based Learning for Scene Graph Generation
nickyisadog/latent-diffusion-inpainting
wz7in/CVPR2023-VLSAT
[CVPR 2023] VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
aioz-ai/CFR_VQA
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
simplify23/TPS_PP
Official Pytorch implementations of TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition (IJCAI 2023)
muktilin/NICE
[CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
denabazazian/scene_text_segmentation
Pytorch implementation for pixel-wise scene text segmentation based on DeepLabV3+
HemingwayLee/deepfont-implement
zzjun725/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”