z-w-wang's Stars
shannanyinxiang/PageNet
Official implementation of PageNet (IJCV 2022)
MCG-NJU/AWT
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
jinxiwang/ocr_TDR
好未来Feature Camp:中文手写汉字识别
facebookresearch/classifier-balancing
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
lizhe2004/chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
mem0ai/mem0
The Memory layer for your AI apps
datawhalechina/hugging-multi-agent
A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基于MetaGPT的多智能体入门与开发教程
state-spaces/mamba
Mamba SSM architecture
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
lucidrains/CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
LAION-AI/dalle2-laion
Pretrained Dalle2 from laion
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
LPengYang/FreeDrag
Official Implementation of FreeDrag (CVPR 2024)
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
zhangxuying1004/RefCOD
Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测) '
datawhalechina/daily-interview
Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star
mJackie/RecSys
计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估
hjbahng/visual_prompting
Exploring Visual Prompts for Adapting Large-Scale Models
zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
zhaoyue-zephyrus/TeSTra
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
huawei-noah/VanillaNet
dk-liang/CrowdCLIP
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
JiauZhang/DragGAN
Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold
VividLe/Online-Action-Detection
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.
amazon-science/long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
Artanic30/HOICLIP
CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models