zhang-pengyu's Stars
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
JingyunLiang/SwinIR
SwinIR: Image Restoration Using Swin Transformer (official repository)
subeeshvasu/Awesome-Deblurring
A curated list of resources for Image and Video Deblurring
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
SJTU-ViSYS/M2DGR
M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)
jiawen-zhu/HQTrack
Tracking Anything in High Quality
4DVLab/Vision-Centric-BEV-Perception
Vision-Centric BEV Perception: A Survey
codingonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
DavidZhangdw/Visual-Tracking-Development
Visual Object Tracking
wangxiao5791509/Single_Object_Tracking_Paper_List
Paper list for single object tracking (State-of-the-art SOT trackers)
haochenheheda/segment-anything-annotator
We developed a python UI based on labelme and segment-anything for pixel-level annotation. It support multiple masks generation by SAM(box/point prompt), efficient polygon modification and category record. We will add more features (such as incorporating CLIP-based methods for category proposal and VOS methods for video datasets
tao-bai/attack-and-defense-methods
A curated list of papers on adversarial machine learning (adversarial examples and defense methods).
JinluZhang1126/MixSTE
Official implementation of CVPR 2022 paper(MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video)
ZhangYuanhan-AI/Bamboo
Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.
Zyj061/SpikeCV
An open-source framework for spike vision
hulianyuyy/CorrNet
Continuous Sign Language Recognition with Correlation Network (CVPR 2023)
ZechengLi19/Awesome-Sign-Language
Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick start your awesome work with us!! 🤟🤟🤟
wangdongdut/SOT-Learning
zhang-pengyu/DUT-VTUAV
Visible-Thermal UAV Tracking: A Large-Scale Benchmark (CVPR2022)
jiawen-zhu/TrackGPT
Tracking with Human-Intent Reasoning
EadCat/NIQA
No-reference Image Quality Assessment(NIQA) Algorithms (BRISQUE, NIQE, PIQE, RankIQA, MetaIQA)
byminji/SLTtrack
Official Implementation of Towards Sequence-Level Training for Visual Tracking (ECCV 2022)
Event-AHU/COESOT
A large-scale benchmark dataset for color-event based visual tracking
lawrence-cj/ARKitTrack
PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be released here.
Harry24k/MAIR
Fantastic Robustness Measures: The Secrets of Robust Generalization [NeurIPS 2023]
difhnp/MAT
code for 'Representation Learning for Visual Object Tracking by Masked Appearance Transfer'
wangdongdut/DeepLearning
tub-rip/eventvision2023