Pinned Repositories
2022-TCSVT-CANR
2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
3D-PointCloud
Papers and Datasets about Point Cloud.
3D-Reconstruction-based-on-RGB-D-camera
3D_point_organization
3D LiDAR Point Cloud Organized as Depth Map, Height Map and Surface Normal Map
awesome-fashion-ai
A repository to curate and summarise research papers related to fashion and e-commerce
LAG-Net
SegNet_Mobile
tripletloss
One Shot learning, Siamese networks and Triplet Loss with Keras
Yolov5-Flask-VUE
基于Flask开发后端、VUE开发前端框架,在WEB端部署YOLOv5目标检测模型
ahwhbc's Repositories
ahwhbc/2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
ahwhbc/Awesome-Scene-Text-Image-Super-Resolution
A collection of papers and resources on scene text image super-resolution.
ahwhbc/CloFormer
The official code of "Rethinking Local Perception in Lightweight Vision Transformer"
ahwhbc/color-peel
we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. By jointly learning on multiple color-shape images, we found that the method can successfully disentangle the color and shape concepts.
ahwhbc/Contrastive-Learning-NLP-Papers
Paper List for Contrastive Learning for Natural Language Processing
ahwhbc/darknet
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
ahwhbc/DDP-practice
A demo of image classification with PyTorch DDP (DistributedDataParallel) and amp (Automatic Mixed Precision) modules. TODO: Add English version
ahwhbc/FashionTex
The official implementation of SIGGRAPH 2023 conference paper, FashionTex: Controllable Virtual Try-on with Text and Texture.
ahwhbc/Fast-BEV
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
ahwhbc/Graphormer
Do Transformers Really Perform Bad for Graph Representation? [NIPS-2021]
ahwhbc/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
ahwhbc/MambaIR
A simple baseline for image restoration with state-space model.
ahwhbc/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
ahwhbc/MMIF-CDDFuse
[CVPR 2023] Official implementation for "CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion."
ahwhbc/mobile-vision
Mobile vision models and code
ahwhbc/OpenGait
A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.
ahwhbc/ovsam
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
ahwhbc/personal-paper-code-daily
🎓 Automatically Update Some Fields Papers Daily using Github Actions (Update Every 12th hours)
ahwhbc/PICR-Net_ACMMM23
ahwhbc/pix2pixHD
Synthesizing and manipulating 2048x1024 images with conditional GANs
ahwhbc/Point-cloud-quality-assessment
Collections of papers, databases, and codes targeted at point cloud quality assessment (PCQA), mesh quality assessment (MQA), 3D model quality assessment (3DQA).
ahwhbc/Qwen-7B
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
ahwhbc/qwen-sft
通义千问 SFT试验
ahwhbc/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
ahwhbc/SDT
This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).
ahwhbc/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
ahwhbc/SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
ahwhbc/ultralyticsPro
🔥🔥🔥专注于改进YOLOv8模型,NEW - YOLOv8 🚀 RT-DETR 🥇 in PyTorch >, Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀
ahwhbc/VTG-GPT
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
ahwhbc/Zero-shot-RIS
[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"