Xiaodongsuper's Stars
saermart/DouyinLiveWebFetcher
Scraper for danmaku (bullet-comment) data from the web version of Douyin live rooms (latest 2024 version)
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024×768 resolution).
wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
StarUniversus/gcmae
PyTorch implementation of GCMAE
JackAILab/ConsistentID
Customized, ID-consistent generation for humans
BillChan226/HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
Zheng-Chong/FashionMatrix
Fashion Matrix is dedicated to bridging various visual and language models and continuously refining its capabilities as a comprehensive fashion AI assistant. The project will continue to receive new features and optimizations.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.
mshumer/gpt-prompt-engineer
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
PaddlePaddle/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list of Vision Transformer/Attention, including papers, code, and related websites
Xiaodongsuper/Entity-Graph-Enhanced-Cross-Modal-Pretraining-for-Instance-level-Product-Retrieval
sagizty/VPT
Unofficial code for the VPT (Visual Prompt Tuning) paper, arXiv 2203.12119
zzr-idam/Interpretable-Pyramid-Network
Single UHD Image Dehazing via Interpretable Pyramid Network
zzr-idam/Under-Display-Camera-UAV
zzr-idam/Under-Display-Camera-Zoo
zzr-idam/4KDehazing
Xiaodongsuper/SCALE_code
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining (CVPR 2022)
Xiaodongsuper/M5Product_toolkit
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining (CVPR 2022). Dataset toolkit.
Xiaodongsuper/Adaptive-Collaborative-Similarity-Learning-for-Unsupervised-Multi-view-Feature-Selection
Adaptive Collaborative Similarity Learning for Unsupervised Multi-view Feature Selection (IJCAI 2018)
alcs417/L1-norm-Graph
A novel semi-supervised method for miRNA-disease association prediction
alcs417/AMVML
Xiaodongsuper/M5Product_dataset
M5Product Main Page.
j-min/VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
haohang96/bingo
Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022)
zhanxlin/Product1M
Product1M
willard-yuan/hashing-baseline-for-image-retrieval
Various hashing methods for image retrieval, intended to serve as baselines