tianruochen's Stars
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
bcmi/SLBR-Visible-Watermark-Removal
[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
WangRongsheng/pytorch-classification
利用pytorch实现图像分类的一个完整的代码,训练,预测,TTA,模型融合,模型部署,cnn提取特征,svm或者随机森林等进行分类,模型蒸馏,一个完整的代码
zht8506/UniQA
This is the repository for paper UniQA
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
ArronAI007/Awesome-AGI
AGI资料汇总学习(主要包括LLM和AIGC),持续更新......
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Westlake-AI/SemiReward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
cw1091293482/Deep-Incremental-Image-Retrieval
incremental learning for fine-grained image retrieval via feature estimation
AndresPMD/GCN_classification
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
aassxun/SEMICON
cty8998/SIRL-QAConv
[ACMMM 2023] Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval
facebookresearch/ViewDiff
ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
wusuozhi/stable-diffusion-learning
justinpinkney/stable-diffusion
HUSTAI/uie_pytorch
PaddleNLP UIE模型的PyTorch版实现
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
mli/paper-reading
深度学习经典、新论文逐段精读
contr4l/SimilarCharacter
对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Syliz517/CLIP-ReID
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)