tianruochen

tianruochen's Stars

lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language:Python11.2k1.1k
bcmi/SLBR-Visible-Watermark-Removal
[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement
Language:Python22536
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML12.2k1.3k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.6k514
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.8k486
WangRongsheng/pytorch-classification
利用pytorch实现图像分类的一个完整的代码，训练，预测，TTA，模型融合，模型部署，cnn提取特征，svm或者随机森林等进行分类，模型蒸馏，一个完整的代码
Language:Jupyter Notebook264
zht8506/UniQA
This is the repository for paper UniQA
Language:Python14
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.2k148
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python7.5k1.2k
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Language:Jupyter Notebook74146
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
6.2k858
ArronAI007/Awesome-AGI
AGI资料汇总学习（主要包括LLM和AIGC），持续更新......
Language:Jupyter Notebook31926
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目
1.9k179
Westlake-AI/SemiReward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
Language:Python622
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Language:Python1.4k182
cw1091293482/Deep-Incremental-Image-Retrieval
incremental learning for fine-grained image retrieval via feature estimation
Language:Python9
AndresPMD/GCN_classification
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Language:Python6411
aassxun/SEMICON
Language:Python314
cty8998/SIRL-QAConv
[ACMMM 2023] Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval
Language:Python5
facebookresearch/ViewDiff
ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).
Language:Python34222
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook10.1k980
wusuozhi/stable-diffusion-learning
Language:Jupyter Notebook163
justinpinkney/stable-diffusion
Language:Jupyter Notebook1.5k272
HUSTAI/uie_pytorch
PaddleNLP UIE模型的PyTorch版实现
Language:Python603101
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
95227
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language:Jupyter Notebook9.7k770
mli/paper-reading
深度学习经典、新论文逐段精读
27.6k2.5k
contr4l/SimilarCharacter
对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字
Language:Python443135
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook3k281
Syliz517/CLIP-ReID
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
Language:Python30747