Pinned Repositories
Awesome-Story-Visualization
Awesome-Text-to-Image
A Survey on Text-to-Image Generation/Synthesis.
CoIn
[ACM MM 2024] A fast and effective Story Visualization and Continuation Model
DE-Net
[AAAI 2023] Dynamic Text-guided Image Editing Adversarial Networks
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
DF-GAN
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
GALIP
[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
StoryImager
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
TextMatch
基于Pytorch的,中文语义相似度匹配模型(ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet)
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
tobran's Repositories
tobran/DF-GAN
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
tobran/GALIP
[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
tobran/StoryImager
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
tobran/DE-Net
[AAAI 2023] Dynamic Text-guided Image Editing Adversarial Networks
tobran/CoIn
[ACM MM 2024] A fast and effective Story Visualization and Continuation Model
tobran/Awesome-Story-Visualization
tobran/Awesome-Text-to-Image
A Survey on Text-to-Image Generation/Synthesis.
tobran/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
tobran/annotated_deep_learning_paper_implementations
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
tobran/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
tobran/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
tobran/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
tobran/BMInf
Low-cost Inference Package for Big Pretrained Language Models (PLMs)
tobran/CLIP-Guided-Diffusion
Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.
tobran/CLIP-ViL
tobran/CLIPasso
tobran/DenseCLIP
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
tobran/disco-diffusion
tobran/gansformer
Generative Adversarial Transformers
tobran/GLIGEN
Open-Set Grounded Text-to-Image Generation
tobran/intro_dgm
An Introduction to Deep Generative Modeling: Examples
tobran/lang-seg
Language-Driven Semantic Segmentation
tobran/MaskGIT-pytorch
Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
tobran/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
tobran/ONE-PIC
tobran/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
tobran/tobran
Config files for my GitHub profile.
tobran/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
tobran/visualization
a collection of visualization function
tobran/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch