Pinned Repositories
AdaBound
An optimizer that trains as fast as Adam and as good as SGD.
AdvancedEAST
AdvancedEAST is an algorithm for scene image text detection. It is primarily based on EAST, with significant improvements that make long-text predictions more accurate.
AlgorithmnCode
Learning how to use Git, and saving some algorithm code.
AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
awesome-cpp
A curated list of awesome C/C++ frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
FGCrossNet_ACMMM2019
Source code of our ACM MM 2019 paper "A New Benchmark and Approach for Fine-grained Cross-media Retrieval".
mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
PMTD
Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.
pytorch-mask-rcnn
jjprincess's Repositories
jjprincess/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
jjprincess/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
jjprincess/AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
jjprincess/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
jjprincess/bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
jjprincess/chatGPT-multimodal-bot
jjprincess/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
jjprincess/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
jjprincess/Cream
This is a collection of our NAS and Vision Transformer work.
jjprincess/CSRA
Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"
jjprincess/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
jjprincess/disco-diffusion
jjprincess/dlrm
An implementation of a deep learning recommendation model (DLRM)
jjprincess/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
jjprincess/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
jjprincess/MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
jjprincess/MobileModels
Collection of mobile phone brands and models | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0
jjprincess/OFA
Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
jjprincess/PartialLabelingCSL
Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"
jjprincess/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
jjprincess/R2D2
jjprincess/SegFormer
Official PyTorch implementation of SegFormer
jjprincess/t5-pegasus-chinese
Summarization and coreference resolution based on Google's T5 Chinese generative model; supports batch generation and multiprocessing.
jjprincess/ts2_net
jjprincess/UEDVC
jjprincess/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
jjprincess/Video-Captioning
Video Captioning is an encoder-decoder model based on sequence-to-sequence learning
jjprincess/VideoX
VideoX: a collection of video cross-modal models
jjprincess/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
jjprincess/you-get
Dumb downloader that scrapes the web