jjprincess

Pinned Repositories

AdaBound
An optimizer that trains as fast as Adam and as good as SGD.
Language:Python0 0 00
AdvancedEAST
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.
Language:Python0 0 00
AlgorithmnCode
Learning how to use git, And saved same Algorithmn Code
Language:C++0 0 00
AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Language:Python0 0 00
awesome-cpp
A curated list of awesome C/C++ frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
0 0 00
FGCrossNet_ACMMM2019
Source code of our ACM MM 2019 paper "A New Benchmark and Approach for Fine-grained Cross-media Retrieval".
Language:Python1 0 00
mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python1 0 00
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Language:Python1 0 00
PMTD
Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.
Language:Python48 3 038
pytorch-mask-rcnn
Language:Python1 1 00

jjprincess's Repositories

jjprincess/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python1 0 00
jjprincess/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Language:Python1 0 00
jjprincess/AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Language:Python0 0 00
jjprincess/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
1
jjprincess/bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
jjprincess/chatGPT-multimodal-bot
Language:Python0 0
jjprincess/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
jjprincess/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
jjprincess/Cream
This is a collection of our NAS and Vision Transformer work.
jjprincess/CSRA
Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"
Language:Python0 0
jjprincess/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language:Python0 0
jjprincess/disco-diffusion
Language:Jupyter Notebook0 0
jjprincess/dlrm
An implementation of a deep learning recommendation model (DLRM)
Language:Python0 0
jjprincess/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language:Python0 0
jjprincess/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook0 0
jjprincess/MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
Language:Python0 0
jjprincess/MobileModels
手机品牌型号汇总 | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0
0 0
jjprincess/OFA
Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Language:Python0 0
jjprincess/PartialLabelingCSL
Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"
Language:Python0 0
jjprincess/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Language:Python0 0
jjprincess/R2D2
jjprincess/SegFormer
Official PyTorch implementation of SegFormer
Language:Python0 0
jjprincess/t5-pegasus-chinese
基于GOOGLE T5中文生成式模型的摘要生成/指代消解，支持batch批量生成，多进程
Language:Python0 0
jjprincess/ts2_net
jjprincess/UEDVC
Language:Python0 0
jjprincess/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
jjprincess/Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
Language:Python0 0
jjprincess/VideoX
VideoX: a collection of video cross-modal models
Language:Python0 0
jjprincess/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python0 0
jjprincess/you-get
:arrow_double_down: Dumb downloader that scrapes the web