xhyandwyy

Multimodal mPLUG.

Alibaba DAMO Academy, Hangzhou, China

xhyandwyy's Stars

open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language: Python · 29.9k 372 8.4k 9.5k
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language: Python · 21.2k 156 2693.1k
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language: Python · 12.2k 105 3.7k 3k
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
Language: Python · 10.8k 229 901.9k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python · 9.1k 134 1.1k 1.4k
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language: Python · 8.1k 98 1.7k 998
karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
Language: Jupyter Notebook · 5.5k 274 1871.3k
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language: Jupyter Notebook · 5.1k 117 5621.4k
huggingface/notebooks
Notebooks using the Hugging Face libraries 🤗
Language: Jupyter Notebook · 3.8k 74 1711.6k
facebookresearch/SentEval
A python tool for evaluating the quality of sentence embeddings.
Language: Python · 2.1k 46 59310
whai362/PVT
Official implementation of PVT series
Language: Python · 1.8k 23 111247
facebookresearch/LAMA
LAnguage Model Analysis
Language: Python · 1.4k 71 48184
tylin/coco-caption
Language: Jupyter Notebook · 1.1k 24 56546
joe-siyuan-qiao/DetectoRS
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution
Language: Python · 1.1k 37 94175
lukemelas/PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
Language: Python · 802 10 30127
facebookresearch/GENRE
Autoregressive Entity Retrieval
Language: Python · 774 19 96103
ryouchinsa/Rectlabel-support
RectLabel is an offline image annotation tool for object detection and segmentation.
Language: Python · 516 18 26073
dddzg/up-detr
[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
Language: Python · 477 13 3271
hemingkx/ChineseNMT
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
Language: Python · 459 6 1690
MILVLG/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
Language: Python · 447 6 3888
LuoweiZhou/VLP
Vision-Language Pre-training for Image Captioning and Question Answering
Language: Python · 417 19 4662
huggingface/nn_pruning
Prune a model while finetuning or training.
Language: Jupyter Notebook · 396 47 2460
pzzhang/VinVL
project page for VinVL
350 9 4125
lucidrains/transformer-in-transformer
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch
Language: Python · 305 13 844
layumi/Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding 🐾 https://arxiv.org/abs/1711.05535
Language: MATLAB · 287 12 1873
VITA-Group/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Language: Python · 208 16 1442
aftix/bacon
Scientific Computing in Rust
Language: Rust · 186 4 38
okankop/MFF-pytorch
Motion Fused Frames implementation in PyTorch, codes and pretrained models.
Language: Python · 131 6 1433
berniebear/Multi-HT100M
52 8 51
vedanuj/grid-feats-vqa
Grid features pre-training code for visual question answering
Language: Python · 6 0 03
