xhyandwyy's Stars
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
PaddlePaddle/PaddleNLP
๐ Easy-to-use and powerful NLP and LLM library with ๐ค Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including ๐Text Classification, ๐ Neural Search, โ Question Answering, โน๏ธ Information Extraction, ๐ Document Intelligence, ๐ Sentiment Analysis etc.
openai/DALL-E
PyTorch package for the discrete VAE used for DALLยทE.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
huggingface/accelerate
๐ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
huggingface/notebooks
Notebooks using the Hugging Face libraries ๐ค
facebookresearch/SentEval
A python tool for evaluating the quality of sentence embeddings.
whai362/PVT
Official implementation of PVT series
facebookresearch/LAMA
LAnguage Model Analysis
tylin/coco-caption
joe-siyuan-qiao/DetectoRS
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution
lukemelas/PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
facebookresearch/GENRE
Autoregressive Entity Retrieval
ryouchinsa/Rectlabel-support
RectLabel is an offline image annotation tool for object detection and segmentation.
dddzg/up-detr
[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
hemingkx/ChineseNMT
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
MILVLG/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
LuoweiZhou/VLP
Vision-Language Pre-training for Image Captioning and Question Answering
huggingface/nn_pruning
Prune a model while finetuning or training.
pzzhang/VinVL
project page for VinVL
lucidrains/transformer-in-transformer
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch
layumi/Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://arxiv.org/abs/1711.05535
VITA-Group/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
aftix/bacon
Scientific Computing in Rust
okankop/MFF-pytorch
Motion Fused Frames implementation in PyTorch, codes and pretrained models.
berniebear/Multi-HT100M
vedanuj/grid-feats-vqa
Grid features pre-training code for visual question answering