yyyyysxz's Stars
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
xai-org/grok-1
Grok open release
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
chou141253/FGVC-PIM
Pytorch implementation for "A Novel Plug-in Module for Fine-Grained Visual Classification". fine-grained visual classification task.
amazon-science/mix-generation
MixGen: A New Multi-Modal Data Augmentation
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
li-xirong/coco-cn
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
fawazsammani/knowing-when-to-look-adaptive-attention
PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning
jiasenlu/AdaptiveAttention
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
fry404006308/fry_course_materials
范仁义录播课资料,会依次推出各种完全免费的前端、后端、大数据、人工智能等课程,课程网站: https://fanrenyi.com ; b站课程地址: https://space.bilibili.com/45664489 ;
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
ruotianluo/ImageCaptioning.pytorch
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.