Pinned Repositories
adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
blog_img
Image hosting for illustrations used in notes
flash-attention
Fast and memory-efficient exact attention
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
motion_planning
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Study-notes
Uploaded study notes
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
bobochow's Repositories
bobochow/Study-notes
Uploaded study notes
bobochow/adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
bobochow/blog_img
Image hosting for illustrations used in notes
bobochow/flash-attention
Fast and memory-efficient exact attention
bobochow/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
bobochow/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
bobochow/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
bobochow/motion_planning
bobochow/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
bobochow/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
bobochow/TubeViT
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
bobochow/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
bobochow/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training