Einstone-rose

Ph.D & BIGAI@BIT || Research Intern@Ant Group || VQA, VideoLLM, 3D Understanding

Beijing Institute of TechnologyBeijing

Einstone-rose's Stars

microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36k 347 2.9k4.2k
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python34.8k 177 5.1k2.6k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.7k 230 2733.2k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.7k 325 4053.4k
fengdu78/lihang-code
《统计学习方法》的代码实现
Language:Jupyter Notebook19.1k 536 496.3k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.3k 257 128840
dragen1860/Deep-Learning-with-TensorFlow-book
深度学习入门开源书，基于TensorFlow 2.0案例实战。Open source Deep Learning book, based on TensorFlow 2.0 framework.
Language:Jupyter Notebook13.2k 487 2204.1k
facebookresearch/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python6.7k 97 6921.2k
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
6.2k 179 16858
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
5.2k 138 54879
dk-liang/Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
3.4k 102 41400
pengzhiliang/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Language:Python2.6k 24 97342
luca-medeiros/lang-segment-anything
SAM with text prompt
Language:Python1.8k 12 60202
Eurus-Holmes/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
Language:Python1.3k 41 1150
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
1k 29 598
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
225 10 19
BeierZhu/Prompt-align
[ICCV 2023] Prompt-aligned Gradient for Prompt Tuning
Language:Python153 3 119
snap-research/discoscene
CVPR 2023 Highlight: DiscoScene
Language:Python148 25 52
seba-1511/lstms.pth
PyTorch implementations of LSTM Variants (Dropout + Layer Norm)
Language:Python136 9 324
Yimin-Liu/Awesome-Unsupervised-Person-Re-identification
Awesome-Unsupervised-Person-Re-identification
136 4 018
rentainhe/TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
Language:Python66 3 918
kkahatapitiya/Coarse-Fine-Networks
Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"
Language:Python56 2 117
luogen1996/SimREC
A lightweight codebase for referring expression comprehension and segmentation
Language:Python52 2 04
PhoebusSi/VQA-VS
Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"
Language:Python36 2 21
scottyih/Slides
35 4 09
Huntersxsx/TSGV-Learning-List
Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作
32 2 03
ttharden/Keyframe-Extraction-for-video-summarization
Language:Python226
kophy/py4db
Python with SQLite/MySQL/LMDB/LevelDB.
Language:Python17 2 04
Trunpm/PMT-AAAI23
Efficient End-to-End Video-Question Answering with Pyramidal Multimodal Transformer - AAAI23
Language:Python7 2 11
Einstone-rose/Awesome-TSGV
Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作
1 1 00

Einstone-rose

Einstone-rose's Stars

microsoft/DeepSpeed

gradio-app/gradio

meta-llama/llama3

openai/CLIP

fengdu78/lihang-code

BradyFU/Awesome-Multimodal-Large-Language-Models

dragen1860/Deep-Learning-with-TensorFlow-book

facebookresearch/SlowFast

pliang279/awesome-multimodal-ml

zhaoxin94/awesome-domain-adaptation

dk-liang/Awesome-Visual-Transformer

pengzhiliang/MAE-pytorch

luca-medeiros/lang-segment-anything

Eurus-Holmes/Awesome-Multimodal-Research

TheShadow29/awesome-grounding

ttengwang/Awesome_Long_Form_Video_Understanding

BeierZhu/Prompt-align

snap-research/discoscene

seba-1511/lstms.pth

Yimin-Liu/Awesome-Unsupervised-Person-Re-identification

rentainhe/TRAR-VQA

kkahatapitiya/Coarse-Fine-Networks

luogen1996/SimREC

PhoebusSi/VQA-VS

scottyih/Slides

Huntersxsx/TSGV-Learning-List

ttharden/Keyframe-Extraction-for-video-summarization

kophy/py4db

Trunpm/PMT-AAAI23

Einstone-rose/Awesome-TSGV