vhzy's Stars
MLNLP-World/Paper-Writing-Tips
A repository curated by the MLNLP community to help authors avoid small mistakes when submitting papers. Paper Writing Tips
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
awesome-stable-diffusion/awesome-stable-diffusion
Curated list of awesome resources for the Stable Diffusion AI Model.
AlibabaResearch/DAMO-ConvAI
DAMO-ConvAI: the official repository containing the codebase for Alibaba DAMO Conversational AI.
facebookresearch/atlas
Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
EvolvingLMMs-Lab/LongVA
Long Context Transfer from Language to Vision
WengLean/hands-on-research-tutorial
"Hands-On Research": a step-by-step guide for research beginners on how to get started with AI research
boheumd/MA-LMM
(CVPR 2024) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
mbzuai-oryx/Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
darcula1993/diffusion-models-class-CN
Materials for the Hugging Face Diffusion Models Course
YueFan1014/VideoAgent
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
imagegridworth/IG-VLM
LDLINGLINGLING/MiniCPM_Series_Tutorial
Projects and tutorials for MiniCPM and MiniCPM-V, covering six topics: inference, quantization, edge deployment, fine-tuning, technical reports, and applications
Ziyang412/VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
LDLINGLINGLING/AutoPlan
This project hosts the code for AutoPlan (published in Acta Automatica Sinica), which uses large language models to perform task planning and task execution for complex tasks
ziplab/LongVLM
IVG-SZ/Flash-VStream
Please refer to our official repo at https://github.com/IVGSZ/Flash-VStream.
Liuziyu77/Soda
Search, organize, discover anything!
orrzohar/Video-STaR
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Stanford-ILIAD/explore-eqa
Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"
rxtan2/Koala-video-llm
kkahatapitiya/LangRepo
Language Repository for Long Video Understanding
kahnchana/mvu
Multimodal Video Understanding Framework (MVU)
declare-lab/Sealing
[NAACL 2024] Official implementation of the paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image-Text Models"
Espere-1119-Song/Paper-Writing-Tips
This repository is the MLNLP community's curated collection for helping authors avoid small mistakes when submitting papers. Paper Writing Tips
lntzm/CVPR24Track-LongVideo