Pinned Repositories
motor_fault_diagnosis
雪浪工业数据智能挑战赛 工业智检——电机异音AI诊断
EvalAI
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
MAC
An end-to-end masked contrastive video-and-language pre-training framework
shufangxun's Repositories
shufangxun/MAC
An end-to-end masked contrastive video-and-language pre-training framework
shufangxun/openbilibili
哔哩哔哩后台源码