CuthbertCai's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
apple/ml-ferret
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
isaac-sim/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
huawei-noah/bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
mit-han-lab/tinyengine
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
mit-han-lab/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
FoundationVision/Groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
jshilong/GPT4RoI
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
google/aqt
Fraunhofer-IMS/AIfES_for_Arduino
This is the Arduino® compatible port of the AIfES machine learning framework, developed and maintained by Fraunhofer Institute for Microelectronic Circuits and Systems.
SonyResearch/COALA
COALA: A Practical and Vision-Centric Federated Learning Platform, accepted to ICML'24
TencentARC/mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
OpenGVLab/GUI-Odyssey
GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes from 6 mobile devices, spanning 6 types of cross-app tasks, 201 apps, and 1.4K app combos.
showlab/videogui
official repo of "VideoGUI: A Benchmark for GUI Automation from Instructional Videos"
theyoungkwon/TinyTrain
The official implementation of TinyTrain [ICML '24]
pavmassimo/TyBox
SLDGroup/GradientFilter-CVPR23
showlab/SCT
[IJCV2023] Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"