tianbaochou's Stars
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
coderonion/awesome-llm-and-aigc
🚀🚀🚀 A collection of awesome public projects about Large Language Models, Vision Foundation Models, and AI-Generated Content.
ChaoningZhang/MobileSAM
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
alibaba/ali-dbhub
Migrated to a new repository; this version will no longer be maintained.
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
ltsopensource/light-task-scheduler
Distributed Scheduled Job Framework
jina-ai/serve
☁️ Build multimodal AI applications with cloud-native stack
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
msnh2012/Msnhnet
🔥 (yolov3 yolov4 yolov5 unet ...) A mini PyTorch inference framework inspired by darknet.
Jittor/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
ShiqiYu/libfacedetection
An open source library for face detection in images. The face detection speed can reach 1000FPS.
uploadcare/pillow-simd
The friendly PIL fork