tianbaochou

tianbaochou's Stars

DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.8k259
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python30.1k4.5k
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Language:Python1.5k127
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.6k806
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
6.1k856
coderonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
52548
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language:Jupyter Notebook4.8k503
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.7k3.5k
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
Language:Python3.7k224
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript76.6k59.1k
alibaba/ali-dbhub
已迁移新仓库，此版本将不再维护
8.3k1.3k
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Language:Python1k96
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.6k242
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language:Python7.9k967
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.2k1.3k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.1k1.4k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python38.8k4.3k
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Language:Python40.7k5.2k
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language:Python12k1.1k
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9.2k819
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python37.1k3.2k
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.6k489
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Language:Python1k124
ltsopensource/light-task-scheduler
Distributed Scheduled Job Framework
Language:Java3k1.1k
jina-ai/serve
☁️ Build multimodal AI applications with cloud-native stack
Language:Python21.1k2.2k
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python26.8k3k
msnh2012/Msnhnet
🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.
Language:C++750144
Jittor/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
Language:Python3.1k312
ShiqiYu/libfacedetection
An open source library for face detection in images. The face detection speed can reach 1000FPS.
Language:C++12.3k3k
uploadcare/pillow-simd
The friendly PIL fork
Language:Python2.2k86