zhuyglx's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
FFmpeg/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Stability-AI/generative-models
Generative Models by Stability AI
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
leandromoreira/ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
continue-revolution/sd-webui-segment-anything
Segment Anything for Stable Diffusion WebUI
OpenNMT/CTranslate2
Fast inference engine for Transformer models
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
pharmapsychotic/clip-interrogator
Image to prompt with BLIP and CLIP
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
PyAV-Org/PyAV
Pythonic bindings for FFmpeg's libraries.
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
poloclub/diffusiondb
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
GPT-Fathom/GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
Yejin0111/ADD-GCN
ADD-GCN: Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition (ECCV 2020)
adxcreative/COPE