tianbaochou's Stars
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
HigherOrderCO/Bend
A massively parallel, high-level programming language
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
triton-lang/triton
Development repository for the Triton language and compiler
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
fossasia/visdom
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
mlc-ai/web-stable-diffusion
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
LLaVA-VL/LLaVA-NeXT
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
swc-17/SparseDrive
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
junjie18/CMT
[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
Daniel-xsy/RoboBEV
RoboBEV: Towards Robust Bird's Eye View Perception under Common Corruption and Domain Shift
HuaiyuanXu/3D-Occupancy-Perception
[Information Fusion 2024] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
megvii-research/Far3D
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection