chenzhongzheng

chenzhongzheng's Stars

AutonoBot-Lab/BestMan_Pybullet
Codebase for the 'BestMan' Mobile Manipulator
Language: Python 1649
microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
Language: Jupyter Notebook, Stars: 4.8k, Forks: 360
ros2/ros2
The Robot Operating System is a meta operating system for robots.
Stars: 3.6k, Forks: 682
jkobject/geneformer
Language: Jupyter Notebook 373
baaivision/Emu3
Next-Token Prediction is All You Need
Language: Python, Stars: 1.8k, Forks: 72
XLabs-AI/x-flux
Language: Python, Stars: 1.6k, Forks: 117
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language: Python, Stars: 26.3k, Forks: 5.4k
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language: Python, Stars: 1.8k, Forks: 173
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python, Stars: 143k, Forks: 27k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language: Python, Stars: 23.3k, Forks: 2.3k
THUDM/CogVideo
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python, Stars: 9.2k, Forks: 865
sindresorhus/awesome
😎 Awesome lists about all kinds of interesting topics
Stars: 334k, Forks: 27.9k
Stability-AI/generative-models
Generative Models by Stability AI
Language: Python, Stars: 24.7k, Forks: 2.7k
Azure/synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be step-by-step for developers and data scientists, as well as those in the field, to try it out with a little help.
Language: Jupyter Notebook 349
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language: TypeScript, Stars: 52.1k, Forks: 7.6k
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), multimodal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT/Claude application.
Language: TypeScript, Stars: 44.7k, Forks: 10.1k
MiuLab/Taiwan-LLM
Traditional Mandarin LLMs for Taiwan
Language: Python, Stars: 1.3k, Forks: 104
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language: Python, Stars: 19.2k, Forks: 1.9k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python, Stars: 38.8k, Forks: 4.3k
receyuki/stable-diffusion-prompt-reader
A simple standalone viewer for reading prompts from Stable Diffusion generated images outside the webui.
Language: Python, Stars: 1.1k, Forks: 69
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Python, Stars: 3.5k, Forks: 297
LLaVA-VL/LLaVA-Interactive-Demo
LLaVA-Interactive-Demo
Language: Python 35326
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Language: Jupyter Notebook, Stars: 8.1k, Forks: 859
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Language: Python, Stars: 4.4k, Forks: 403
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
Language: Python, Stars: 2k, Forks: 150
LLaVA-VL/LLaVA-NeXT
Language: Python, Stars: 2.9k, Forks: 250
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Jupyter Notebook, Stars: 2k, Forks: 273
microsoft/RecAI
Bridging LLM and Recommender System.
Language: Jupyter Notebook 59754
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python, Stars: 20.3k, Forks: 2.2k
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Language: Python, Stars: 2.7k, Forks: 250