chenzhongzheng's Stars
AutonoBot-Lab/BestMan_Pybullet
Codebase for the 'BestMan' Mobile Manipulator
microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
ros2/ros2
The Robot Operating System, is a meta operating system for robots.
jkobject/geneformer
baaivision/Emu3
Next-Token Prediction is All You Need
XLabs-AI/x-flux
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
sindresorhus/awesome
😎 Awesome lists about all kinds of interesting topics
Stability-AI/generative-models
Generative Models by Stability AI
Azure/synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be step-by-step for developers and data scientists, as well as those in the field, to try it out with a little help.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
MiuLab/Taiwan-LLM
Traditional Mandarin LLMs for Taiwan
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
receyuki/stable-diffusion-prompt-reader
A simple standalone viewer for reading prompts from Stable Diffusion generated image outside the webui.
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
LLaVA-VL/LLaVA-Interactive-Demo
LLaVA-Interactive-Demo
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
LLaVA-VL/LLaVA-NeXT
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
microsoft/RecAI
Bridging LLM and Recommender System.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)