MakRoya's Stars
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
cocos/cocos-engine
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D games and instant web entertainment.
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
GuijiAI/duix.ai
Docta-ai/docta
A Doctor for your data
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
OpenCSGs/csghub
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inference jobs with user interface. CSGHub also provides Python SDK with full compatibility of hf sdk. Join us together to build a safer and more open platform⭐️
Langboat/Mengzi3
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
juggleim/im-server
A high-performance IM server.
om-ai-lab/OmAgent
A Language and Multimodal Agents Framework for Smart Device and More
Thinklab-SJTU/Bench2Drive
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
520CCC/AIGenerateCode
Android谷歌上架马甲包垃圾代码混淆
bcmi/Awesome-Image-Composition
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion, which aims to generate realistic composite image.
Text-to-Audio/AudioLCM
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
NexaAI/Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
CrossPaste/crosspaste-desktop
Universal Pasteboard Across Devices
freechat-fun/freechat
https://freechat.fun
somta/Juggle
一款适用于微服务编排,第三方api集成,私有化定制开发,编写BFF聚合层等场景的强大低码编排工具!
OpenCSGs/csghub-server
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Windsander/ADI-Stable-Diffusion
Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms.
guanchuwang/redis-bench
wuji3/visiondk
A powerful baseline for image classification, face recognition and image retrieval with Pytorch
ZivJia/Cybersecurity-Doughnuts
Fullstack engineer's checklist for your cybersecurity.
dongxuyue/Open-ReplaceAnything
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
aws-samples/Intelli-Agent
Chatbot Portal with Agent: Streamlined Workflow for Building Agent-Based Applications
fanglu0411/sgs
SGS, is a user-friendly, collaborative and versatile browser for visualizing single-cell and spatial multiomics data.
aihao2000/IP-Adapter-Art
Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter