Piony's Stars
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
cocos/cocos-engine
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D games and instant web entertainment.
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
GuijiAI/duix.ai
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
OpenCSGs/csghub
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inference jobs with user interface. CSGHub also provides Python SDK with full compatibility of hf sdk. Join us together to build a safer and more open platform⭐️
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
data-infra/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
Langboat/Mengzi3
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
om-ai-lab/OmDet
Real-time and accurate open-vocabulary end-to-end object detection
Thinklab-SJTU/Bench2Drive
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
520CCC/AIGenerateCode
Android谷歌上架马甲包垃圾代码混淆
Text-to-Audio/AudioLCM
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
om-ai-lab/OmAgent
A multimodal agent framework for solving complex tasks [EMNLP'2024]
bytewiz3/TravelGPT
juggleim/im-server
A high-performance IM server.
gh0stintheshe11/Stats-SVG
⭐ Dynamically generate stats SVG from your Github, LeetCode, Steam, and more in #Cyberpunk style :)
fudan-generative-vision/dynamicPDB
Dynamic PDB datasets
risesoft-y9/Digital-Infrastructure
数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸多成熟且好用的内部生态应用
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
freechat-fun/freechat
https://freechat.fun
JinhuaLiang/WavCraft
Official repo for WavCraft, an AI agent for audio creation and editing
Windsander/ADI-Stable-Diffusion
Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms.
guanchuwang/redis-bench
ZivJia/Cybersecurity-Doughnuts
Fullstack engineer's checklist for your cybersecurity.
dongxuyue/Open-ReplaceAnything
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
LiberBinjio/GlobalTalk-Hub
GlobalTalk Hub is a place where you can chat without language barriers. Here you can speak your native language freely to friends all over the world and will be understood by them easily. Have fun💕🎈🌭🍔