hiidea's Stars
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
data-infra/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
cocos/cocos-engine
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D games and instant web entertainment.
HsiuChi/doudou-peiwan-master
huawei-noah/HEBO
Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab
Thinklab-SJTU/Bench2Drive
Cloased-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
Text-to-Audio/AudioLCM
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
GuijiAI/duix.ai
Langboat/Mengzi3
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
GetNude/Clothoff
One-click intelligent dressing system based on AI. Clothoff、Undress、Clothes remover
ShareGPT4Omni/ShareGPT4Video
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
gersteinlab/ML-Bench
The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.09835)
om-ai-lab/OmDet
Real-time and accurate open-vocabulary end-to-end object detection
OpenCSGs/CSGHub
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数据集、模型文件、代码等)。CSGHub提供类似私有化的Huggingface功能,以类似OpenStack Glance管理虚拟机镜像、Harbor管理容器镜像以及Sonatype Nexus管理制品的方式,实现对LLM资产的管理。欢迎关注反馈和Star⭐️
bytewiz3/TravelGPT
ethstorage/es-node
Golang implementation of the EthStorage node.
multivactech/MultiVAC
All-dimensional Sharding Flexible Blockchain
bcmi/Awesome-Image-Composition
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion, which aims to generate realistic composite image.
childrentime/reactuse
Collection of essential React Hooks Utilities.
Websoft9/websoft9
Applications self-hosting platform for running open source, web-based linux Panel of lite PaaS
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
kwuking/TimeMixer
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
TaskingAI/TaskingAI
The open source platform for AI-native application development.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
zuoyebang/bitalostored
Bitalostored is a high-performance distributed storage system, core engine based on bitalosdb(self-developed), compatible with Redis protocol.
polyfillpolyfill/polyfill-service
Automatic polyfill service.
DeepInsight-AI/DeepBI
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.