Libresse
Too many of us are not living our dreams because we are living our fears.
@TrustFiNetwork Asia/Shanghai
Libresse's Stars
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
cocos/cocos-engine
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D games and instant web entertainment.
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), audio language models, automatic speech recognition (ASR), and text-to-speech (TTS) capabilities.
fudan-generative-vision/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
GuijiAI/duix.ai
Everlyn-Labs/Everlyn-1
The first open autoregressive foundational video AI model.
Docta-ai/docta
A Doctor for your data
gpt-omni/mini-omni
An open-source multimodal large language model that can hear and talk while thinking. It features real-time, end-to-end speech input and streaming audio output for conversation.
juggleim/im-server
A high-performance IM server.
risesoft-y9/Digital-Infrastructure
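Digital Infrastructure is a unified, secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, and security controls. It follows the three-administrator management model, supports microservices, multi-tenancy, containerization, and domestic (Chinese) technology stacks, lets users quickly build their own business applications with a code generator, and can connect to many mature, easy-to-use applications from its internal ecosystem.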
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
om-ai-lab/OmAgent
A Multimodal Language Agent Framework for Problem Solving and More
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Thinklab-SJTU/Bench2Drive
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
520CCC/AIGenerateCode
Junk-code obfuscation for Android reskinned ("vest") app packages published on Google Play
Text-to-Audio/AudioLCM
PyTorch implementation of AudioLCM (ACM-MM'24): efficient, high-quality text-to-audio generation with a latent consistency model.
Zefan-Cai/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models
alibaba/Tora
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
fudan-generative-vision/dynamicPDB
Dynamic Protein Data Bank
gh0stintheshe11/Stats-SVG
⭐ Dynamically generate stats SVG from your Github, LeetCode, Steam, and more in #Cyberpunk style :)
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
freechat-fun/freechat
https://freechat.fun
Windsander/ADI-Stable-Diffusion
Accelerate your Stable Diffusion inference with the library's universal, cross-platform C/C++ framework design, powered by ONNXRuntime.
mqz111a/virtual_human_stream
The "virtual_human_stream" project is a real-time digital human system supporting audio-video dialogue. It integrates models like ernerf, musetalk, and wav2lip for voice cloning, video stitching, and streaming via RTMP/WebRTC. It’s optimized for high performance and easy customization, with support for ChatGPT dialogue integration.
guanchuwang/redis-bench
open-halo/halo-starter
Next Generation Java Starter Project
dirtycomputer/O2M_attack
vue-rookie/uni-vue3
A rapid-development template framework for mini programs, built on the latest front-end stack: vue3 + vite5 + uniapp + unocss + uview-plus