wuxiqi's Stars
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
BonnieZbw/CT2CQA
guyouyin123/tools
开发人员使用的各种工具包合集
fudan-generative-vision/dynamicPDB
Dynamic PDB datasets
gh0stintheshe11/Stats-SVG
⭐ Dynamically generate stats SVG from your Github, LeetCode, Steam, and more in #Cyberpunk style :)
3DTopia/3DTopia-XL
3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
fudan-generative-vision/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Ww656556431/AICC
AI外呼系统,基于自然语言处理(NLP)、语音识别(ASR)、语音合成(TTS)和通讯(freeswitch)技术,实现自动语音应答,听说状态的实时切换,用自然逼真的对话与客户沟通。
risesoft-y9/Network-Drive
网络硬盘是通过存储、分类、检索、分享、协作、下发、回收等方式管理文档、文件、图片、音频、视频等资料的工具。网络硬盘擅长在国产的私有化环境中,管控文档权限、分配存储空间、安全加密、分享共享,同时可以完成一定轻量级的文件任务收发。网络硬盘是一个完全开源的项目,无商业版,但是需要依赖开源的数字底座进行人员岗位管控。
yangbincv/ADCA
guanchuwang/redis-bench
risesoft-y9/Digital-Infrastructure
数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸多成熟且好用的内部生态应用
om-ai-lab/OmAgent
A multimodal agent framework for solving complex tasks [EMNLP'2024]
yaninsanity/Sutra_QAS
A system demo based on Retrival Argument Generation to answer buddism question
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
xtaci/gaio
High performance minimalism async-io(proactor) networking for Golang.
juggleim/im-server
A high-performance IM server.
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
fufankeji/MateGen
Next-Generation Interactive Intelligent Programming Assistant
Lattice-zjj/On-Device-FinLLM
OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is built by fine-tuning LLaMA using a specialized instruction dataset created from publicly available Chinese financial Q&A data and additional web-scraped financial information.
j66n/acte
A framework to build GUI-like Agent Tools, enhancement to Function Calling of LLM AI.
OpenCSGs/csghub-server
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
520CCC/AIGenerateCode
Android谷歌上架马甲包垃圾代码混淆
risesoft-y9/Data-Labeling
数据标注是一款专门对文本数据进行处理和标注的工具,通过简化快捷的文本标注流程和动态的算法反馈,支持用户快速标注关键词并能通过算法持续减少人工标注的成本和时间。数据标注的过程先由人工标注构筑基础,再由自动标注反哺人工标注,最后由人工标注进行纠偏,从而大幅度提高标注的精准度和高效性。数据标注是一个完全开源的项目,无商业版,但是需要依赖开源的数字底座进行人员岗位管控。各类词库结果会定期在本平台公开。
NexaAI/Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
SheldongChen/AMD.github.io
Explainable Person Re-Identification with Attribute-guided Metric Distillation
WoodScene/TaSL