ajiansoft's Stars
netease-youdao/QAnything
Question and Answer based on Anything.
anliyuan/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 Easily build lightweight multi-platform desktop apps with Rust
PeterH0323/Streamer-Sales
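Streamer-Sales (销冠, "Sales Champion"): a sales-streamer LLM 🛒🎁 that, given a product's features, presents the product in a way designed to stimulate the viewer's desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based frontend 🍍, a FastAPI backend 🗝️, and Docker-compose packaging and deployment 🐋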
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
janhq/ichigo
Local realtime voice AI
yerfor/MimicTalk
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
crmeb/crmeb_java
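Java e-commerce store, free and open source. The Java edition of the CRMEB store, built with SpringBoot + Maven + Swagger + Mybatis Plus + Redis + Uniapp + Vue + elementUI. Includes mobile, mini program, PC admin backend, and API interfaces, with modules for products, users, shopping cart, orders, points, coupons, marketing, balance, permissions, roles, system settings, composite data, and drag-and-drop forms, greatly reducing the cost of secondary development.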
THUDM/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
xiangyuecn/Recorder
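HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats. Supports PC, some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat. Provides ASR speech-to-text, an H5 voice call and chat example, and DTMF encoding and decoding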
Anttwo/Frosting
[ECCV 2024 - ORAL] Official PyTorch implementation of Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering
kevmo314/ffmpeg-webrtc
FFmpeg WebRTC (WHIP) muxer
lipku/python_rtmpstream
Python library for pushing real-time RTMP audio and video streams
kamya-ai/Realtime-speech-detection
Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live voice activity detection, detecting when there is speech present in an audio stream and when it goes silent.
ianmarmour/speech-detector
Local voice activity detection of PCM audio streams using Silero VAD
Lcasmendes/Meet_aplication_withZMQ
Python-based application using ZeroMQ, capable of sending text, video, and audio
Lifestohack/ffm_zmq
Real time video streaming service to transmit live or offline video from one computer to another computer using ffmpeg to read the source and zmq to send the video over tcp.
aliyunvideo/AliyunPlayer_Web
Demos for the H5 Aliplayer, covering live streaming, playback, and multiple platforms such as mobile, PC, and Weixin
131/h264-live-player
A live h264 player for the browser (ideal for raspberrypi / raspicam )
LxbNNN/webrtc-player
webrtcPlayer.js is a player based on the WebRTC protocol, compatible with the WebRTC protocols of mainstream streaming media servers such as SRS, ZLM, and M7s
oddengine/odd.js
This is not only an HTML5 FLV Player, but also a WebRTC, IM SDK, and FC/NES emulator.
ossrs/flutter_live
Live streaming player, iOS+Android, RTMP/HTTP-FLV/HLS/WebRTC, by Flutter+SRS.
ambianic/peerfetch
Peer-to-peer HTTP over WebRTC.
chenchenwuai/H265Player
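Plays H265 video streams in the browser: fetches each frame of the H265 stream over WebSocket, then parses and plays it. Technologies include WebAssembly (FFmpeg) decoding, web workers, WebGL, and canvas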
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT-4o
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.