ajiansoft

ajiansoft's Stars

netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python12.2k1.2k
anliyuan/Ultralight-Digital-Human
一个超轻量级、可以在移动端实时运行的数字人模型
Language:Python1.3k200
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language:Python2k235
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Language:Python918107
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
Language:Rust33.8k6k
PeterH0323/Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋
Language:Python2.7k421
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language:C++2.3k407
janhq/ichigo
Local realtime voice AI
Language:Python2.1k111
yerfor/MimicTalk
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
Language:Python48755
crmeb/crmeb_java
Java商城免费开源 CRMEB商城JAVA版，SpringBoot + Maven + Swagger + Mybatis Plus + Redis + Uniapp +Vue+elementUI 包含移动端、小程序、PC后台、Api接口；有产品、用户、购物车、订单、积分、优惠券、营销、余额、权限、角色、系统设置、组合数据、可拖拉拽的form表单等模块，大量的减少了二开的成本。
Language:Java1.5k417
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Language:Python2.5k200
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python3.2k402
xiangyuecn/Recorder
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式，支持pc和Android、iOS部分浏览器、Hybrid App（提供Android iOS App源码）、微信，提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
Language:JavaScript5k1k
Anttwo/Frosting
[ECCV 2024 - ORAL] Official PyTorch implementation of Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering
Language:Python2489
kevmo314/ffmpeg-webrtc
FFmpeg WebRTC (WHIP) muxer
Language:C264
lipku/python_rtmpstream
python库，实现推送实时rtmp音视频流
Language:C++10131
kamya-ai/Realtime-speech-detection
Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live voice activity detection, detecting when there is speech present in an audio stream and when it goes silent.
Language:Python121
ianmarmour/speech-detector
Local voice activity detection of PCM audio streams using Silero VAD
Language:TypeScript101
Lcasmendes/Meet_aplication_withZMQ
Python based application using ZeroMQ, capable of send text, video and audio
Language:Python1
Lifestohack/ffm_zmq
Real time video streaming service to transmit live or offline video from one computer to another computer using ffmpeg to read the source and zmq to send the video over tcp.
Language:Python31
aliyunvideo/AliyunPlayer_Web
The kinds of demo for H5 Aliplayer, which cover live, playback and multiple platforms, such as mobile, pc and weixin and so on
Language:JavaScript9381k
131/h264-live-player
A live h264 player for the browser (ideal for raspberrypi / raspicam )
Language:JavaScript1.1k250
LxbNNN/webrtc-player
webrtcPlayer.js是基于webrtc协议的播放器兼容各主流流媒体webrtc协议 SRS、ZLM、M7s等
Language:TypeScript41
oddengine/odd.js
This is not only an HTML5 FLV Player, but also a WebRTC, IM SDK, and FC/NES emulator.
Language:JavaScript18265
ossrs/flutter_live
Live streaming player, iOS+Android, RTMP/HTTP-FLV/HLS/WebRTC, by Flutter+SRS.
Language:JavaScript336106
ambianic/peerfetch
Peer-to-peer HTTP over WebRTC.
Language:TypeScript60118
chenchenwuai/H265Player
在浏览器上播放H265视频流,通过websocket获取到每一帧h265流,并解析播放. 技术包含：WebAssembly(FFmpeg)解码;web worker; webgl;canvas
Language:JavaScript566
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.6k389
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python8.9k856
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3.2k290