Pinned Repositories
Demo
HdCamera
OCR recognition implemented with PaddleLite
ijkplayer
Android/iOS video player based on FFmpeg n4.0, with MediaCodec, VideoToolbox support.
Lpr
License plate recognition
ncnn-android-lighttrack
Object tracking on Android, implemented with lighttrack and ncnn
RePlugin-AndroidX
A branch of the RePlugin framework that supports AndroidX
S2SAndroidClient
Speech2Speech Android client
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
vehicle_keyboard
License plate keyboard
wtplayer
Audio/video player (based on ijkplayer), extended with recording, screenshot, watermark, and other methods
wuhongsheng's Repositories
wuhongsheng/wtplayer
Audio/video player (based on ijkplayer), extended with recording, screenshot, watermark, and other methods
wuhongsheng/S2SAndroidClient
Speech2Speech Android client
wuhongsheng/HdCamera
OCR recognition implemented with PaddleLite
wuhongsheng/ijkplayer
Android/iOS video player based on FFmpeg n4.0, with MediaCodec, VideoToolbox support.
wuhongsheng/Lpr
License plate recognition
wuhongsheng/ncnn-android-lighttrack
Object tracking on Android, implemented with lighttrack and ncnn
wuhongsheng/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
wuhongsheng/wuhongsheng.github.io
Personal blog; visitors welcome
wuhongsheng/HitAnimation
Voice-command hit animation for in-vehicle head units
wuhongsheng/camera-samples
Multiple samples showing the best practices in camera APIs on Android.
wuhongsheng/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
wuhongsheng/DeepFilterNet
Noise suppression using deep filtering
wuhongsheng/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
wuhongsheng/fish-speech
Brand new TTS solution
wuhongsheng/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
wuhongsheng/GraphRAG-Local-UI
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
wuhongsheng/Hawkeye
wuhongsheng/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
wuhongsheng/Logan
Logan is a lightweight case logging system based on mobile platform.
wuhongsheng/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
wuhongsheng/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
wuhongsheng/mini-omni
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
wuhongsheng/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
wuhongsheng/NeMo
NeMo: a framework for generative AI
wuhongsheng/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
wuhongsheng/TFexamples
TensorFlow examples
wuhongsheng/video-intelligence-api-visualiser
A simple app that lets you visualise annotations from the Google Cloud Video Intelligence API using your local files.
wuhongsheng/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
wuhongsheng/yasea
RTMP live streaming client for Android
wuhongsheng/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite