wuhongsheng

AI算法工程师(CV、LLM方向)

Earth

Pinned Repositories

Demo
Language:Java1 2 00
HdCamera
基于PaddleLite 实现OCR识别
Language:C++2 2 10
ijkplayer
Android/iOS video player based on FFmpeg n4.0, with MediaCodec, VideoToolbox support.
Language:C1 1 00
Lpr
车牌识别
Language:C++1 1 01
ncnn-android-lighttrack
基于lighttrack和ncnn实现Android端目标跟踪
Language:C++12
RePlugin-AndroidX
A RePlugin branch supports AndroidX 支持AndroidX的RePlugin框架分支
Language:Java1 1 02
S2SAndroidClient
Speech2Speech Android client
Language:Kotlin41
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python10
vehicle_keyboard
车牌键盘
Language:Dart10 2 01
wtplayer
音视频播放器(基于ijkplayer) 扩展录像、截屏、水印等方法
Language:Java7 3 53

wuhongsheng's Repositories

wuhongsheng/wtplayer
音视频播放器(基于ijkplayer) 扩展录像、截屏、水印等方法
Language:Java7 3 53
wuhongsheng/S2SAndroidClient
Speech2Speech Android client
Language:Kotlin41
wuhongsheng/HdCamera
基于PaddleLite 实现OCR识别
Language:C++2 2 10
wuhongsheng/ijkplayer
Android/iOS video player based on FFmpeg n4.0, with MediaCodec, VideoToolbox support.
Language:C1 1 00
wuhongsheng/Lpr
车牌识别
Language:C++1 1 01
wuhongsheng/ncnn-android-lighttrack
基于lighttrack和ncnn实现Android端目标跟踪
Language:C++12
wuhongsheng/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python10
wuhongsheng/wuhongsheng.github.io
个人博客欢迎访问
Language:HTML1 2 00
wuhongsheng/HitAnimation
车机语音命中动画
Language:Java0 2 00
wuhongsheng/camera-samples
Multiple samples showing the best practices in camera APIs on Android.
Language:Kotlin1 0
wuhongsheng/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language:Python
wuhongsheng/DeepFilterNet
Noise supression using deep filtering
Language:Python
wuhongsheng/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python1 0
wuhongsheng/fish-speech
Brand new TTS solution
wuhongsheng/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python
wuhongsheng/GraphRAG-Local-UI
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
wuhongsheng/Hawkeye
Language:C++2 0
wuhongsheng/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
wuhongsheng/Logan
Logan is a lightweight case logging system based on mobile platform.
Language:C1 0
wuhongsheng/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language:C++1 0
wuhongsheng/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
wuhongsheng/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
wuhongsheng/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
wuhongsheng/NeMo
NeMo: a framework for generative AI
Language:Python0 0
wuhongsheng/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python0 0
wuhongsheng/TFexamples
TensorFlow examples
Language:Jupyter Notebook1 0
wuhongsheng/video-intelligence-api-visualiser
A simple app that lets you visualise annotations from the Google Cloud Video Intelligence API using your local files.
Language:JavaScript1 0
wuhongsheng/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python0 0
wuhongsheng/yasea
RTMP live streaming client for Android
Language:C1 0
wuhongsheng/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Language:Python0 0