Pinned Repositories
AlphaPose
Real-Time and Accurate Multi-Person Pose Estimation&Tracking System
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
learnopencv
Learn OpenCV : C++ and Python Examples
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
OpenTracker
Real-time C++ ECO tracker etc. speed-up by SSE/NEON, support Linux, Mac, Jetson TX1/2, raspberry pi
PaddleClas
A treasure chest for visual classification and recognition powered by PaddlePaddle
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
ds-gong's Repositories
ds-gong/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
ds-gong/srs
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
ds-gong/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
ds-gong/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
ds-gong/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
ds-gong/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ds-gong/PaddleClas
A treasure chest for visual classification and recognition powered by PaddlePaddle
ds-gong/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
ds-gong/rknn-toolkit
ds-gong/rknpu
ds-gong/learnopencv
Learn OpenCV : C++ and Python Examples
ds-gong/VideoProcessingFramework
Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions
ds-gong/AlphaPose
Real-Time and Accurate Multi-Person Pose Estimation&Tracking System
ds-gong/maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
ds-gong/OpenTracker
Real-time C++ ECO tracker etc. speed-up by SSE/NEON, support Linux, Mac, Jetson TX1/2, raspberry pi