Cherrysaber's Stars
erebe/wstunnel
Tunnel all your traffic over Websocket or HTTP2 - Bypass firewalls/DPI - Static binary available
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ggerganov/llama.cpp
LLM inference in C/C++
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
saoudrizwan/claude-dev
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
provectus/kafka-ui
Open-Source Web UI for Apache Kafka Management
mamba-org/mamba
The Fast Cross-Platform Package Manager
wenet-e2e/west
We Speech Transcript based on LLM, in 300 lines of code.
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
mayaxcn/china-ip-list
每小时更新**IP范围列表,Update Mainland China ip‘s list in everyhour
phaserjs/phaser
Phaser is a fun, free and fast 2D game framework for making HTML5 games for desktop and mobile web browsers, supporting Canvas and WebGL rendering.
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
EasyTier/EasyTier
A simple, decentralized mesh VPN with WireGuard support.
WireGuard/wireguard-go
Mirror only. Official repository is at https://git.zx2c4.com/wireguard-go
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
OpenNMT/CTranslate2
Fast inference engine for Transformer models
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
opencv/opencv_contrib
Repository for OpenCV's extra modules
spmallick/learnopencv
Learn OpenCV : C++ and Python Examples
opencv/opencv
Open Source Computer Vision Library
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2