Cherrysaber

Cherrysaber's Stars

erebe/wstunnel
Tunnel all your traffic over Websocket or HTTP2 - Bypass firewalls/DPI - Static binary available
Language:Rust4.2k363
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language:Python4.8k392
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C34.8k3.5k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.7k4.1k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++65.7k9.4k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python52.2k5.5k
saoudrizwan/claude-dev
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way.
Language:TypeScript5.8k540
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.6k1.2k
provectus/kafka-ui
Open-Source Web UI for Apache Kafka Management
Language:Java9.6k1.2k
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language:C++6.8k348
wenet-e2e/west
We Speech Transcript based on LLM, in 300 lines of code.
Language:Python11911
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python6.9k710
mayaxcn/china-ip-list
每小时更新**IP范围列表，Update Mainland China ip‘s list in everyhour
Language:C#34753
phaserjs/phaser
Phaser is a fun, free and fast 2D game framework for making HTML5 games for desktop and mobile web browsers, supporting Canvas and WebGL rendering.
Language:JavaScript37k7.1k
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
Language:C832132
EasyTier/EasyTier
A simple, decentralized mesh VPN with WireGuard support.
Language:Rust1.4k140
WireGuard/wireguard-go
Mirror only. Official repository is at https://git.zx2c4.com/wireguard-go
Language:Go3.1k1k
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
15.2k1.4k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k757
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language:C++3.3k287
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Language:Python23.9k3.1k
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python83.8k6.5k
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook28.1k3.2k
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k95
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.2k657
opencv/opencv_contrib
Repository for OpenCV's extra modules
Language:C++9.4k5.8k
spmallick/learnopencv
Learn OpenCV : C++ and Python Examples
Language:Jupyter Notebook21.1k11.6k
opencv/opencv
Open Source Computer Vision Library
Language:C++78.3k55.7k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.1k402
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python11.6k969