lujiale621's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language:Python134k 1.1k 15.9k26.7k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python69.9k 574 08.2k
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java43.8k 149 8393.6k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python34.3k 204 1.3k3.9k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.3k 428 4.2k6.4k
fishaudio/fish-speech
Brand new TTS solution
Language:Python13.4k 93 3851k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.3k 101 556862
niedev/RTranslator
Open source real-time translation app for Android that runs locally
Language:C++6.7k 50 58501
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.1k 71 991763
YaoFANGUK/video-subtitle-extractor
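A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.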
Language:Python5.9k 44 273654
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python5.6k 55 435595
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language:Python3.8k 54 149592
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++3.4k 49 505392
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
Language:Python3.3k 40 307210
NapNeko/NapCatQQ
A modern Bot protocol-side implementation based on NTQQ
Language:TypeScript2.3k 9 299158
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Language:Python1.2k 32 3644
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.2k 31 6673
jsksxs360/How-to-use-Transformers
Quick-start tutorial for the Transformers library
Language:Python1k 9 20135
muzishen/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
Language:Python1k 15 4183
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python902 12 1468
lxfater/BilibiliSummary
A Chrome extension that helps you summarize videos on Bilibili.
Language:TypeScript717 5 1554
X-T-E-R/Uni-TTS
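This project aims to unify how different speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or started as a backend service.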
Language:Python633 8 3156
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language:Python523 17 91118
rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Language:Python507 10 7740
Xiaojiu-z/Stable-Hair
Stable-Hair: Real-World Hair Transfer via Diffusion Model
350 38 722
X-T-E-R/GPT-SoVITS-Inference
Inference Specialization
Language:Python322 6 024
OpenGVLab/video-mamba-suite
The suite of modeling video with Mamba
Language:Python223 3 1822
yeliudev/R2-Tuning
🌀 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
Language:Python60 7 171
wenet-e2e/wesubtitle
Extract hard-coded subtitles from videos using OCR
Language:Python56 5 410
sudo-Boris/mr-Blip
Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval"
Language:Python40 5 70