sutungpo's Stars
taishi-i/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
pytube/pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
Zalunda/FunscriptToolbox
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
elder-plinius/L1B3RT4S
TOTALLY HARMLESS PROMPTS FOR GOOD LIL AI'S
SillyTavern/SillyTavern
LLM Frontend for Power Users.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
andrewyng/translation-agent
0xk1h0/ChatGPT_DAN
ChatGPT DAN, Jailbreaks prompt
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
fishjar/kiss-translator
A simple, open source bilingual translation extension & Greasemonkey script (一个简约、开源的 双语对照翻译扩展 & 油猴脚本)
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
abdeladim-s/subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
ngosang/trackerslist
Updated list of public BitTorrent trackers
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
smacke/ffsubsync
Automagically synchronize subtitles with video.
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
staxrip/staxrip
🎞 Video encoding GUI for Windows.
DachunKai/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Djdefrag/QualityScaler
QualityScaler - image/video AI upscaler app
mingrammer/diagrams
:art: Diagram as Code for prototyping cloud system architectures