bicolino34's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
dteviot/WebToEpub
A simple Chrome (and Firefox) Extension that converts Web Novels (and other web pages) into an EPUB.
fonol/anki-search-inside-add-card
An add-on providing full-text-search and PDF reading functionality to Anki's Add card dialog
TeamPiped/Piped
An alternative privacy-friendly YouTube frontend which is efficient by design.
jbaiter/archiscribe
Web application for transcribing OCR ground truth from Archive.org
WilsonNet/japanase-youtube-channels-with-japanese-subtitles
A list of Japanese Youtube channels with japanese subtitles. So you can easily mine Anki cards with a tool like MPV.
lmcinnes/umap
Uniform Manifold Approximation and Projection
manisandro/gImageReader
A Gtk/Qt front-end to tesseract-ocr.
cloudonlanapps/hocr_editor
An text editor that loads from HOCR XML, allows multiple operation on the text in it.
ethereal-developers/OpenScan
A privacy-friendly Document Scanner app
killergerbah/asbplayer
Browser-based media player and Chrome extension for subtitle sentence mining
modernmt/modernmt
Neural Adaptive Machine Translation that adapts to context and learns from corrections.
knowclip/knowclip
Quickly make Anki flashcards from video and audio files, with handy features like silence detection and subtitles integration.
hermitdave/FrequencyWords
Repository for Frequency Word List Generator and processed files
anuraghazra/github-readme-stats
:zap: Dynamically generated stats for your github readmes
not-implemented/hocr-proofreader
Web based JavaScript GUI library for proofreading/editing hOCR
LibreTranslate/LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
FreeLanguageTools/vocabsieve
Simple sentence mining tool for language learning
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
alex73/Software-Korpus
Corpus Linguistics Software
openspeech-team/openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
UB-Mannheim/ocr-gt-tools
Ergonomic line-by-line transcription of scanned text.
espy/transcribe
A simple audio transcription helper. No signup, no logs, no tracking.
espnet/espnet
End-to-End Speech Processing Toolkit
0xbad1d3a5/Kaku
画 - Japanese OCR Dictionary
omegat-org/omegat
Official OmegaT development repository
searx/searx
Privacy-respecting metasearch engine
egorsmkv/speech-recognition-uk
🇺🇦 Speech Recognition & Synthesis for Ukrainian
networkx/networkx
Network Analysis in Python