Pinned Repositories
coho
Base libraries for C++ development
ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
kaldi-lstm
C++ implementation of LSTM (Long Short Term Memory), in Kaldi's nnet1 framework. Used for automatic speech recognition, possibly language modeling etc, the training can be switched between CPU and GPU(CUDA). This repo is now merged into official Kaldi codebase(Karel's setup), so this repo is no longer maintained, please check out the Kaldi project instead.
revisiting_the_evaluation_metric_of_asr
textnorm
words2num
Convert words to numbers
GigaSpeech
Large, modern dataset for speech recognition
Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
chinese_text_normalization
Chinese text normalization for speech processing
dophist's Repositories
dophist/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
dophist/audio-dataset
Audio Dataset for training CLAP and other models
dophist/C-Macro-Collections
Easy to use, modular, header only, macro based, generic and type-safe Data Structures in C
dophist/cc2dataset
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
dophist/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily search and find personal or work documents by asking questions in everyday language.
dophist/cJSON
Ultralightweight JSON parser in ANSI C
dophist/dbg-macro
A dbg(…) macro for C++
dophist/emhash
Fast and memory efficient c++ flat hash map/set
dophist/faster-whisper
Faster Whisper transcription with CTranslate2
dophist/GigaSpeech
Large, modern dataset for speech recognition
dophist/hamt
A hash array-mapped trie implementation in C
dophist/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
dophist/ipa-dict
Monolingual wordlists with pronunciation information in IPA
dophist/kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
dophist/llama-dl
High-speed download of LLaMA, Facebook's 65B parameter GPT model
dophist/lossless-cut
The swiss army knife of lossless video/audio editing
dophist/mediamtx
Ready-to-use RTSP / RTMP / LL-HLS / WebRTC server and proxy that allows to read, publish and proxy video and audio streams. Formerly known as rtsp-simple-server.
dophist/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
dophist/narr
dophist/parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
dophist/pykan
Kolmogorov Arnold Networks
dophist/pysubs2
A Python library for editing subtitle files
dophist/qoa
The “Quite OK Audio Format” for fast, lossy audio compression
dophist/re2
RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.
dophist/sectorc
A C Compiler that fits in the 512 byte boot sector of an x86 machine
dophist/tcmalloc
dophist/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
dophist/unblob
Extract files from any kind of container formats
dophist/visidata
A terminal spreadsheet multitool for discovering and arranging data
dophist/yt-dlp
A youtube-dl fork with additional features and fixes