TrangDuLam's Stars
KindXiaoming/pykan
Kolmogorov Arnold Networks
shap/shap
A game theoretic approach to explain the output of any machine learning model.
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
ExpTechTW/TREM-tauri
Taiwan Real-time Earthquake Monitoring(臺灣即時地震監測)
analyticsinmotion/werpy
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
rhasspy/gruut-ipa
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
mlouielu/twstock
台灣股市股票價格擷取 (含即時股票資訊) - Taiwan Stock Opendata with realtime
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
qiangmzsx/Software-Engineering-at-Google
《Software Engineering at Google》的中英文对译版本
viraptor/reverse-interview
Questions to ask the company during your interview
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
sktime/sktime
A unified framework for machine learning with time series
covarep/covarep
A Cooperative Voice Analysis Repository for Speech Technologies
YuanGongND/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
ardaillon/FCN-f0
Fully-Convolutional Network for Pitch Estimation of Speech Signals
dpilger26/NumCpp
C++ implementation of the Python Numpy library
cycfi/q
C++ Library for Audio Digital Signal Processing
fighting41love/zhvoice
Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
max32002/tixcraft_bot
MaxBot open source code bot
TrangDuLam/audioviz
A Python-based music information retrieval visualization tools interfacing with Google Colab.
aishell-foundation/DaCiDian
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
guanlongzhao/ppg-gmm
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
hhguo/EA-SVC
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
ASR-project/Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
ZhengkunTian/OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition