Pinned Repositories
fixed-size-priority-queue
A priority queue with fixed size, based on STL. When the maximum size was reached, the element with the lowest priority would be removed automatically.
GigaSpeech
Large, modern dataset for speech recognition
gradio-chatgpt
A simple web-based interface for ChatGPT.
handle-solver
A solver for 汉兜 (Chinese idiom Wordle) puzzles
hf_transformers_custom_model_ced
🤗 Transformers custom models (CED)
hf_transformers_custom_model_dasheng
🤗 Transformers custom models (Dasheng)
kaldi-gop
Kaldi-based goodness of pronunciation (GOP)
speechocean762
A non-native English corpus for pronunciation scoring task
TushouCNN
A tensorflow-compatible,dependency-free lightweight C++ library for neural network inference.
yesno-example-for-undergraduates
jimbozhang's Repositories
jimbozhang/speechocean762
A non-native English corpus for pronunciation scoring task
jimbozhang/yesno-example-for-undergraduates
jimbozhang/gradio-chatgpt
A simple web-based interface for ChatGPT.
jimbozhang/fixed-size-priority-queue
A priority queue with fixed size, based on STL. When the maximum size was reached, the element with the lowest priority would be removed automatically.
jimbozhang/hf_transformers_custom_model_ced
🤗 Transformers custom models (CED)
jimbozhang/GigaSpeech
Large, modern dataset for speech recognition
jimbozhang/handle-solver
A solver for 汉兜 (Chinese idiom Wordle) puzzles
jimbozhang/k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
jimbozhang/lhotse
jimbozhang/snowfall
jimbozhang/xares
X-ARES:eXtensive Audio Representation and Evaluation Suite
jimbozhang/xares-template
Template for creating audio encoders compatible with X-ARES
jimbozhang/hf_transformers_custom_model_dasheng
🤗 Transformers custom models (Dasheng)
jimbozhang/PySpeechColab
A library of speech gadgets.
jimbozhang/aes-encrypt-decrypt
jimbozhang/busygpu
jimbozhang/Dasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
jimbozhang/HEAR2021_EfficientLatent
Submission to the HEAR2021 Challenge
jimbozhang/HEAR_CED
Hear evaluation for CED models.
jimbozhang/icefall
jimbozhang/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
jimbozhang/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
jimbozhang/ml-clap
jimbozhang/moshi
jimbozhang/nbterm
Jupyter Notebooks in the terminal.
jimbozhang/openslr
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
jimbozhang/PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
jimbozhang/refactored-fortnight
jimbozhang/SAT
Streaming Audiotransformers for online Audio tagging
jimbozhang/transformers-ced