Pinned Repositories
-SVM-
CNN_SVM
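2017.9.20: A rough implementation, modeled on wepon's code, of a convolutional neural network and a support vector machine for image classification in the project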
diagnosis_demo
A small Qt demo project for beginners, together with many useful Qt resources I have uploaded
HOG
This is used to modify the HOG parameters
interview_kws
A summary of interview experiences on voice wake-up (keyword spotting). Everyone is welcome to push new content
interview_ns
Some interview questions on audio noise reduction. Additions are welcome
Keras_CNN_image_classification
An image classification model based on Keras
mydata_idx
myLeetCode
pruning
fmbao's Repositories
fmbao/interview_kws
A summary of interview experiences on voice wake-up (keyword spotting). Everyone is welcome to push new content
fmbao/pruning
fmbao/interview_ns
Some interview questions on audio noise reduction. Additions are welcome
fmbao/myLeetCode
fmbao/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
fmbao/3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
fmbao/aimoneyhunter
A large collection of AI side-hustle ideas, showing how to use AI for side projects and earn extra income.
fmbao/aloha
fmbao/asv-subtools
Open Source Tools for Speaker Recognition
fmbao/Attention_Backend_for_ASV
Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances
fmbao/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
fmbao/av_hubert
A self-supervised learning framework for audio-visual speech
fmbao/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
fmbao/Causal-U-Net
Unofficial PyTorch implementation of "A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement"
fmbao/crawler
Example crawler using multithreading and multiprocessing
fmbao/DecryptPrompt
A summary of Prompt and LLM papers, open-source data and models, and AIGC applications
fmbao/DeepFilterNet
Noise suppression using deep filtering
fmbao/easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book"). Read online at: https://datawhalechina.github.io/easy-rl/
fmbao/FCH-TTS
A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far).
fmbao/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
fmbao/pykaldi
A Python wrapper for Kaldi
fmbao/qtrader
A Light Event-Driven Algorithmic Trading Engine
fmbao/Quantization
fmbao/SiamTrackers
(2020-2022) The PyTorch version of SiamFC, SiamRPN, DaSiamRPN, UpdateNet, SiamDW, SiamRPN++, SiamMask, SiamFC++, SiamCAR, SiamBAN, Ocean, LightTrack, TrTr, NanoTrack; visual object tracking based on deep learning
fmbao/SpeechAlgorithms
Speech Algorithms Collections
fmbao/sudo_rm_rf
fmbao/ttskit
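Text-to-speech toolkit. An easy-to-use Chinese speech synthesis toolbox that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.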
fmbao/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
fmbao/VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow
fmbao/w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053