shahruk10
Electronics Engineer with a fascination about space, deep learning and robotics !
Bangladesh
Pinned Repositories
bfcom2018
BFCom2018 Energy Forecasting
DeepSpeech
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
go-sctk
Go CLI wrapper around SCTK binaries for word error rate evaluation and error analysis for ASR systems.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kaldi-tflite
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
nixshells
Frequently used nix shells for Python, CUDA and more.
opencv_contrib
Repository for OpenCV's extra modules
PAPRnet
A Peak to Average Power Ratio (PAPR) Reduction method for OFDM Systems using neural networks using the encoder-decoder approach. [Course project for EEE 6207 Broadband Wireless Communication MSc 2019]
tensorflow
An Open Source Machine Learning Framework for Everyone
tensorflow
An Open Source Machine Learning Framework for Everyone
shahruk10's Repositories
shahruk10/PAPRnet
A Peak to Average Power Ratio (PAPR) Reduction method for OFDM Systems using neural networks using the encoder-decoder approach. [Course project for EEE 6207 Broadband Wireless Communication MSc 2019]
shahruk10/kaldi-tflite
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
shahruk10/nixshells
Frequently used nix shells for Python, CUDA and more.
shahruk10/go-sctk
Go CLI wrapper around SCTK binaries for word error rate evaluation and error analysis for ASR systems.
shahruk10/bfcom2018
BFCom2018 Energy Forecasting
shahruk10/DeepSpeech
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
shahruk10/opencv_contrib
Repository for OpenCV's extra modules
shahruk10/tensorflow
An Open Source Machine Learning Framework for Everyone
shahruk10/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)
shahruk10/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
shahruk10/aoc-2023
Trying out Zig for the first time, solving Advent of Code 2023 challenges
shahruk10/brs-img-basic
Assorted files for BRS Workshop on Basic Image Processing with OpenCV
shahruk10/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion (tweaks focused on training faces)
shahruk10/gdown
Download a large file from Google Drive (curl/wget fails because of the security notice).
shahruk10/grpc-websocket-proxy
A proxy to transparently upgrade grpc-gateway streaming endpoints to use websockets
shahruk10/keras
Deep Learning for humans
shahruk10/keras-vis
Neural network visualization toolkit for keras
shahruk10/nixos-vscode-server
Visual Studio Code Server support in NixOS
shahruk10/nixos_cfg
shahruk10/nixpkgs
Nix Packages collection
shahruk10/obs-backgroundremoval
An OBS plugin for removing background in portrait images (video), making it easy to replace the background when screen recording.
shahruk10/PINNs
Physics Informed Deep Learning: Data-driven Solutions and Discovery of Nonlinear Partial Differential Equations
shahruk10/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
shahruk10/SCTK
shahruk10/sklearn-porter
Transpile trained scikit-learn estimators to C, Java, JavaScript and others.
shahruk10/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
shahruk10/StreamingSpeakerDiarization
Official open source implementation of the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
shahruk10/super-res
Keras scripts for training DNNs used for super-resolution imaging
shahruk10/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
shahruk10/wav
golang .wav reader and writer