Pinned Repositories
AI-Calc
AI enabled calculator. Don't ask.
aits
AI Text Search
AnkushMalaker
Aria_backend
asr-webservice
ASR Webservice API
bert
TensorFlow code and pre-trained models for BERT
HUSE
Tensorflow implimentation of HUSE: Hierarchical Universal Semantic Embeddings
Knowledgebase
Knowledgebase repository started in month of March 2021
pretrained-dcnn-attention-ser
Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
speech-emotion-recognition
Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.
AnkushMalaker's Repositories
AnkushMalaker/speech-emotion-recognition
Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.
AnkushMalaker/pretrained-dcnn-attention-ser
Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
AnkushMalaker/aits
AI Text Search
AnkushMalaker/AnkushMalaker
AnkushMalaker/asr-webservice
ASR Webservice API
AnkushMalaker/Knowledgebase
Knowledgebase repository started in month of March 2021
AnkushMalaker/core
:house_with_garden: Open source home automation that puts local control and privacy first.
AnkushMalaker/crispy
Crispy is a machine-learning algorithm to make video-games montages efficiently. It uses a neural network to detect highlights in the video-game frames
AnkushMalaker/easy-stt
Easy way to use one of transformer models to do inference locally. Can be done live through mic, or on local files. The first run needs to be online to download necessary models.
AnkushMalaker/excalidraw-recognition
Virtual whiteboard for sketching hand-drawn like diagrams
AnkushMalaker/homeassistant-satellite
Streaming audio satellite for Home Assistant
AnkushMalaker/icassp2021-mscnn-spu
Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)
AnkushMalaker/laughr
Recurrent neural network audio manipulation tool to mute "laugh track" audio segments found commonly in sitcoms.
AnkushMalaker/LiveEd-Smart-Teachers-App
LiveEd is a smart application meant for virtual teachers allowing them to teach from anywhere in the world. It allows teachers to draw in the air as they would using a whiteboard and also import images into the screen to show them to the viewers.
AnkushMalaker/nanoGPT-agent
The simplest, fastest repository for training/finetuning medium-sized GPTs.
AnkushMalaker/obsidian-aicommander-plugin-local
AnkushMalaker/openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
AnkushMalaker/pi-camera-stream-flask
[Docker] Create your own live camera stream using a Raspberry Pi 4
AnkushMalaker/piper-recording-studio
Local voice recording for creating Piper datasets
AnkushMalaker/python-audio-interfaces
Easy audio interfaces in python
AnkushMalaker/pytorch-attention
Attention mechanisms implemented with basic math and pytorch to gain an understanding. This is kept intentionally feature-poor so as to not be confusing.
AnkushMalaker/RustProjects
Rust Projects I made while learning from "The Book"
AnkushMalaker/TC-ResNet-PyTorch
AnkushMalaker/translate-with-whisper-live
dibs on implementing a live stream version
AnkushMalaker/whisper-autotune
AnkushMalaker/whisper-obsidian-plugin-local
Speech-to-text in Obsidian using Local Whisper
AnkushMalaker/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
AnkushMalaker/wyoming-addons
Docker builds for Home Assistant add-ons using Wyoming protocol
AnkushMalaker/wyoming-distil-whisper
Wyoming protocol server for distil-whisper speech to text system
AnkushMalaker/yolo_v1_pytorch
PyTorch implementation of YOLO-v1 including training