Pinned Repositories
AndroidAudioWaveViewer
Plays audio and plots the waveform in an Android App
ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
Deep-Convolutional-ResNet-Audio-Duplicate-Detector
A Deep CNN for short audio (~15 seconds) Duplicate Detector is trained in a siamese style fashion.
Deep-Learning-Developer-Course-Projects
easy
This repository is the official implementation Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.
GraphBEAN
Interaction-Focused Anomaly Detection on Bipartite Node-and-Edge-Attributed Graphs
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Shazam-An-Industrial-Strength-Audio-Search-Algorithm-
Detecting segments belonging to which song in database, and return Nil if does not exist in a database.
Speech-Enhancement-Using-Time-Domain-Loss
This is an adaptation of the paper "Two-Stage Deep Learning for Noisy-Reverberant Speech Enhancement". It uses Time Domain Reconstruction (TDR) as an additional loss function to make use of clean phase in the enhancement process. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6519714/
speechbrain
A PyTorch-based Speech Toolkit
leonardltk's Repositories
leonardltk/Shazam-An-Industrial-Strength-Audio-Search-Algorithm-
Detecting segments belonging to which song in database, and return Nil if does not exist in a database.
leonardltk/Speech-Enhancement-Using-Time-Domain-Loss
This is an adaptation of the paper "Two-Stage Deep Learning for Noisy-Reverberant Speech Enhancement". It uses Time Domain Reconstruction (TDR) as an additional loss function to make use of clean phase in the enhancement process. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6519714/
leonardltk/Deep-Learning-Developer-Course-Projects
leonardltk/AndroidAudioWaveViewer
Plays audio and plots the waveform in an Android App
leonardltk/Deep-Convolutional-ResNet-Audio-Duplicate-Detector
A Deep CNN for short audio (~15 seconds) Duplicate Detector is trained in a siamese style fashion.
leonardltk/GraphBEAN
Interaction-Focused Anomaly Detection on Bipartite Node-and-Edge-Attributed Graphs
leonardltk/RAGcipe
Use LLM + Advanced RAG to get desired #ROTD (recipe of the day)
leonardltk/ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
leonardltk/Barbershop
Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)
leonardltk/Basic-Templates
To start off with.
leonardltk/Deep-Learning-Seminar-Series
leonardltk/easy
This repository is the official implementation Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.
leonardltk/lrpd-paper-code
Code for "LRPD: Large Replay Parallel Dataset" paper
leonardltk/Medium-Audio-Speech-Processing
Codes to reproduce the figures in my medium article.
leonardltk/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
leonardltk/speechbrain
A PyTorch-based Speech Toolkit
leonardltk/chatbot-retrieval
leonardltk/gen-ai-gradio
Building Generative AI Applications with Gradio
leonardltk/VisSpectAnd
Visualisation of Spectrogram in Android