thomeou
I am a PhD student at NTU, Singapore. My research areas are deep learning, audio signal processing, microphone arrays, and real-time processing.
Nanyang Technological University, Singapore
thomeou's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
xmu-xiaoma666/External-Attention-pytorch
🍀 PyTorch implementation of various attention mechanisms, MLP, re-parameterization, and convolution modules, helpful for understanding the corresponding papers.⭐⭐⭐
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
experiencor/keras-yolo2
Easy training on custom datasets. Various backends (MobileNet and SqueezeNet) supported. A YOLO demo that detects raccoons and runs entirely in the browser is accessible at https://git.io/vF7vI (not on Windows).
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
bytedance/music_source_separation
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
qiuqiangkong/torchlibrosa
edwardzhou130/PolarSeg
Implementation for PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation (CVPR 2020)
fakufaku/fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
shervinea/pytorch-data-generator
Template for data generator with PyTorch
yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
yinkalario/EIN-SELD
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
l3das/L3DAS22
polarch/Array-Response-Simulator
A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.
karnwatcharasupat/latte
Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models
pquochuy/sasegan
pquochuy/dcase2020-seld
Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"
nglehuy/sasegan
Self-Attention Generative Adversarial Network for Speech Enhancement using TensorFlow 2
thomeou/audio_streaming_using_pyaudio
Python program for reading and writing multi-channel audio input stream
mangeption/Final_year_project
mangeption/freesound_scrapy
mangeption/dotnet-a
learning dotnet/c#