thomeou

I am a PhD student at NTU, Singapore. My research areas are deep learning, audio signal processing, microphone array, and real-time processing.

Nanyang Technological UniversitySingapore

thomeou's Stars

huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python32.4k 313 9324.8k
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Language:Python30.3k 626 1798.1k
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Language:Python28.5k 250 7.2k3.4k
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language:Python20.3k 257 722.5k
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language:Python11.5k 103 821.9k
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
Language:Python9k 133 5831k
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python8.4k 154 5431.1k
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.3k 52 220423
experiencor/keras-yolo2
Easy training on custom dataset. Various backends (MobileNet and SqueezeNet) supported. A YOLO demo to detect raccoon run entirely in brower is accessible at https://git.io/vF7vI (not on Windows).
Language:Jupyter Notebook1.7k 67 437784
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1.5k 41 239432
bytedance/music_source_separation
Language:Python1.3k 27 64195
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB722 32 5151
qiuqiangkong/torchlibrosa
Language:Python477 6 648
edwardzhou130/PolarSeg
Implementation for PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation (CVPR 2020)
Language:Python379 14 6080
fakufaku/fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
Language:Python131 4 108
shervinea/pytorch-data-generator
Template for data generator with PyTorch
Language:Python131 6 073
yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
Language:Python107 2 426
yinkalario/EIN-SELD
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Language:Python67 4 1415
l3das/L3DAS22
Language:Python49 3 315
polarch/Array-Response-Simulator
A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.
Language:Matlab49 7 016
karnwatcharasupat/latte
Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models
Language:Python35 5 113
pquochuy/sasegan
Language:MATLAB21 4 26
pquochuy/dcase2020-seld
Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"
Language:Python16 3 02
nglehuy/sasegan
Self-Attention Generative Adversarial Network for Speech Enhancement using Tensorflow 2
Language:Python14 2 26
thomeou/audio_streaming_using_pyaudio
Python program for reading and writing multi-channel audio input stream
Language:Python5 1 00
mangeption/Final_year_project
Language:Jupyter Notebook2 1 01
mangeption/freesound_scrapy
Language:Python2 1 00
mangeption/dotnet-a
learning dotnet/c#
Language:C#1 1 00

thomeou

thomeou's Stars

huggingface/pytorch-image-models

yunjey/pytorch-tutorial

Lightning-AI/pytorch-lightning

karpathy/minGPT

xmu-xiaoma666/External-Attention-pytorch

jiaaro/pydub

facebookresearch/demucs

asteroid-team/asteroid

experiencor/keras-yolo2

LCAV/pyroomacoustics

bytedance/music_source_separation

nanahou/Awesome-Speech-Enhancement

qiuqiangkong/torchlibrosa

edwardzhou130/PolarSeg

fakufaku/fast_bss_eval

shervinea/pytorch-data-generator

yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization

yinkalario/EIN-SELD

l3das/L3DAS22

polarch/Array-Response-Simulator

karnwatcharasupat/latte

pquochuy/sasegan

pquochuy/dcase2020-seld

nglehuy/sasegan

thomeou/audio_streaming_using_pyaudio

mangeption/Final_year_project

mangeption/freesound_scrapy

mangeption/dotnet-a