The-Sad-Zewalian's Stars
Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
eriklindernoren/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
aleju/imgaug
Image augmentation for machine learning experiments.
SpotX-Official/SpotX
SpotX patcher used for patching the desktop version of Spotify
KwaiVGI/LivePortrait
Bring portraits to life!
abetlen/llama-cpp-python
Python bindings for llama.cpp
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
cappuccino/cappuccino
Web Application Framework in JavaScript and Objective-J
omry/omegaconf
Flexible Python configuration system. The last one you will ever need.
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
OFA-Sys/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
barrust/pyspellchecker
Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/
ehabets/RIR-Generator
Generating room impulse responses
ruizhecao96/CMGAN
Conformer-based Metric GAN for speech enhancement
Enny1991/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
HL-hanlin/VideoDirectorGPT
official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (COLM 2024)
giulioz/laser-scanning
📷🔦💭 A 3D Scanner using Laser Structured Light, written in Python using OpenCV and NumPy.
NickWilkinson37/voxseg
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
hellohaptik/spello
Fast and accurate spell correction library
seorim0/DNN-based-Speech-Enhancement-in-the-frequency-domain
DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping method.
ranvijaykumar/typo
A python package to simulate typographical errors.
Okrio/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
piksels-and-lines-orchestra/gimp
Modified version of GIMP to act in the Piksels & Lines Orchestra
BUTSpeechFIT/MultiSV
MultiSV: scripts for data preparation
furkanarius/Multichannel-Speech-Enhancement-with-Deep-Neural-Networks
This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset generation to the model training
rafaelgreca/voxseg-pytorch
The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.
Valdiolus/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.