/css

PyTorch implementation of Continuous Speech Separation

Primary LanguagePythonMIT LicenseMIT

css

PyTorch implementation of Continuous Speech Separation

This repository is under development.

This repository contains training and inference pipelines for continuous speech separation on long recordings, similar to that in the LibriCSS paper.

Salient features:

  • On-the-fly training data creation using Lhotse.
  • Conformer and BLSTM encoders.
  • Multi-node training is based on Matthew Wiesner's nnet_pytorch. It is a naive form of multi-node training suitable for clusters where DDP may be slowed down due to an I/O bottleneck.