pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python, BSD-2-Clause

Issues

[RelEng] torchaudio macOS nightly promotion failed since 2024-11-27
#3858 opened 20 days ago by hvaara
2
Unsupported subtype: PCM_24
#3806 opened 6 months ago by nicobrb
1
ffmpeg 7
#3857 opened 24 days ago by lee101
0
`kaldi.fbank` does not work with non-contiguous input when `snip_edges=False`
#3856 opened a month ago by gau-nernst
0
Using MMS model with `star` token for batch size > 1
#3772 opened 8 months ago by huangruizhe
1
Can anyone provide a real-time pretrain model for Visual Speech Recognition?
#3852 opened a month ago by bernie-122
0
[RFC] Support non-GPU hardware-based video decoding and encoding
#3841 opened 2 months ago by cdzhan
2
Adopt aligner from "Huang et al., Less Peaky and More Accurate CTC Forced Alignment by Label Priors"
#3826 opened 4 months ago by dmitry-mli
4
Not building CUDA 12.6
#3835 opened 3 months ago by johnnynunez
1
How to train a real-time av-asr pretrain model
#3838 opened 3 months ago by Zhaninh
0
Ability to build manylinux2014 compliant wheels for other archs (ppc64le)
#3834 opened 3 months ago by mgiessing
0
torchaudio load opus failed
#3753 opened 10 months ago by Mddct
1
Prebuilt binaries of torch.audio for aarch64 cuda
#3827 opened 4 months ago by chulkilee
1
Ability to provide initial phase to Griffin-Lim
#3828 opened 4 months ago by aaron-dees
0
Torchaudio is not detecting FFmpeg
#3789 opened 7 months ago by ruliworst
7
StreamRead failing when Reading RTSP stream with CPU
#3798 opened 4 months ago by pedromoraesh
7
torchaudio.transforms.Resample causes Float Point Exception
#3825 opened 4 months ago by zhc7
0
The seek functionality of StreamReader on the video stream does not return the correct frame if the start_time_stamp of the video stream is nonzero.
#3824 opened 5 months ago by w238liu
0
StreamWriter doesn't correctly write audio chunks
#3823 opened 5 months ago by arch-user-france1
1
Termux patch for default APT version of audio - for relative file paths, tilde (~) expansion not working in filepath for torchaudio.load()
#3802 opened 5 months ago by Manamama
5
Loading Opus files from MLS dataset fails because of file metadata
#3821 opened 5 months ago by niemiaszek
0
transforms.MFCC results in NaN values on Jetson Orin Nano
#3822 opened 5 months ago by frmser
0
torchaudio.load not loading all the frames
#3762 opened 9 months ago by ashinkajay
1
Division by zero in loudness calculation
#3816 opened 5 months ago by DanTremonti
0
Division by zero in loudness calculation
#3815 opened 5 months ago by dhanvanth-pk-13760
0
Video reading: torchaudio.io.StreamReader seek method returns the first frame, regardless of the input start_timestep (on version 0.13.1)
#3813 opened 5 months ago by StolikTomer
0
Loading failure errors should indicate what was being loaded when error occurred
#3810 opened 5 months ago by pokepress
0
StreamReader.add_basic_video_stream drops last frame if `frame_rate` is specified
#3809 opened 5 months ago by tyler-rt
0
Differentiable filtering using a cascade of second order IIR filters
#3808 opened 6 months ago by SuperKogito
0
NV12/YUV->RGB colour accuracy and CUDA
#3799 opened 7 months ago by gtebbutt
5
`torchaudio.functional.lfilter` returns `nan` when processing sub-array but not for the whole input array.
#3807 opened 6 months ago by SuperKogito
4
frame offset + num frames to utilize http range header
#3783 opened 8 months ago by mogwai
1
StreamReader seek method seeks to wrong frame for opus format
#3770 opened 9 months ago by ashinkajay
2
MAC M3 audio backend no longer appearing
#3785 opened 8 months ago by tval2
3
Get cudaErrorIllegalAddress when running ctc decoder on H100
#3754 opened 10 months ago by Ray-Leung
2
RTSP with StreamReader
#3797 opened 7 months ago by pedromoraesh
0
How to use my finetuned version of wav2vec2 for forced alignment as shown in example/
#3796 opened 7 months ago by omerarshad
0
Packet passthrough support
#3795 opened 7 months ago by materight
0
Real time synthesis with oscillator_bank
#3788 opened 7 months ago by peastman
0
Failed to open output "-" (Invalid argument).
#3784 opened 8 months ago by liuliujiang
0
Can not load commonvoice dataset on windows
#3781 opened 8 months ago by jacobjennings
1
Support for 10bit / 12bit encoding (e.g. yuv420p10le) in StreamWriter
#3776 opened 8 months ago by tvercaut
0
Cannot load audio from pathlib.Path
#3775 opened 8 months ago by roedoejet
1
DEVICE AV-ASR WITH EMFORMER RNN-T tutorial : avsr not found
#3773 opened 8 months ago by sfcgta4794
0
cherry-picks for 2.3
#3768 opened 9 months ago by ahmadsharif1
1
Windows CI is broken
#3767 opened 9 months ago by ahmadsharif1
0
Cannot create the MFCC of a tensor that is already on a GPU
#3765 opened 9 months ago by Greenscreen23
3
Training Hangs During HuBERT Pretraining with DDP When Loss Becomes Invalid
#3749 opened 10 months ago by jojonki
2
Installation with pip git+ is broken
#3752 opened 10 months ago by gdagil
0
I have some questions about RNNT loss.
#3750 opened 10 months ago by girlsending0
6