pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Jupyter Notebook · MIT
Issues
- 8
huggingface_hub.errors.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/home/ubuntu/whisperx/pyannote-offline/config.yaml'. Use `repo_type` argument if needed.
#1800 opened by lucasmirachi - 4
Diarization pipeline fails at end of audio file (RuntimeError: Sizes of tensors must match except in dimension 0.)
#1752 opened by ccmilne - 0
3.3.2 or 3.3.1?!
#1852 opened by FeatureSpitter - 0
- 1
Issue in distance threshold parameter
#1746 opened by manish-kumar-iisc - 1
Diarization Causes High System Memory Usage
#1819 opened by agorman - 2
- 1
AttributeError: partially initialized module 'torchaudio' has no attribute 'lib' (most likely due to a circular import)
#1740 opened by behroozazarkhalili - 2
The timeline is wrong
#1737 opened by Lixi20 - 1
- 0
- 0
Warning suppression
#1832 opened by antoinelaurent - 5
When fine tuning pretrained segmentation model using pyannote.audio==3.1.1 on well-defined and registered custom finance dataset, it shows the following error message 'PicklingError: Can't pickle <class 'pyannote.database.registry.Finance'>: attribute lookup Finance on pyannote.database.registry failed'
#1734 opened by ZhouFang928 - 6
Why is pyannote not using my GPU or CPU? So slow too.
#1702 opened by CrackerHax - 0
How to report a security issue responsibly?
#1828 opened by zpbrent - 1
Can't load pyannote
#1825 opened by CrackerHax - 2
Hi, I'm currently trying to use an updated wespeaker voice model like the one shown in the picture, but when I follow the file pyannote/audio/models/embedding/wespeaker/convert.py I can't adapt it. It shows the following error. How do I change it?
#1772 opened by LiLiWangzz - 3
`torchaudio.info.num_frames` can give wrong results so it can provide false exceptions
#1724 opened by grazder - 0
Using speech-separation-ami-1.0 results in distorted audio, much louder than the original
#1818 opened by danielsasso - 2
speaker-diarization-3.0 won't load, ONNX error
#1814 opened by sedol1339 - 2
Running speaker-diarization-3.1 leads to GPU failure
#1813 opened by sedol1339 - 3
Possible to use reference speaker embeddings in Pyannote diarization pipeline?
#1750 opened by Arche151 - 0
Separation stitching is wrong when two local speakers are assigned to the same cluster
#1811 opened by hbredin - 2
Wavlm modules are always in `eval` mode when training `ToTaToNet` and `SSeRiouSS` models
#1793 opened by clement-pages - 1
- 2
What is the purpose of the Resegmentation and AdaptiveVoiceActivityDetection Pipeline?
#1700 opened by asusdisciple - 4
DER Calculation on the Aishell-4 Dataset Using pyannote.audio Model Returns NaN
#1790 opened by sipercai - 2
[Speech Separation/ValueError] v.3.3 - "speech_separation.py", line 648, in apply np.concatenate(remaining_zeros) ValueError: need at least one array to concatenate
#1747 opened by ai-nikolai - 6
Saving speech separation results to disk throws IndexError: size of diarization.labels() and shape of sources.data are not the same
#1735 opened by yinyao - 1
- 3
- 3
- 5
[Unexpected Performance Drop] Using 44.1K sample_rate vs. default 16K leads to better performance in `pyannote/speaker-diarization-3.1`
#1755 opened by ai-nikolai - 6
VAD model
#1754 opened by adriondragon - 6
outputs of separation module is clipping
#1729 opened by faroit - 2
Speech Separation cracking the volume too high
#1770 opened by ajtopper - 1
- 5
Dependency errors while running the evaluation notebook for the speech separation uploaded recently
#1767 opened by BNarayanaReddy - 1
High CPU usage during embeddings step of diarization
#1753 opened by henriklied - 2
numpy.NAN crash
#1758 opened by KiARC - 2
- 3
- 4
- 2
After fine-tuning with the MagicData-RAMC dataset, I tested on the Aishell-4 dataset and DER increased.
#1738 opened by Arnold134777 - 3
- 2
3.3 dependencies
#1727 opened by faroit - 1
- 2
How to map the transcribed text with their respective speakers in speaker diarization?
#1718 opened by ThiruRJST - 2
- 4