WeidiXie/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
Python
Issues
- 0
How to use our own dataset?
#73 opened by SanaullahOfficial - 3
Error while attempting to train the model
#67 opened by aserzhqa54zwsz - 2
How to train the model on the Voxceleb1 dataset?
#22 opened by YaKaiLi - 2
The training set contains the validation set
#38 opened by tae-jun - 12
- 0
issues in training Voxceleb1 model
#69 opened by Fan0fan - 5
Error in loading the pretrained weights
#34 opened by aliakbar09a - 1
Learning rate schedule problem
#66 opened by suyuzhang - 5
Librosa version requirements
#59 opened by go2chayan - 6
about loss and acc
#61 opened by hermanseu - 1
Preprocessing function - WAV extending
#60 opened by celpas - 3
Non-trainable params: 19,136
#54 opened by llearner - 1
TypeError: __init__() missing 2 required positional arguments: 'mode' and 'k_centers'
#58 opened by preniqivjosa - 4
- 0
- 1
- 3
Validation and testing meta data
#52 opened by hmen97 - 0
Got core dumped when training with GPU option
#51 opened by huynguyen82 - 1
Some doubts about pre-trained model weights
#50 opened by lyu-joe - 0
TypeError: 'int' object is not callable
#47 opened by IvanEvan - 1
questions about downloading pre-trained model
#49 opened by lyu-joe - 1
ValueError: Layer #125 (named "gvlad_center_assignment"), weight <tf.Variable 'gvlad_center_assignment/kernel:0' shape=(7, 1, 512, 12) dtype=float32_ref> has shape (7, 1, 512, 12), but the saved weight has shape (10, 512, 7, 1).
#46 opened by IvanEvan - 1
- 1
Answer about hardware
#45 opened - 2
Training accuracy oscillates between 51-52% early while loss decreases slowly.
#35 opened by alamnasim - 1
using tripletLoss function
#42 opened by shatealaboxiaowang - 2
- 4
- 2
Section "Probing verification based on length" -- about verification pairs 25,020
#40 opened by llearner - 8
Training is quite slow.
#33 opened by alamnasim - 6
how to increase gpu usage?
#39 opened by ArtemisZGL - 1
- 3
problem with testing my own trained model
#37 opened by aSafarpoor - 1
- 2
- 1
error in data generation MP 2
#29 opened by gogyzzz - 2
- 3
- 1
is there any qualitative analysis of the network learn a good embedding feature, any visualize tools for this ?
#26 opened by mmxuan18 - 5
Why extend audio?
#28 opened by happypanda5 - 4
Question on train loss
#25 opened by seungwonpark - 1
will you share the pretrained model which used amsoftmax, i trained myself use amsoftmax but very hard to convergence
#23 opened by mmxuan18 - 1
Duration of audio file for the trained model?
#21 opened by happypanda5 - 8
- 2
- 1
some question about the thin-ResNet?
#19 opened by mmxuan18 - 3
error in data generation MP
#17 opened by rohithkodali - 11
Training under Windows
#15 opened by AntonBiryukovUofC - 3
toolkits.initialize_GPU(args) error
#16 opened by WrathOfGrapes - 5
why the wav preprocess not directly use librosa.feature.melspectrogram? what's the difference?
#14 opened by mmxuan18