Sampling scheme for continuous auxiliary variables clarification
mariakesa opened this issue · 3 comments
Hello,
Thank you so much for your work! I am definitely having fun with CEBRA:-)
I just wanted to verify whether I understood the sampling scheme for shaping embeddings using an auxiliary variable. I read the supplement to the paper and looked at the code. Do I understand correctly that if I specify a continuous auxiliary variable, the model will create a distribution of delta in values between neighboring time points and then use this distribution to find the closest sample to the reference sample with this delta as a positive sample (the model still only learns from the main data ie the data that I would apply the time only algorithm to, except now I get the positive samples using the auxiliary variable)? Are the negative samples still sampled uniformly as in time only learning?
Thanks again!
thanks @mariakesa for the question; just a first small note to say we have Discussions open for questions: https://github.com/AdaptiveMotorControlLab/CEBRA/discussions , so in the future that would be an ideal place to post questions and leave issues to bug reports
- and yes, the negative is still sample uniformly, unless you change that as well.
I apologize, I didn't notice the discussions forum.
Thank you so much! I'm glad I understood correctly!
To this point,
Do I understand correctly that if I specify a continuous auxiliary variable, the model will create a distribution of delta in values between neighboring time points and then use this distribution to find the closest sample to the reference sample with this delta as a positive sample
That is correct. What you describe is implemented here:
CEBRA/cebra/distributions/continuous.py
Lines 231 to 242 in 6409239
However, there are other possibilities, like sampling around the reference samples, as done in this case:
CEBRA/cebra/distributions/continuous.py
Lines 271 to 285 in 6409239
(the model still only learns from the main data ie the data that I would apply the time only algorithm to, except now I get the positive samples using the auxiliary variable)?
Yes, it simply "re-indexes" the dataset you provide to fit()