ucl-pond/pySuStaIn

Parallel CV doesn't work (aka "Why do all my CV jobs run for fold 0 only???")

noxtoby opened this issue · 1 comments

I was running cross-validation in parallel on a cluster using cross_validate_sustain_model(), with the select_fold argument set to the desired CV fold for each compute job.

I noticed that all 10 jobs were returning results for fold 0 only.

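The culprit is line 276, where the loop iterates over range(Nfolds) (where Nfolds=len(select_fold)) rather than over the select_fold array itself. When each job selects a single fold, Nfolds is 1, so the loop index is always 0 regardless of which fold was requested.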
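To illustrate the failure mode, here is a minimal sketch. It is not the pySuStaIn source: the helper names `folds_visited_buggy` and `folds_visited_fixed` are hypothetical, and the real function does far more than list fold indices. It only contrasts looping over range(len(select_fold)) with looping over select_fold directly.

```python
def _as_list(select_fold):
    """Accept a single fold index or a sequence of indices (assumed input forms)."""
    if isinstance(select_fold, int):
        return [select_fold]
    return list(select_fold)


def folds_visited_buggy(select_fold):
    """Mimic the reported loop: iterate over positions, not the requested folds."""
    select_fold = _as_list(select_fold)
    Nfolds = len(select_fold)
    return [fold for fold in range(Nfolds)]


def folds_visited_fixed(select_fold):
    """Iterate over the requested fold indices themselves."""
    select_fold = _as_list(select_fold)
    return [fold for fold in select_fold]


# A cluster job that asks for fold 3 only:
print(folds_visited_buggy(3))  # [0]
print(folds_visited_fixed(3))  # [3]
```

With one fold per job, the first variant always visits fold 0, which matches the symptom reported here; the second visits the fold that was actually requested.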

I will send a PR with a fix shortly, but wanted to raise this in case others run into the same problem.

See PR38