cross-validation question
micrazy opened this issue · 2 comments
micrazy commented
Why when using k-fold cross-validation with cleanlab, the results of each fold are spliced and then analyzed instead of each fold analyzed separately?
jwmueller commented
You should not analyze the results of each fold separately with cleanlab. Instead you should concatenate the Out-of-Sample predictions from each fold into one big array whose size = size of your entire dataset, and then pass this entire array into cleanlab methods.
See the conceptual tutorial here: https://docs.cleanlab.ai/stable/tutorials/pred_probs_cross_val.html
jwmueller commented
closing due to lack of activity, feel free to reopen this issue if you still have questions!