miheerdew/cbce

Check the input for outliers


At the beginning of the method, raise a warning if the X or Y matrix has any columns containing values more than about 4 standard deviations from the column mean. Such outliers cause problems with the correlations.
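A minimal sketch of what such a check could look like, written in Python/NumPy purely for illustration (it is not the package's code). The names `outlier_columns`, `warn_on_outliers` and `n_sd` are hypothetical, and the 4 SD cutoff is just the rough figure mentioned above.

```python
import warnings
import numpy as np

def outlier_columns(M, n_sd=4.0):
    """Indices of columns with at least one value more than n_sd
    sample standard deviations from the column mean (hypothetical helper)."""
    M = np.asarray(M, dtype=float)
    mean = np.nanmean(M, axis=0)
    sd = np.nanstd(M, axis=0, ddof=1)
    with np.errstate(divide="ignore", invalid="ignore"):
        z = np.abs(M - mean) / sd
        # NaN z-scores (missing values, zero-variance columns) compare False
        flagged = np.any(z > n_sd, axis=0)
    return np.flatnonzero(flagged)

def warn_on_outliers(X, Y, n_sd=4.0):
    """Warn once per matrix if any of its columns contains an extreme value."""
    for name, M in (("X", X), ("Y", Y)):
        cols = outlier_columns(M, n_sd)
        if cols.size:
            warnings.warn(
                f"{name} has columns with values beyond {n_sd} SD "
                f"of the column mean: {cols.tolist()}"
            )
```

Note that the outlier itself inflates the SD, so a cutoff based on the median and MAD may be more sensitive; that is a design choice left open here.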

Actually, at least one of the columns needs to look approximately normal.

Remove columns with zero variance. How should NAs be handled?
Create a function called sanitize and let the user apply it to the data before passing the data to CBCE.
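One possible shape for `sanitize`, sketched in Python/NumPy for illustration only. Since the NA policy is still an open question, dropping any column that contains a missing value is an assumption here, exposed through the hypothetical `drop_na_columns` flag; returning the kept column indices is also an assumption.

```python
import numpy as np

def sanitize(M, drop_na_columns=True):
    """Return (cleaned matrix, indices of kept columns).

    Drops zero-variance columns; optionally (assumed policy) also drops
    every column that contains a missing value.
    """
    M = np.asarray(M, dtype=float)
    keep = np.ones(M.shape[1], dtype=bool)
    if drop_na_columns:
        keep &= ~np.isnan(M).any(axis=0)
    for j in range(M.shape[1]):
        observed = M[~np.isnan(M[:, j]), j]
        # A column is constant if all observed values are identical;
        # a column with no observed values is dropped as well.
        if observed.size == 0 or observed.max() == observed.min():
            keep[j] = False
    return M[:, keep], np.flatnonzero(keep)
```

The user would then call something like `X_clean, kept = sanitize(X)` before handing the data to CBCE, and use `kept` to map results back to the original columns.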

Added #11.

We also need to add a sanity_check() method that creates the various diagnostic plots (SD vs. MAD, SD vs. mean, outliers vs. skewness) and checks for missing data, imputing it if needed.
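As a rough sketch, the per-column quantities behind those plots could be computed as below (Python/NumPy for illustration; the plotting and any imputation are left out). The function name `column_diagnostics` is hypothetical, "MAD" is assumed to mean the median absolute deviation, and "outliers" is assumed to mean the count of values beyond `n_sd` standard deviations.

```python
import numpy as np

def column_diagnostics(M, n_sd=4.0):
    """Per-column summaries that the diagnostic plots could be drawn from."""
    M = np.asarray(M, dtype=float)
    mean = np.nanmean(M, axis=0)
    sd = np.nanstd(M, axis=0, ddof=1)
    median = np.nanmedian(M, axis=0)
    # Median absolute deviation (unscaled), ignoring missing values
    mad = np.nanmedian(np.abs(M - median), axis=0)
    centered = M - mean
    with np.errstate(divide="ignore", invalid="ignore"):
        # Moment-based skewness: third central moment / variance^(3/2)
        m2 = np.nanmean(centered ** 2, axis=0)
        m3 = np.nanmean(centered ** 3, axis=0)
        skewness = m3 / m2 ** 1.5
        n_outliers = (np.abs(centered) / sd > n_sd).sum(axis=0)
    return {
        "mean": mean,
        "sd": sd,
        "mad": mad,
        "skewness": skewness,
        "n_outliers": n_outliers,
        "n_missing": np.isnan(M).sum(axis=0),
    }
```

Each plot is then a scatter of one entry against another, for example `sd` against `mad`, with one point per column.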