awslabs/deequ

[FEATURE] Support Wilson Score Interval for RetainCompletenessRule

zeotuan opened this issue · 0 comments

Is your feature request related to a problem? Please describe.
Currently, RetainCompletenessRule uses the Wald interval (the normal approximation interval) to compute its confidence interval. This method performs poorly when the estimated proportion p is close to 0 or 1, which is already noted as a TODO in the code itself.
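
For reference, this is the textbook form of the Wald interval, where $\hat{p}$ is the observed completeness ratio, $n$ is the number of rows, and $z$ is the normal quantile for the chosen confidence level. The symbols are mine and not taken from the deequ source:

$$
\hat{p} \;\pm\; z \sqrt{\frac{\hat{p}\,(1 - \hat{p})}{n}}
$$

When $\hat{p} = 0$ or $\hat{p} = 1$ the square root is zero, so the interval collapses to a single point regardless of $n$. For $\hat{p}$ near those extremes the bounds can also fall outside $[0, 1]$.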

Describe the solution you'd like
Either switch to the Wilson score interval, or introduce a strategy pattern so that we can choose between Wilson, Wald, and any other interval calculation techniques added in the future.
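
A rough sketch of what the strategy approach might look like. All names here (`ConfidenceInterval`, `IntervalStrategy`, `WaldInterval`, `WilsonScoreInterval`) are hypothetical and not part of deequ's current API, and the default `z = 1.96` (about 95% confidence) is an assumption. The Wilson formula is the standard textbook one.

```scala
// Hypothetical types for illustration only; not deequ's actual API.
case class ConfidenceInterval(lower: Double, upper: Double)

trait IntervalStrategy {
  // pHat: observed completeness ratio in [0, 1]
  // n:    number of rows the ratio was computed over
  // z:    normal quantile for the confidence level (1.96 is roughly 95%)
  def interval(pHat: Double, n: Long, z: Double = 1.96): ConfidenceInterval
}

// Wald (normal approximation) interval: pHat +/- z * sqrt(pHat * (1 - pHat) / n)
object WaldInterval extends IntervalStrategy {
  def interval(pHat: Double, n: Long, z: Double): ConfidenceInterval = {
    require(n > 0 && pHat >= 0.0 && pHat <= 1.0, "need n > 0 and pHat in [0, 1]")
    val margin = z * math.sqrt(pHat * (1 - pHat) / n)
    ConfidenceInterval(pHat - margin, pHat + margin)
  }
}

// Wilson score interval: recentred and rescaled so it stays informative near 0 and 1
object WilsonScoreInterval extends IntervalStrategy {
  def interval(pHat: Double, n: Long, z: Double): ConfidenceInterval = {
    require(n > 0 && pHat >= 0.0 && pHat <= 1.0, "need n > 0 and pHat in [0, 1]")
    val z2 = z * z
    val denom = 1 + z2 / n
    val center = (pHat + z2 / (2.0 * n)) / denom
    val margin = (z / denom) * math.sqrt(pHat * (1 - pHat) / n + z2 / (4.0 * n * n))
    ConfidenceInterval(center - margin, center + margin)
  }
}

// Example: a fully complete column over 100 rows
val wald = WaldInterval.interval(1.0, 100)
val wilson = WilsonScoreInterval.interval(1.0, 100)
```

With a completeness of 1.0 the Wald interval has zero width, while the Wilson interval still has a lower bound below 1.0. RetainCompletenessRule could then take an `IntervalStrategy` parameter, with the default chosen by the maintainers.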

I am happy to help with this implementation.