Using percentages instead of counts to compare distribution of two tables
borisRa opened this issue · 2 comments
borisRa commented
Hi,
How can I compare between train/test distributions ?
Using this code :
plot_diff([train_df[train_df.columns[~train_df.columns.isin(['Survived'])]], test_df],config={"diff.label": ["train_df", "test_df"]})
I am getting counts as is , I would like to compare percentage instead.
Similar to this plot for Age distribution :
Thanks !
Boris
jinglinpeng commented