Tool for comparing similarity of data contained in different attributes of a dataset Setup mkdir logs mkdir outputs