Images
boersmamarcel opened this issue · 1 comments
boersmamarcel commented
Hi aleksei,
I’ve tried some stuff with the plots and I think it has to do with the marker size. Resizing to 5 from 150 produced better plots already.
We can play around with that to see what the right parameters are.
AlexWorldD commented
I guess one of the possible problems was in the next line of code df = df.merge(embds, on="ID")
, actually it concated our found embedding to journal entries rather than to Business Process, as a result, the output column Emb included a lot of duplicates... Now I modified it to
# //////// Merge with GroundTruth \\\\\\\\\
if MODE == "SimulatedData":
d = add_ground_truth(embds)
if MODE == "RealData":
d = embds.merge(d.groupby("ID", as_index=False).agg({"GroundTruth": "first"}), on="ID")