Not able to find `detect_duplicate` in the documentation or in the code
ayulockin opened this issue · 2 comments
ayulockin commented
I am trying to deduplicate my dataset. The Atlas Duplicate Clustering section of the documentation says: "Make sure to enable duplicate clustering by setting detect_duplicate = True when building a map". I cannot find this argument in the Atlas API reference or in this GitHub repo.
The bottom-line question: can I deduplicate my dataset using Atlas?
AndriyMulyar commented
Hi, there is an error in that documentation. The field is `duplicate_detection` and should be set to `True`.
https://docs.nomic.ai/atlas_api.html#nomic.project.AtlasProject.add_text
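For illustration, here is a minimal sketch of passing the corrected field name. The helper names `build_map_options` and `build_deduplicated_map` are hypothetical, and it is an assumption that `atlas.map_text` accepts `duplicate_detection` as a keyword argument; please check the API reference linked above for the call that takes it in your installed version of the client.

```python
from typing import Any, Dict, List


def build_map_options(deduplicate: bool = True) -> Dict[str, Any]:
    """Return keyword arguments for building a map (hypothetical helper).

    The field is `duplicate_detection`, not `detect_duplicate` as the
    documentation section quoted in this issue states.
    """
    return {"duplicate_detection": deduplicate}


def build_deduplicated_map(data: List[Dict[str, Any]], indexed_field: str):
    """Build a text map with duplicate detection enabled (hypothetical helper).

    Assumption: `atlas.map_text` accepts `duplicate_detection`. Requires
    `pip install nomic` and a logged-in Atlas account, so the import is kept
    inside the function.
    """
    from nomic import atlas

    return atlas.map_text(
        data=data,
        indexed_field=indexed_field,
        **build_map_options(deduplicate=True),
    )
```

Calling `build_deduplicated_map(my_rows, "text")`, where `my_rows` is a list of dicts with a `"text"` key, would then request duplicate clustering when the map is built, if the assumption above holds.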
AndriyMulyar commented