/Deduplication-task-with-Python-and-snowflake

Two(or more) car sellers have similar names, such as "Car Angelo" and "Car Angelo s.r.o". The task is to calculate the percentage of name similarity by using Levenshtein distance, and then to analyze how many cars in stock have two sellers with similar name, also to determine whether these cars are "duplicates".

Primary LanguageRich Text Format

This repository is not active