/valentine

A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema marching methods.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Schema matcher tests based on Valentine

For my master's thesis on schema integration, I did some tests on schema matching algorithms. These are based on Valentine (https://github.com/delftdata/valentine).

Three subfolders with data needed:

Valentine was extended with JaccardLevenshtein matcher based on column names only (https://github.com/Anphisa/valentine/tree/master/valentine/algorithms/jaccard_levenshtein_colnames_only).