catalyst-cooperative/ccai-entity-matching
An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
Jupyter NotebookMIT
Issues
- 0
Integrate experiment tracking with `mlflow`.
#116 opened by zschira - 0
Integrate splink matching model into pipeline
#32 opened by katie-lamb - 0
Generalize Entity Matching Framework
#106 opened by zschira - 0
word2vec + Splink + Equal Weights
#36 opened by zaneselvans - 0
- 0
FERC-EIA Record Linkage Experiments
#34 opened by zaneselvans - 0
Evaluate TF-IDF Attribute Embedding
#33 opened by zaneselvans - 2
Evaluate and compare current performance of models
#110 opened by zschira - 0
Apply PUDL entity matching framework to FERC-EIA
#108 opened by zschira - 0
Generalize blocking step to take two arbitrary dataframes and produce candidate sets
#107 opened by zschira - 0
- 0
Create KNN Cosine Similarity Function
#48 opened by katie-lamb - 0
TF-IDF + Splink + Equal Weights
#35 opened by zaneselvans - 0
Get CI set up to run notebooks
#39 opened by zaneselvans - 0
- 2
Overview Of Experiments and Progress Checklist
#31 opened by katie-lamb