Pinned Repositories
febrl_python3_fork
A Python package designed to allow health, biomedical and other researchers to clean (standardise) and deduplicate or link data sets of all sizes faster, with less effort and with improved quality. Forked to add Python 3 support.
adfdemo
duplicate-data-generator
A Python script for generating duplicate data to test the performance of record linkage and master data management systems.
xdm-helm-chart
Helm Chart for Semarchy xDM
xdm-tutorials
Semarchy xDM tutorials
thomaswyrick's Repositories
thomaswyrick/duplicate-data-generator
A Python script for generating duplicate data to test the performance of record linkage and master data management systems.
thomaswyrick/febrl
A Python package designed to allow health, biomedical and other researchers to clean (standardise) and deduplicate or link data sets of all sizes faster, with less effort and with improved quality.
thomaswyrick/xdm-helm-chart
Helm Chart for Semarchy xDM
thomaswyrick/adfdemo
thomaswyrick/django-markdown-editor
Awesome Django Markdown Editor, with support for Bootstrap & Semantic-UI
thomaswyrick/splink
Implementation in Apache Spark of the EM algorithm to estimate parameters of Fellegi-Sunter's canonical model of record linkage.
thomaswyrick/xdm-tutorials
Semarchy xDM tutorials