Pinned Repositories
brittstenekes.github.io
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
splink_scalaudfs
Data linking functions in Scala, to be used in a Pyspark environment.
lenroc14's Repositories
lenroc14/brittstenekes.github.io
lenroc14/splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
lenroc14/splink_scalaudfs
Data linking functions in Scala, to be used in a Pyspark environment.