OlivierBinette
Data Scientist @ American Institutes for Research Duke Statistical Science PhD
Duke UniversityDurham, NC
Pinned Repositories
assert
Lightweight validation tool for checking function arguments and data analysis scripts.
cache
Easily cache and retrieve computation results in R
CSVMeta
Lightweight csv read/write, keeping track of csv dialect and other metadata.
dgaFast
Multiple Systems Estimation Using Decomposable Graphical Models. This is an efficient re-implementation and extension of the dga R package.
er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
fingermatchR
Fingerprint matching tools based on NIST's mindtct and bozorth3 algorithms.
groupbyrule
Deduplicate data using fuzzy and deterministic matching rules.
StringCompare
Efficient String Comparison Functions and Fuzzy String Matching
VisTree
er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
OlivierBinette's Repositories
OlivierBinette/assert
Lightweight validation tool for checking function arguments and data analysis scripts.
OlivierBinette/BayesianRecordLinkage.jl
Perform Bayesian record linkage with a one-to-one matching assumption.
OlivierBinette/dgaFast
Multiple Systems Estimation Using Decomposable Graphical Models. This is an efficient re-implementation and extension of the dga R package.
OlivierBinette/earthquakes
3D data visualization with WebGL/three.js
OlivierBinette/pretty
Better baser plots in R.
OlivierBinette/actions
GitHub Actions for the R community
OlivierBinette/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
OlivierBinette/awesome-official-statistics-software
An awesome list of statistical software for creating and accessing official statistics
OlivierBinette/BOOM
A C++ library for Bayesian modeling, mainly through Markov chain Monte Carlo, but with a few other methods supported. BOOM = "Bayesian Object Oriented Modeling". It is also the sound your computer makes when it crashes.
OlivierBinette/cenzus
Multiple Systems Estimation
OlivierBinette/Cielographie
OlivierBinette/clevr
Clustering and Link Prediction Evaluation in R
OlivierBinette/compsci590s21.github.io
Blog for COMPSCI 590 - Spring 2021 - Applied Cryptography
OlivierBinette/cora
Cora data set for Entity Resolution
OlivierBinette/detectron2
Detectron2 for Document Layout Analysis
OlivierBinette/DGA
Multiple systems estimation (capture-recapture) using decomposable graphical models
OlivierBinette/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
OlivierBinette/fuzzyjoin
Join tables together on inexact matching
OlivierBinette/hocrjs
Working with hOCR in Javascript
OlivierBinette/Labs-STA-360
Slides and code for STA 360 Friday Labs (Duke University)
OlivierBinette/layout-parser
A Python Library for Document Layout Understanding
OlivierBinette/ocrfeeder
Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder
OlivierBinette/ocropy
Python-based tools for document analysis and OCR
OlivierBinette/reclin
Probabilistic Record Linkage in R
OlivierBinette/rmdActionBug
OlivierBinette/splinit
Periodic spline regression and closed curve reconstruction
OlivierBinette/stringdist
String distance functions for R
OlivierBinette/tesseract
Tesseract Open Source OCR Engine (main repository)
OlivierBinette/Three
2d game engine.
OlivierBinette/Uncertainty-Quantification-Workshop
Uncertainty quantification in a data-driven world: an overview of cross-validation, conformal prediction and the bootstrap