1. etl managerA python package to create a database on the platform using our moj data warehousing framework
3. splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
4. splink demosInteractive notebooks containing demonstration code of the splink library