/bigcerts

Automated creation of arbitrarily large Iris certificate databases

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Automated creation of arbitrarily large Iris certificate databases

Usage: python build.py <path>\<name>.mdb <number_of_records

Example: python build.py D:\dev\big.mdb 200000

Requires a database with ICD-11 tables called 'BidIdent' and 'BigMedCod'. It will delete all data in those tables each time it's run.

Uses a list of terms from an external document. A good starting point is a dictionary, e.g. taking just the 'DiagnosisText' from https://github.com/ONSdigital/icd11-ons-english-dictionary/blob/main/Dictionary.txt. A sample file is included to get you started.

Because the script selects at random from the list you provide, the certs it creates will often be nonsensical because it doesn't understand the due to sequence. So the cert could show 'brain cancer due to a cough'! The purpose is to create large databases for load/performance testing of Iris, not to create meaningful data.

On a modest laptop the script generates around 100,000 records per minute.