/synpuf

A Fictitious Patient Data Set for Population Representativeness of Multiple Diabetes Trials

Primary LanguageDockerfile

SynPUF 250 OMOP CDM Dataset Information

These csv data files contains a 250 person sample of fictitious data set.

Information on the SynPUF data source is available here

The csv data files in this repo are compatible with OMOP CDM version 5.2.2.

CDM 5.2.2 DDL for the OHDSI supported DBMSs is available here

Data loading instructions

  1. git clone the contents of this repo. There is one csv file for each database table.

  2. The format of each data file is as follows:

    • No header record
    • UTF8 character set
    • Tab field separator
    • CSV
  3. Here is an example PostgreSQL DBMS copy statement to load the CDM person csv file with the psql command line client:

\COPY PERSON FROM 'person.csv' DELIMITER E'\t' CSV ENCODING 'UTF8';

Docker

docker-compose up