use DB to mediate access to raw records
Closed this issue · 2 comments
dckc commented
- load raw records into a DB table (e.g. as CLOBs)
- feed
PatientFlatReader
from aReader
that gets data from the DB table
The "data lake" approach here was somewhat half-hearted: code for reading the NAACCR file had to run on the host with the NAACCR file, since we didn't deploy a distributed filesystem such as HDFS. This would treat the DB somewhat like a distributed filesystem. Using the DB to mediate access to the DB is the norm in HERON development for a decade or so in any case.
p.s. this is the motivation for the loadRaw experiment.