Discrepancy between various data sources [IR001]
Closed this issue · 0 comments
mimani68 commented
Intro
Most of financial activity due the diversity of diversity in data sources and institutional operations come with discrepancy in output result. The process of unifying and checking the result is not only exhausted but also has many drawbacks like human error, inaccuracies and possible duplication.
Demands
- Continues process for data gathering
- Determining some functionality for cleaning and normalizing data
- Show latest status of each transaction
Inputs
- Excel file (ftp server)
- Database (query)
- Online modification
- Policy file (S3 bucket)
4.1) bond time of operation
4.2) database cred
Pipelines
- Dump polices
- Clear customer confidential data
- Checking all inputs and providing result
- Store updated record into database
- Send and email consist of list of updated record
Outputs
- Adding new record in database