OpenFn/unicef-cambodia

F1-J2 job times out on prod

aleksa-krolls opened this issue · 3 comments

Issue

See run examples for Flow 1 where we (1) fetch cases from Primero, and (2) upload to Oscar. Strangely it only looks like there are 2-3 cases to process... yet the job times out. Is this an issue related to the data or OSCaR destination system?
https://www.openfn.org/projects/pdngk6/runs/rmkyyzdx
https://www.openfn.org/projects/pdngk6/runs/r7x775y6

state

To test

Re-run Flow 1 on production: https://www.openfn.org/projects/pdngk6/jobs/jyazq6

@taylordowns2000 fyi, Kiry has been doing some work and the /upsert_links API endpoint seems to be back up and running: https://www.openfn.org/projects/pdngk6/runs/r5eadpxa
Therefore if we can get this new fail retry solution in place this week, I think we can plan for go-live this week. So as discussed yesterday, just keep me posted on whenever ready to test ... I'll then switch the jobs on prod to the different staging environments to give it a go.

@taylordowns2000 Requests to Oscar failed this morning and the J3 jobs successfully created 2 Messages: https://www.openfn.org/projects/pdngk6/messages

Can you add the Name of the job that failed to these Message bodies so that it's very clear which job to re-run?

{
  "__query_params": {},
  "lastCaseCreated": "27-01-2021",
  "lastCreated": "14-01-2021",
  "lastUpdated": "15-12-2020 03:03"
}