/nodejs-sqs-ocr

Docker image to OCR PDF files. Jobs pulled from AWS SQS, files stored in S3

Primary LanguageJavaScript

nodejs-sqs-ocr

NodeJS application / Docker image to OCR images. Jobs pulled from AWS SQS, files stored in S3

Environment Variables

Example SQS Message Body for S3 bucket notifications

{
  "Records": [
    {
      "eventVersion": "2.1",
      "eventSource": "aws:s3",
      "awsRegion": "eu-west-1",
      "eventTime": "2020-11-16T19:10:20.178Z",
      "eventName": "ObjectCreated:Put",
      "userIdentity": {
        "principalId": "example"
      },
      "requestParameters": {
        "sourceIPAddress": "10.20.30.40"
      },
      "responseElements": {
        "x-amz-request-id": "...",
        "x-amz-id-2": "..."
      },
      "s3": {
        "s3SchemaVersion": "1.0",
        "configurationId": "09c216f5-caa0-4b93-9853-a86dbbf9a8b5",
        "bucket": {
          "name": "inputbucket",
          "ownerIdentity": {
            "principalId": "..."
          },
          "arn": "arn:aws:s3:::inputbucket"
        },
        "object": {
          "key": "demo.png",
          "size": 70668,
          "eTag": "f5e86c36a7874ba2492b0143ec56360d",
          "sequencer": "005FB2CE9EA7BE0F74"
        }
      }
    }
  ]
}

Records[].s3.object.key should contain the key to the input file in AWS_S3_OCR_INPUT input bucket