upscan-initiate

Microservice for initiating the upload of files created externally to HMRC estate. These could be from members of the public or third-party services. This service is not for transfer of files from one HMRC service to another. See the Transmission Service as documented in Confluence for this use-case.

TLDR

We strongly advise against hardcoding the "fields" in the response of initiate and v2/initiate. These are subject to change.
The file must be the last field in the actual upload request.
You must use multipart encoding (multipart/form-data) NOT application/x-www-form-urlencoded. The error message returned by AWS is obscure when the wrong content type is used.

Upscan user manual

Introduction
File upload workflow
Service usage a. Requesting a URL to upload to b. The file upload c. File upload outcome d. File processing outcome i. Success ii. Failure
Error handling
Design considerations a. Uploading multiple files b. Security c. File metadata
Architecture of the service
Running and maintenance of the service a. Running locally
Appendix a. Quick reference figures b. Related projects, useful links i. Testing ii. Slack c. License

Introduction

In this "user manual" the collection of microservices that make up Upscan are discussed, not just upscan-initiate. This documentation is here as upscan-initiate is the microservice which developers will interact with directly.

The Upscan service allows consuming services to orchestrate the uploading of files. Upscan provides temporary storage of the uploaded file, ensures that the file isn't harmful (doesn't contain viruses) and verifies against predefined restrictions provided by the consuming service (e.g. file type & file size). Once the upload URL has been requested, upload and verification of a file are performed asynchronously without the involvement of the consuming service.

Header name	Description	Required
User-Agent	Identifier of the service that calls upscan	yes
X-Session-ID	Identifier of the user's session	no
X-Request-ID	Identifier of the user's request	no

Parameter name	Description	Required
callbackUrl	Url that will be called to report the outcome of file checking and upload, including retrieval details if successful. Notification format is detailed further down in this file. Must be https.	yes
successRedirect	Url to redirect to after file has been successfully uploaded.	no
errorRedirect	Url to redirect to if error encountered during upload.	no
minimumFileSize	Minimum file size (in Bytes). Default is 0.	no
maximumFileSize	Maximum file size (in Bytes). Cannot be greater than 100MB. Default is 100MB.	no
expectedContentType	MIME type describing the upload contents.	no

Metric	Value	Comments
Expiration of S3 upload pre-signed URL	7 days	A relatively long period, since we can't control exactly when users will initiate the upload process
Expiration of S3 download pre-signed URL (scanned docs)	1 day (default)	Configurable per-service up to 7 days. Upscan is not intended as a storage solution for services
Callback request retry time	60 seconds
Maximum callback notification retries	30

arturopala/upscan-initiate

upscan-initiate

TLDR

Upscan user manual

Contents

Introduction

File upload workflow

Service usage

Requesting a URL to upload to

POST upscan/v2/initiate

HTTP Headers:

Body parameters:

POST upscan/initiate

HTTP Headers:

Body parameters:

The file upload

File upload outcome

File processing outcome

Success

Failure

Error handling

Design considerations

Uploading multiple files

Security

File metadata

Architecture of the service

Running and maintenance of the service

Running locally

Appendix

Quick reference figures

Related projects, useful links:

Testing

Slack

License