navapbc/wic-participant-recertification-portal

Create RDS (non-Aurora) reusable terraform module

rocketnova opened this issue · 13 comments

Context

We discovered that AWS RDS Aurora v2 is more expensive than AWS RDS. On PRP, Aurora costs ~$150/mo even with very little load, while on MWDP, RDS costs ~$15/mo. A difference of $135/mo is not significant on a large project, but it adds up quickly for our long-lived demo projects.

In #116, we got most of the way toward creating an RDS drop-in alternative to the RDS Aurora module, but the PR was never merged because 2 comments were not resolved.

Tasks

  1. Conduct the research requested in comment 1 and advise me on what the correct allocated_storage should be for the participant portal
  2. Add a backup service to address comment 2
    • Planning: Let's talk through what backup service you would recommend and why before you begin implementation.

Task 1

The database is mostly used for short-term data persistence rather than long-term storage, which means our actual allocated storage doesn't need to be very large. The recommendation is a max_allocated_storage of 2 GiB, with autoscaling configured to further reduce costs. I believe the minimum value is 1 GiB, which matches our current configuration for the Aurora database.


[Screenshot: current storage allocation]


If the need arises, we could also set up an automated job (cron, Lambda, etc.) that drops data after one week.
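As a rough sketch of what that automated job could look like in Terraform, an EventBridge rule could invoke a cleanup Lambda on a weekly schedule. The aws_lambda_function.cleanup resource (which would run the actual delete against the database) is hypothetical and not part of this issue:

# Sketch: invoke a hypothetical cleanup Lambda once a week via EventBridge
resource "aws_cloudwatch_event_rule" "cleanup_schedule" {
  name                = "db-cleanup-weekly" # hypothetical name
  schedule_expression = "rate(7 days)"
}

resource "aws_cloudwatch_event_target" "cleanup_target" {
  rule = aws_cloudwatch_event_rule.cleanup_schedule.name
  arn  = aws_lambda_function.cleanup.arn # assumed to be defined elsewhere
}

# Allow EventBridge to invoke the Lambda
resource "aws_lambda_permission" "allow_eventbridge" {
  statement_id  = "AllowExecutionFromEventBridge"
  action        = "lambda:InvokeFunction"
  function_name = aws_lambda_function.cleanup.function_name
  principal     = "events.amazonaws.com"
  source_arn    = aws_cloudwatch_event_rule.cleanup_schedule.arn
}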


Additional reading

pricing information for Aurora
pricing information for RDS

Excellent! Thank you.

The recommendation is a max_allocated_storage of 2 GiB, with autoscaling configured to further reduce costs. I believe the minimum value is 1 GiB, which matches our current configuration for the Aurora database.

Does this mean that we should change this line to the following?

  allocated_storage     = 1
  max_allocated_storage = 2

@rocketnova I think so? Nothing in the docs says whether the value for max_allocated_storage should be the actual amount of space or capacity units.

@aplybeah I think it must be? The Terraform argument reference says allocated_storage is in GiB:

allocated_storage - (Required unless a snapshot_identifier or replicate_source_db is provided) The allocated storage in gibibytes.

And then the storage autoscaling section says:

To enable Storage Autoscaling with instances that support the feature, define the max_allocated_storage argument higher than the allocated_storage argument. Terraform will automatically hide differences with the allocated_storage argument value if autoscaling occurs.

resource "aws_db_instance" "example" {
... other configuration ...

allocated_storage = 50
max_allocated_storage = 100
}

That implies to me that the example sets the base allocated_storage to 50 GiB and max_allocated_storage to 100 GiB, but I could be wrong!
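For context, here's a sketch of how those values would sit on the resource, with everything else unchanged. One caveat worth checking during implementation: the RDS minimum allocated storage varies by engine and storage type, so the 1 GiB floor is an assumption:

resource "aws_db_instance" "example" {
  # ... other configuration ...

  allocated_storage     = 1 # base storage in GiB (assumed minimum; varies by engine/storage type)
  max_allocated_storage = 2 # autoscaling ceiling in GiB
}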

When you test out the implementation, could you try the following and see if it works as expected/hoped?

  allocated_storage     = 1
  max_allocated_storage = 2

Sure! I'll give that a try.

Task 2
Following the request in this comment: #116 (comment)
I think the only thing we need to do is set a retention period on the database and possibly make sure a bucket exists for the snapshots.

# something like this
backup_retention_period = 7
backup_window           = "02:00-03:00" # a random time, but note it's in UTC

I assume these would be arguments on the aws_db_instance resource? If so, how do we tell it what s3 bucket to use?

@rocketnova I don't think the s3 bucket applies in our case; I misread something when I added that comment. But yes, these are arguments on aws_db_instance.
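For illustration, a minimal sketch of those arguments in place on the resource, using the values from the snippet above:

resource "aws_db_instance" "example" {
  # ... other configuration ...

  backup_retention_period = 7             # keep automated backups for 7 days
  backup_window           = "02:00-03:00" # daily backup window, in UTC
}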

@aplybeah Oh I see. What does the backup restore process look like if we use backup_retention_period?

@rocketnova Every day there is a backup window of about 30 minutes. We can either define when we want this window to be or use AWS's default time. During this window there may be increased latency, etc. (nothing we need to be too concerned about), and the database creates a snapshot of the entire instance.

@aplybeah Are we talking about these RDS automated backups? If so, is this the correct restoration process?

@rocketnova Yes we are, and that is the correct process.
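For future reference, a point-in-time restore from these automated backups can also be expressed in Terraform by creating a new instance with a restore_to_point_in_time block. This is only a sketch; the identifier and instance class below are placeholders:

resource "aws_db_instance" "restored" {
  identifier     = "participant-portal-restored" # hypothetical name
  instance_class = "db.t3.micro"                 # placeholder size

  restore_to_point_in_time {
    source_db_instance_identifier = aws_db_instance.example.identifier
    use_latest_restorable_time    = true # restore to the most recent available point
  }
}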