cal-itp/data-analyses

Research Task - Unique Project ID


Research Task

In relation to Epic #676

  • Question or Goal: overarching goal of this issue

  • Creating the project list involves pulling projects from several different sources. Each source has its own system for identifying projects with an alphanumeric sequence. However, we need to generate project IDs that are unique across this comprehensive list, so they can serve as its key.

  • Research Required:

  • What is the best way to create this system?

  • There are a few options.

  1. Concatenate the database name and the database project ID (for example, LP2000 + 21039402). The resulting project IDs will not all be the same length, though this can be rectified. Or is it unimportant for project IDs to be the same length?
  2. Generate a hash ID based on the columns. However, if the value in any of those columns changes (which is a real possibility), the hash will change too, and we likely want the project ID to stay the same over time.
  • Expected Outputs / Findings: what do you hope to have created when this is complete?
  • A project ID system for the final product created by Epic #676.
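
To make the two options above concrete, here is a minimal Python sketch. The function names, the `-` separator, and the default widths are assumptions for illustration and do not come from the existing codebase. For option 2, the sketch hashes only the source name and the source's own project ID rather than all columns. That is a variation on the option as described, chosen so that edits to other columns would not change the ID.

```python
import hashlib


def concat_id(source: str, source_project_id: str, width: int = 12) -> str:
    """Option 1: source name + source project ID.

    The source ID is left-padded with zeros to a fixed width so that
    IDs from the same source all have the same length. IDs longer than
    `width` are left unchanged (hypothetical default of 12).
    """
    return f"{source}-{str(source_project_id).zfill(width)}"


def hash_id(source: str, source_project_id: str, length: int = 12) -> str:
    """Option 2 (variation): hash only the fields assumed to be stable.

    Hashing the source name and source project ID, instead of every
    column, keeps the ID the same when other columns are edited.
    Truncating the digest makes collisions more likely, so the full
    list should still be checked for duplicate IDs.
    """
    key = f"{source}|{source_project_id}".encode("utf-8")
    return hashlib.sha256(key).hexdigest()[:length]


# Example using the values from option 1 above.
print(concat_id("LP2000", "21039402"))
print(hash_id("LP2000", "21039402"))
```

Either function returns the same ID every time it receives the same source name and source project ID, so the ID would only change if a source renumbers its own projects.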