Tong Shu Li
Last updated: 2019-06-03
Commercial drugs in the United States are assigned a unique, three-segment product identifier called the National Drug Code (NDC) by the Food and Drug Administration (FDA). The three segments of the NDC identify the labeler, the product, and the commercial package size of the drug. However, the NDC itself does not indicate which active ingredients a drug contains.
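To make the three-segment structure concrete, here is a minimal sketch that splits a hyphenated NDC into its labeler, product, and package segments. The names `NDC` and `split_ndc` are hypothetical and not part of this repository, the sample value is made up for illustration, and the sketch assumes the segments are numeric and separated by hyphens.

```python
from typing import NamedTuple


class NDC(NamedTuple):
    """Hypothetical container for the three NDC segments."""
    labeler: str
    product: str
    package: str


def split_ndc(ndc: str) -> NDC:
    """Split a hyphenated NDC string into its three segments.

    Segments are kept as strings so that leading zeros are preserved.
    Assumes numeric, hyphen-separated segments (an assumption of this sketch).
    """
    parts = ndc.strip().split("-")
    if len(parts) != 3 or not all(p.isdigit() for p in parts):
        raise ValueError(f"expected three numeric segments, got {ndc!r}")
    return NDC(*parts)


# Illustrative value only, not a real product code.
example = split_ndc("0123-4567-89")
print(example.labeler, example.product, example.package)
```

Keeping the segments as strings rather than integers avoids silently dropping leading zeros, which are significant in the identifier.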
This project generates a mapping from NDCs to identifiers for their active ingredients.
Written in Python 3 for Linux environments.
Required command line utilities:
dos2unix
unzip
Required Python packages:
pandas
jupyter
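One possible way to confirm that the requirements listed above are available before running anything is the short check below. It is only a sketch: the helper names `missing_tools` and `missing_packages` are hypothetical and not part of this repository, and it assumes each Python package can be imported under the same name it is listed with.

```python
import importlib.util
import shutil


def missing_tools(names):
    """Return the command line utilities that are not found on PATH."""
    return [name for name in names if shutil.which(name) is None]


def missing_packages(names):
    """Return the top-level Python packages that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# An empty list means everything in that group was found.
print("Missing utilities:", missing_tools(["dos2unix", "unzip"]))
print("Missing packages:", missing_packages(["pandas", "jupyter"]))
```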
code/
: All code for determining the active ingredients.

data/
: Source data from the FDA and RxNorm.

pipeline/
: Intermediate processing results.