/admiral

ADaM in R Asset Library

Primary LanguageROtherNOASSERTION

admiral

CRAN status Test Coverage

ADaM in R Asset Library

Purpose

To provide an open source, modularized toolbox that enables the pharmaceutical programming community to develop ADaM datasets in R.

Installation

The package is available from CRAN and can be installed by running install.packages("admiral").

To install the latest development version of the package directly from GitHub use the following code:

if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}

remotes::install_github("pharmaverse/admiral.test", ref = "devel") # This is a required dependency of {admiral}
remotes::install_github("pharmaverse/admiraldev", ref = "devel") # This is a required dependency of {admiral}
remotes::install_github("pharmaverse/admiral", ref = "devel")

Release Schedule

{admiral} releases are targeted for the first Monday of the last month of each quarter. Pull Requests will be frozen the week before a release. The admiral family has several downstream and upstream dependencies and so this release shall be done in three Phases:

  • Phase 1 release is for {admiraldev} and {admiral.test}, which feed into all admiral packages
  • Phase 2 release is only for core {admiral}
  • Phase 3 release is extension packages, e.g. {admiralonco}
Release Schedule Phase 1- Date and Packages Phase 2- Date and Packages Phase 3- Date and Packages
Q1-2023 February 27th March 6th March 13th
{admiraldev} {admiral} {admiralonco}
{admiral.test}
Q2-2023 May 29th June 5th June 12th
{admiraldev} {admiral} {admiralonco}
{admiral.test}

Main Goal

Provide users with an open source, modularized toolbox with which to create ADaM datasets in R. As opposed to a “run 1 line and an ADaM appears” black-box solution or an attempt to automate ADaM.

One of the key aspects of {admiral} is its development by the users for the users. It gives an entry point for all to collaborate, co-create and contribute to a harmonized approach of developing ADaMs in R across the pharmaceutical industry.

Scope

To set expectations: It is not our target that {admiral} will ever provide all possible solutions for all ADaM datasets outside of study specific needs. It depends on the user's collaboration and contribution to help grow over time to an asset library that is robust, easy to use and has an across-industry focus. We do not see a coverage of 100% of all ADaM derivations as ever achievable---ADaM is endless.

We will provide:

  • A toolbox of re-usable functions and utilities to create ADaM datasets using R scripts in a modular manner (an "opinionated" design strategy)
  • Pharmaceutical communities and companies are encouraged to contribute to {admiral} following the provided programming strategy and modular approach
  • Functions that are comprehensively documented and tested, including example calls---these are all listed in the Reference section
  • Vignettes on how to create ADSL, BDS and OCCDS datasets, including example scripts
  • Vignettes for ADaM dataset specific functionality (i.e. dictionary coding, date imputation, SMQs ...)

Types of Packages

There will be 3 foreseeable types of {admiral} packages:

  • Core package---one package containing all core functions required to create ADaMs, usable by any company (i.e. general derivations, utility functions and checks for ADSL, OCCDS and BDS)
  • TA (Therapeutic Area) package extensions---one package per TA with functions that are specific to algorithms and requirements for that particular TA (e.g. {admiralonco})
  • Company package extensions---specific needs and plug-ins for the company, such as access to metadata (e.g. {admiralroche} or {admiralgsk})

Admiral Manifesto

For {admiral} and all extension packages, we prioritize providing our users with a simple to adopt toolkit that enables them to produce readable and easily constructible ADaM programs. The following explains our philosophy, which we try to adhere to across the {admiral} family of packages. There isn’t always a clear single, straightforward rule, but there are guiding principles we adhere to for {admiral}. This manifesto helps show the considerations of our developers when making decisions.

We have four design principles to achieve the main goal:

Usability

All {admiral} functions should be easy to use.

  • Documentation is an absolute priority. Each function reference page should cover the purpose, descriptions of each argument with permitted values, the expected input and output, with clear real-life examples---so that users don’t need to dig through code to find answers.
  • Vignettes that complement the functional documentation to help users see how best the functions can be applied to achieve ADaM requirements.
  • Functions should be written and structured in a way that users are able to read, re-use or extend them for study specific purposes if needed (see Readability below).

Simplicity

All {admiral} functions have a clear purpose.

  • We try not to ever design single functions that could achieve numerous very different derivations. For example if you as a user pick up a function with >10 different arguments then chances are it is going to be difficult to understand if this function could be applied for your specific need. The intention is that arguments/parameters can influence how the output of a function is calculated, but not change the purpose of the function.

  • We try to combine similar tasks and algorithms into one function where applicable to reduce the amount of repetitive functions with similar algorithms and to group together similar functionality to increase usability (e.g. one study day calculation rather than a function per variable).

  • We strive to design functions that are not too general and trying to fulfil multiple, complex purposes.

  • Functions should not allow expressions as arguments that are used as code snippets in function calls.

  • We recommend to avoid copy and paste of complex computational algorithms or repetitive code like checks and advise to wrap them into a function. However we would also like to avoid multi-layered functional nesting, so this needs to be considered carefully to keep the nesting of 3-4 functions an exception rather than the rule.

Findability

All {admiral} functions are easily findable.

  • In a growing code base, across a family of packages, we make every effort to make our functions easily findable.
  • We use consistent naming conventions across all our functions, and provide vignettes and ADaM templates that help users to get started and build familiarity. Each {admiral} family package website is searchable.
  • We avoid repetitive functions that will do similar tasks (as explained above with study day example).
  • Each package extension is kept focused on the specific scope, e.g. features that are relevant across multiple extension packages will be moved to the core {admiral} package.

Readability

All {admiral} functions follow the Programming Strategy that all our developers and contributors must follow, so that all our code has a high degree of consistency and readability.

  • We mandate use of tidyverse (e.g. dplyr) over similar functionality existing in base R.
  • For sections of code that perform the actual derivations (e.g. besides assertions or basic utilities), we try to limit nesting of too many dependencies or functions.
  • Modularity is a focus---we don’t try to achieve too many steps in one.
  • All code has to be well commented.
  • We recognize that a user or a Health Authority reviewer may have the wish to delve into the code base (especially given this open source setting), or users may need to extend/adapt the code for their study specific needs. We therefore want any module to be understandable to all, not only the {admiral} developers.

References and Documentation

Conference Presentations

Contact

We use the following for support and communications between user and developer community:

  • Slack---for informal discussions, Q&A and building our user community. If you don't have access, use this link to join the pharmaverse Slack workspace
  • GitHub Issues---for direct feedback, enhancement requests or raising bugs