langcog/wordbankr

add create_norms() and get_norm_scores() functions?

Opened this issue · 2 comments

Background:

  • lots of people ask for norms, and we've duplicated effort (and scripts) across projects
  • it's a standard procedure (although we don't have demographics for many languages: just EN and SP):
  • look up by age and sex, look up in manual tables and interpolate between 5 percentiles, etc..

Potential solutions:

  1. offer 'standard' script (on GH / email with proper warnings) -> or just a wordbankr function? (with good documentation and a vignette example)
  2. scoring tool (in wordbankr, shiny, or both?): import your CSV (id, age, sex, sumscore) - too many people used it incorrectly... (need to clearly document expected mismatch between published tables and wordbank %iles)
  3. add 'wordbank_percentile' column to https://wordbank.stanford.edu/data/?name=admin_data

@vmarchman , @alvinwmtan and I thought adding functionality to wordbankr makes the most sense, as it offers (sophisticated) users the option of using our SotA that should match published norms where we have them

@mcfrank your thoughts welcome!