This package contains several data sets useful to historians and social scientists studying gender. They were compiled for use with the gender package, which predicts gender from first names.
The raw data sets used in this package come from the following sources:
- Mark Kantrowitz's name corpus
- Social Security Administration's baby names by year (and by state and territory)
- IPUMS Census data
- North Atlantic Population Project
See also Hadley Wickham's babynames package.
This package can be installed using the remotes package:
# install.packages("remotes")
remotes::install_github("lmullen/genderdata")
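After installing, it can help to confirm that the package is present and see which data sets it bundles. The following is a minimal sketch using only base R utilities (`requireNamespace()` and `data()`); it makes no assumption about the names of the individual data sets, and the variable name `available` is chosen here purely for illustration.

```r
# Check whether genderdata is installed before trying to inspect it.
if (requireNamespace("genderdata", quietly = TRUE)) {
  # data(package = ...) returns an object whose $results matrix has
  # one row per bundled data set; keep the name and title columns.
  available <- data(package = "genderdata")$results[, c("Item", "Title"), drop = FALSE]
  print(available)
} else {
  message("genderdata is not installed; run the install command above first.")
}
```

Each name in the `Item` column can then be loaded with `data()` or referenced with the `genderdata::` prefix.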