Demonstration of a very simple data set that cannot be easily handled by Excel; however, it is very easy to handle this data set in R.
Source data come from here and are downloaded using the vroom
package. As of 2020, there were 6,215,834 records. You should run downloadData.R
before running the other files in this repos. Note: For names at the national level see the babynames
R package.
The USNamingTrends
notebook performs a cursory EDA, examining popular names and names shared by both sexes. See here for an example.
The Shiny app displays counts by year for any name and any combination of states. See here for an example.