- Instructor: Bradley McDonnell (mcdonn@hawaii.edu)
- Dates ~ times: TBD (meets twice a month)
- Location: Phonetics Lab (Moore 162)
This course is designed to equip students working with a broad range of linguistic data to use cutting edge research methodologies in data science. It will be held in conjunction with the Data Science Group at the University of Hawai‘i at Mānoa, which is currently run by the Department of Linguistics. Students enrolled in the course will be expected to participate in the Data Science Group meetings below.
Below is a tentative course schedule.
Date | Topic | Facilitator | |
---|---|---|---|
1 | TBD | Intro to git | Brad |
2 | TBD | Using git commands | Brad |
3 | TBD | Linking git & GitHub | Brad |
4 | TBD | Collaborating on GitHub | TBD |
5 | TBD | Data visualization with ggplot (part 1) | TBD |
6 | TBD | Data visualization with ggplot (part 1) | TBD |
7 | TBD | Data transformation with tidyverse (part 1) | TBD |
8 | TBD | Data transformation with tidyverse (part 2) | TBD |
This course will draw on two valuable resources:
- R for Data Science by Garrett Grolemund and Hadley Wickham
- Lessons from Software Carpentry
Students are required to participate in the 8 meetings listed above and will be asked to be a facilitator for at least one of these meetings. In addition, students are required to either participate in 4 LAE lab workshops throughout the semester or demonstrate equivilent progress on a project that relates to this course. Students are also required to complete a final project.
Students in the course will be required to create a GitHub repository that contains:
- README.md file that explains the content of the repository
- RStudio project with...
- Dataset (saved as a .csv file)
- Code that transforms the data with tidyverse
- Plots that display the data in a meaningfull way with ggplot.