In this course, students have an introduction to Python so that they can conduct initial analysis on socio-cultural phenonena using data. Socio-cultural data pose great challenges and offer many opportunities due to their variety and richness, such as historical and literary sources, online social networks and media. During this course, students are introduced to the foundation concepts and techniques for analyzing such a variety of data.
You can run all the notebooks for this course using Colab (link above). If you prefer to work locally, I recommend using Jupyter notebooks which are easily accessible using the Anaconda interface.
Week | Topic | Materials |
---|---|---|
0 | Preparation (Giovanni Colavizza's Material) | notebook |
1 | Introduction (Giovanni Colavizza's Material) | slides + notebook |
2 | Data Gathering and Cleaning + Foundation of Statistics | slides + notebook |
3 | Data Analysis: Exploring your data | slides + notebook |
4 | Combining Data | slides + notebook |
5 | Visualizing Data | slides |
The datasets used in this course can be found here.
Students can also use these datasets(https://github.com/mromanello/ADA-DHOxSS/tree/master/data) to test their knowledge. They are from the Applied Data Analysis course.
See the assignments folder.
- Bradshaw, P. (2007). The inverted pyramid of data journalism. 1–7. http://onlinejournalismblog.com/2011/07/07/the-inverted-pyramid-of-data-journalism/
- Tong, Jingrong, and Landong Zuo. 2021. “The Inapplicability of Objectivity: Understanding the Work of Data Journalism.” Journalism Practice 15 (2): 153–69. https://doi.org/10.1080/17512786.2019.1698974.
- de-Lima-Santos, Mathias Felipe, Aljosha Karim Schapals, and Axel Bruns. 2021. “Out-of-the-Box versus in-House Tools: How Are They Affecting Data Journalism in Australia?” Media International Australia 181 (1): 152–66. https://doi.org/10.1177/1329878X20961569
- Mahmoud, Raghda, and Shaima’a Z. Zoghaib. 2023. “The Effects of Different Data Visualisation Formats on News Recall and Comprehension.” Media Watch, March, 097609112311587. https://doi.org/10.1177/09760911231158746
- Jansen, Yvonne, Pierre Dragicevic, Petra Isenberg, Jason Alexander, Abhijit Karnik, Johan Kildal, Sriram Subramanian, and Kasper Hornbæk. 2015. “Opportunities and Challenges for Data Physicalization.” Conference on Human Factors in Computing Systems - Proceedings 2015-April (April): 3227–36. https://doi.org/10.1145/2702123.2702180
- Lenzi, Sara, and Paolo Ciuccarelli. 2020. “Intentionality and Design in the Data Sonification of Social Issues.” Big Data & Society 7 (2): 205395172094460. https://doi.org/10.1177/2053951720944603
- Boss, Katherine, and Meredith Broussard. 2017. “Challenges of Archiving and Preserving Born-Digital News Applications.” IFLA Journal 43 (2): 150–57. https://doi.org/10.1177/0340035216686355
- Heravi, Bahareh, Kathryn Cassidy, Edie Davis, and Natalie Harrower. 2021. “Preserving Data Journalism: A Systematic Literature Review.” Journalism Practice, March, 1–23. https://doi.org/10.1080/17512786.2021.1903972
- Broussard, Meredith, and Katherine Boss. 2018. “Saving Data Journalism.” Digital Journalism 6 (9): 1206–21. https://doi.org/10.1080/21670811.2018.1505437
A good companion for this course is John Canning, Statistics for the Humanities, 2014. Also recommended are Melanie Walsh, Introduction to Cultural Analytics & Python, 2021 and Karsdorp, Kestemont, Riddell, Humanities Data Analysis: Case Studies with Python, 2021.
Mathias-Felipe de-Lima-Santos (Ph.D.) is a postdoctoral researcher in the Human(e) AI project at the University of Amsterdam, Netherlands. He is also a faculty associate in the Digital Media and Society Observatory (DMSO) at the Federal University of São Paulo (Unifesp), Brazil. Previously, he was a researcher at the University of Navarra, Spain, under the JOLT project, a Marie Skłodowska-Curie European Training Network funded by the European Commission’s Horizon 2020. He was also a Visiting Researcher at the Queensland University of Technology (QUT) in Brisbane, Australia. Mathias-Felipe is co-editor of the book “Journalism, Data, and Technology in Latin America” published by Palgrave Macmillan in 2021. His research interests include the changing nature of communications driven by technological innovations, particularly in journalism, media, and online social networks.
Everything in this repository which is not already attributed to someone else is released under CC BY 4.0.