/ungdc

Analysis of the UN General Debate Corpus (UNGDC) from a computational social science perspective

GNU General Public License v3.0GPL-3.0

Analysis of the UN General Debate from the Perspective of Sociology

This is an analysis of the UN General Debate Corpus (UNGDC).

The presentation demonstrates the promising opportunities when combining sociological inquiries with methods of Natural language Processing (NLP). The talk was given together with Sophie Mützel in the session Computational Social Science at the "39. Kongress der Deutschen Gesellschaft für Soziologie, 2018, Göttingen".

This project builds on the resources of:

Alexander Baturo, Niheer Dasandi, and Slava Mikhaylov, "Understanding State Preferences With Text As Data: Introducing the UN General Debate Corpus" Research & Politics, 2017.

For this analysis, I further cleaned the data to get more robust results and allow the application of more sensitive methods. Additionally, the dataset is enriched with additional country-specific information. I am happy to share this improved dataset. Please get in touch if you have any questions. The original data can be found here: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/0TJX8Y