/python-for-social-scientists

Getting started with Python: A how-to guide for social scientists by Ruben Bach and Andreas Küpfer

Primary LanguageJupyter Notebook

Getting started with Python: A how-to guide for social scientists

📍 Hybrid Event: MZES, Mannheim + Zoom

📆 February 15, 2023 & February 22, 2023

Day 1

Other than with R, getting started with Python can be burdensome at times as there is no one-stop shop solution like RStudio. Although tons of introductory tutorials for Python are available on the web, navigating and setting up one’s programming environment can be challenging, especially for users with little programming experience. To lower the burden of getting started with Python, we will talk in this workshop about the basics of Python, installing and maintaining virtual environments and the various graphical user interfaces and integrated development environments out there like Jupyter Notebooks, Google Colab, and Anaconda. We show situations where Python may be beneficial for your research and when you may choose to go with R. Please note that this talk is the first part of a two-day workshop in the Social Science Data Lab. In the second event (February 22, 2023), we will focus our attention on implementing a simple machine learning routine in Python.

📝 Slides

Day 2

The merits of Python for social scientists become tangible when working on a concrete use case. In this follow-up event of our Social Science Data Lab workshop series on Python we use Jupyter Notebooks in the Google Colab environment to implement a simple machine learning routine for prediction. To do that, we first take a step-by-step look at the peculiarities of Python such as data wrangling and basic visualization techniques. With that knowledge, we delve into the basics of applied machine learning by implementing the pipeline for both a logistic regression as well as a random forest model using the Python package scikit-learn. We conclude this workshop with a brief outlook on more advanced possibilities with Python to lay the foundation for your own research.

📝 Slides 💻 Code

Presenters

👤 Andreas Küpfer is a doctoral researcher at the Technical University of Darmstadt. His interdisciplinary research interests include text as data, applying machine learning technologies, and substantial inference in the fields of political communication and political competition.

👤 Ruben Bach is a postdoctoral researcher at the MZES, University of Mannheim, focusing on social science quantitative research methods. His interests include topics related to big data in the social sciences, machine learning, causal inference, and survey research.