Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center
Repository with scripts & tools used & developed @ WZB Berlin Social Science Center. Public money – public code! See also https://datascience.blog.wzb.eu.
Berlin, Germany
Pinned Repositories
geovoronoi
a package to create and plot Voronoi regions within geographic boundaries
germalemma
A lemmatizer for German language text
otree_custom_models
Example project showing how to use custom models in oTree for recording complex decisions in experiments
otree_iat
Implicit Association Test (IAT) experiment for oTree
otreeutils
Facilitate oTree experiment implementation with extensions for custom data models, surveys, understanding questions, timeout warnings and more.
pandas-excel-styler
Styling individual cells in Excel output files created with pandas.
pdf2xml-viewer
A simple viewer and inspection tool for text boxes in PDF documents
pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
plz_geocoord
Dataset of all German postal codes and their geographic center as geo-coordinates.
tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center's Repositories
WZBSocialScienceCenter/pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
WZBSocialScienceCenter/tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
WZBSocialScienceCenter/geovoronoi
a package to create and plot Voronoi regions within geographic boundaries
WZBSocialScienceCenter/pdf2xml-viewer
A simple viewer and inspection tool for text boxes in PDF documents
WZBSocialScienceCenter/germalemma
A lemmatizer for German language text
WZBSocialScienceCenter/plz_geocoord
Dataset of all German postal codes and their geographic center as geo-coordinates.
WZBSocialScienceCenter/otreeutils
Facilitate oTree experiment implementation with extensions for custom data models, surveys, understanding questions, timeout warnings and more.
WZBSocialScienceCenter/pandas-excel-styler
Styling individual cells in Excel output files created with pandas.
WZBSocialScienceCenter/otree_iat
Implicit Association Test (IAT) experiment for oTree
WZBSocialScienceCenter/gemeindeverzeichnis
Python-Modul zum Einlesen von Gemeindeverzeichnisdaten des Statistischen Bundesamts als pandas DataFrame
WZBSocialScienceCenter/mdb-twitter-network
Twitter network of members of the 19th German Bundestag
WZBSocialScienceCenter/r-geodata-workshop
Workshop held at WZB: Working with geo-spatial data in R - Obtaining, linking and plotting geographic data
WZBSocialScienceCenter/tm_corona
A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.
WZBSocialScienceCenter/covid19-placesapi
Code to obtain and analyse "popular times" data from Google Places. Also contains data fetched between March 22nd and April 15th 2020 for different places world-wide.
WZBSocialScienceCenter/patternlite
Lightweight, Python 3.6+ fork of the original Pattern package: Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
WZBSocialScienceCenter/r_clustered_se
Code for blog post "Clustered standard errors with R: Three ways, one result".
WZBSocialScienceCenter/r_simplify_features
Code for blog post showing how to simplify spatial features with R.
WZBSocialScienceCenter/spatially_weighted_avg
Code for "Spatially weighted averages in R with sf"
WZBSocialScienceCenter/dataverse
Open source research data repository software
WZBSocialScienceCenter/lda
Topic modeling with latent Dirichlet allocation using Gibbs sampling
WZBSocialScienceCenter/wzb_r_tutorial
Documents for R tutorial given at WZB accompanying the lecture "Studying Social Stratification with Big Data" (Hipp, Ulbricht) in winter semester 2018
WZBSocialScienceCenter/aas-chronik-scraper
Web-Scraper für Chronik von antisemitischen Vorfällen erstellt von der Amadeu Antonio Stiftung und publiziert unter https://www.amadeu-antonio-stiftung.de/chronik/
WZBSocialScienceCenter/apis_for_social_scientists_a_review
A review of APIs.
WZBSocialScienceCenter/fetch_google_places_api_data
Script to fetch data of Google Places in Berlin using the Google Places API and popularity data. Used at the beginning of the COVID-19 pandemic to measure change of popularity of different places.
WZBSocialScienceCenter/github_covid_gender_jfr
Replication code for article "Has Covid-19 increased gender inequalities in professional advancement? Cross-country evidence on productivity differences between male and female software developers" published in the Journal of Family Research.
WZBSocialScienceCenter/gmapswrapper
Google Maps API wrapper for Python enables convenient caching of Maps API results.
WZBSocialScienceCenter/hipp_konrad_2021
Replication code for Men’s and women’s productivity before and during the COVID-19 pandemic: Evidence from a cross-country comparison of software developers
WZBSocialScienceCenter/otree_amp
Affect Misattribution Procedure (AMP) experiment for oTree
WZBSocialScienceCenter/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
WZBSocialScienceCenter/wzbsocialsciencecenter.github.io
wzbsocialsciencecenter.github.io landing page.