/affinity_algorithms

NLP Research project examines language evolution of high affinity terms in online communities (Reddit)

Primary LanguageJupyter Notebook

Affinity Analysis Algorithms

This repository contains the package and code used for research in the paper "Community Characteristics and Semantic Shift of High Affinity Terms in Reddit Communities". The research project consited of computing affinity values, semantic shifts and community characteristics for Reddit communities from a selected corpus. The code for this is specifically in the affinity_algorithms package.

Project Structure

The project contains scripts that were explicitly used for the Network Dynamics Lab servers to process Reddit Datasets, as well as a package that can be used independently to the data structure
  • affinity_analysis package is an independent package that consists of exact affinity analysis, semantic shifts and community characteristics algorithms as defined in the paper.
  • docs contains the documentation for the package. TODO
  • ndl contains scripts specific to the Network Dynamics Lab server-side data processing.
  • examples contains an example notebook that demonstrates use cases of the package
  • tests

Prerequisites

Install the packages from the requirements.txt

pip install requirements.txt

Tests

TODO

Examples

TODO