/text_cleaning

Corpora and scripts for cleaning political science texts. Scripts are translated into transformations that support SAGE Texti.

Primary language: Python. License: Mozilla Public License 2.0 (MPL-2.0).

TEXT CLEANING FOR POLITICAL SCIENCE

Take a look at the detailed documentation for Texti here

If you would like to contribute transformations or corpus examples, check this page.

Contents of the file

  • Introduction
  • Setup
  • Tests

Introduction

This repository contains the Python scripts that power the transformations on Texti (in app_scripts) and the documentation for them (in source).

Setup

`pip install -r requirements.txt`

Tests

1. Change to the `app_scripts/clean` directory: `cd app_scripts/clean`
2. Run the tests: `python3 -m unittest`
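For contributors adding a transformation, the sketch below shows the kind of test that `python3 -m unittest` would pick up. It is only an illustration: the function `collapse_whitespace`, the class name, and the suggested file name `test_example.py` are hypothetical and are not taken from this repository. By default, unittest discovery looks for files matching `test*.py`.

```python
import re
import unittest


def collapse_whitespace(text: str) -> str:
    """Hypothetical cleaning step (not from this repo):
    squeeze runs of whitespace into single spaces and trim the ends."""
    return re.sub(r"\s+", " ", text).strip()


class CollapseWhitespaceTest(unittest.TestCase):
    """Example test case; the real tests in app_scripts/clean may differ."""

    def test_collapses_runs(self):
        raw = "The  Senate\n\tvoted."
        self.assertEqual(collapse_whitespace(raw), "The Senate voted.")

    def test_whitespace_only_input(self):
        self.assertEqual(collapse_whitespace("   "), "")
```

Saved as, say, `test_example.py` in the directory where the tests are run, this would be discovered and executed alongside the existing tests.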