Preprocess UCI Text Data

1. The UCI preprocessing files are borrowed from the original code for the paper "A Practical Algorithm for Topic Modeling with Provable Guarantees".