/convert_to_ldac

Convert tweets to LDA-C form for using David Blei's LDA and turbotopics code

Primary LanguagePython

convert_to_ldac

Convert tweets to LDA-C form for using David Blei's LDA and turbotopics code

where [M] is the number of unique terms in the document, and the [count] associated with each term is how many times that term appeared in the document. Note that [term_1] is an integer which indexes the term; it is not a string.