hrs/python-tf-idf

Question: Any plans on a top-n terms?

Opened this issue · 1 comments

I've seen tf-idf used in many cases to identify the top-n terms that are most unique to a particular document. I don't know how much active development there is on this project, but if there is active development on that front this might be a useful capability for this tool/class to support!

Great work by the way, thanks for posting all this on Github! 👍

I would like to work on this issue