nok/sklearn-porter

Port CountVectorizer

nicolalandro opened this issue · 3 comments

For text mining it's important to fit also a CountVectorizer (or a TFIDFTransformer), so should be possible to export it in the targhet lenguage

Currently, is there any alternative way?

@Opdoop the way is to extract the dictionary from CountVectorizer (get_feature_names()) and reimplement the logic manually.
You can do the same thing on TFIDF.

nok commented

Hello @nicolalandro ,

can you provide some snippets? Withit I will have a better start.

Kind regards,
Darius