TimSchopf/KeyphraseVectorizers
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.
PythonBSD-3-Clause
Issues
- 1
Dependency issue
#36 opened by Darshan2104 - 1
- 2
installation issue
#27 opened by dineshdisprz - 2
Can not work with Chinese
#22 opened by JaheimLee - 2
Divide by zero error when trying to use `KeyphraseCountVectorizer` with BERTopic
#26 opened by Pratik--Patel - 2
use of custom stop words
#28 opened by gboyega1 - 1
It does not exclude stop words in Portuguese
#31 opened by phuclh - 1
OnlineKeyphraseVectorizer
#29 opened by edloginova - 1
Reducing outliers in BERTopic
#34 opened by ddenz - 3
Memory Issues
#6 opened by amoschoomy - 0
Regex from the paper?
#32 opened by turian - 1
Use list of POS patterns to reduce runtime
#30 opened by saied71 - 2
Lemmatizing documents and keyphrases
#9 opened by hboisgibault - 2
POS PATTERN for italian language?
#14 opened by spolo96 - 8
Custom Stopwords
#7 opened by amoschoomy - 1
Expose regex token_pattern
#20 opened by raj-shah - 2
Keyword similarity values
#25 opened by arash-hajikhani - 0
- 2
Can't tag with spaCy in some languages
#12 opened by mdsutter - 1
POS PATTERN for Chinese language?
#18 opened by Duanmu0312 - 1
- 0
Cannot use this for japanese text
#17 opened by MARUD84 - 1
- 11
Spacy tagger is not available in French
#2 opened by hboisgibault - 2
Doesn't seem to work in french
#13 opened by maximelucet - 2
- 1
- 2
Keyphrases retrieved do not match regex
#8 opened by hboisgibault - 5
spacy. gold
#3 opened by aph61 - 3