eklem/stopword-trainer
A module for creating stopword lists for any language, based on a set of documents.
JavaScriptMIT
Issues
- 0
Move CI test from TravisCI to GitHub
#138 opened by eklem - 1
- 0
- 0
Document `redlist`
#141 opened by eklem - 0
bundle/build cjs, esm and umd
#123 opened by eklem - 19
- 1
Browser demo
#121 opened by eklem - 1
tests for node.js and the browser
#108 opened by eklem - 4
- 2
- 1
Add check if stopWordiness === 0
#50 opened by eklem - 4
only take keys (also dot notated) as input
#109 opened by eklem - 1
Drop object-end-keys dependency
#106 opened by eklem - 2
Threshold for how short documents can be
#102 opened by eklem - 1
Drop or update lodash
#105 opened by eklem - 2
- 0
- 1
Show how to use on search-index export
#35 opened by eklem - 1
Use the Wikimedia Downloads
#56 opened by eklem - 1
Creating red-list array
#40 opened by eklem - 0
- 1
Add standard.js to package.config
#64 opened by eklem - 1
swap from greenkeeper to GitHub
#65 opened by eklem - 2
Do a toLowerCase() on all paragraph array strings
#55 opened by eklem - 1
Move ndjson.parse out of stopword-trainer
#30 opened by eklem - 1
console client: Exit when no key provided
#51 opened by eklem - 1
Add check if file content is wrong
#52 opened by eklem - 1
Move around documentation
#53 opened by eklem - 0
upgrade lodash to latest
#54 opened by eklem - 0
Make it "pipe" properly
#27 opened by eklem - 1
Possibility to define regex in options
#16 opened by eklem - 4
- 3
- 1
Add tests for a couple of non English languages
#49 opened by eklem - 2
Check if whole extractionKeys array is extracted
#48 opened by eklem - 0
Version 10 of node.js has been released
#44 opened by greenkeeper - 0
Make Greenkeper a little less chatty
#34 opened by eklem - 0
Make setting options optional
#25 opened by eklem - 0
move some stuff to global data object
#28 opened by eklem - 0
Test nested objects
#26 opened by eklem - 5
Possibility to define which fields to analyze?
#14 opened by eklem - 4
- 0
Remove config.json
#24 opened by eklem - 0
Switch from full reuters library to 1000 documents
#23 opened by eklem - 1
Test framework in place
#22 opened by eklem - 1
Error handling
#18 opened by eklem - 1
Switch to file input
#21 opened by eklem - 1
take configuration/options input
#20 opened by eklem - 1
Make it a library
#19 opened by eklem - 1
Remove numbers from text extraction
#15 opened by eklem