TokenMill
We can help you with your natural language generation and processing projects
Vilnius, Lithuania
Pinned Repositories
beagle
Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
clojure-graalvm-aws-lambda-template
Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.
common-crawl-utils
Various Common Crawl utilities in Clojure.
crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
dictionary-annotator
Fast and configurable UIMA dictionary annotator.
docx-utils
Easily work with .docx files from Clojure (a wrapper on Apache POI library).
fast-url-access-checker
Easily run HTTP GET requests against a list of URLs to check their HTTP status.
ltlangpack
Tools for Lithuanian language processing
snowball
Snowball version of the Porter stemmer for the Lithuanian language.
timewords
Multilingual library to easily parse date strings to java.util.Date objects.
TokenMill's Repositories
tokenmill/beagle
Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
tokenmill/clojure-graalvm-aws-lambda-template
Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.
tokenmill/timewords
Multilingual library to easily parse date strings to java.util.Date objects.
tokenmill/crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
tokenmill/ltlangpack
Tools for Lithuanian language processing
tokenmill/docx-utils
Easily work with .docx files from Clojure (a wrapper on Apache POI library).
tokenmill/fast-url-access-checker
Easily run HTTP GET requests against a list of URLs to check their HTTP status.
tokenmill/dictionary-annotator
Fast and configurable UIMA dictionary annotator.
tokenmill/snowball
Snowball version of the Porter stemmer for the Lithuanian language.
tokenmill/common-crawl-utils
Various Common Crawl utilities in Clojure.
tokenmill/docker-images
Docker configurations, images, and examples of Dockerfiles for various TokenMill products and projects.Official source for Docker configurations, images, and examples of Dockerfiles for TokenMill products and projects
tokenmill/crawling-framework-example
Demonstration on how to use the Crawling Framework to setup a simple science news crawler and store results in ElasticSearch. Use this configuration to set up your own crawler.
tokenmill/beagle-performance-benchmarks
Performance benchmarks for the Beagle library, and comparisons with other stored-query solutions.
tokenmill/es-utils
Clojure helper functions for Elasticsearch.
tokenmill/metadata-detector
Library to detect metadata from html files.
tokenmill/openccg
OpenCCG library for parsing and realization with CCG
tokenmill/doccano
Open source text annotation tool for machine learning practitioner.
tokenmill/faraday
DynamoDB client for Clojure
tokenmill/gf-wordnet
A WordNet in GF
tokenmill/spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
tokenmill/unsupervised-keyphrase-extraction
EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)