/spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy

Primary language: Python. License: MIT.

spaCy Developer Resources

⚠️ This repository is outdated and has been archived. Instead of providing separate scripts, we've been focusing on standardising our formats, improving the documentation and updating the core library to require fewer utilities.

This repository is a collection of community resources and contains scripts, tools and helpers for developing spaCy, adding new languages and training new models.

Table of contents

| Directory | Description |
| --- | --- |
| corpus-utils | Scripts used to transform treebanks. Work in progress. |
| fabfile | Utility commands to be used with Fabric. |
| jupyter-displacy | Simple extension to visualise displaCy dependency parses in Jupyter Notebooks. |
| spacy-annotator | Trainable server for named entities that can be connected to a front-end, by @tcrossland. |
| templates | Template files for adding spaCy extensions, like new languages. |
| training | Scripts and services for producing spaCy models. |
| vocab | Fabric scripts for creating vocabulary, Brown clusters and word vectors from Wikipedia and other corpora. |