Pinned Repositories
brat
brat rapid annotation tool (brat) - for all your textual annotation needs
annodoc
Annodoc annotation documentation support system
conlleval.py
Python version of the evaluation script from CoNLL'00-
conllu.js
CoNLL-U format library for JavaScript
ncbi-disease
NCBI disease corpus - related resources
nxml2txt
NLM .nxml to text format conversion
standoff2conll
Conversion from brat-flavored standoff to CoNLL format
wiki-bert-pipeline
Generate BERT vocabularies and pretraining examples from Wikipedias
wvlib
word vector library
docs
Universal Dependencies online documentation
spyysalo's Repositories
spyysalo/conllu.js
CoNLL-U format library for JavaScript
spyysalo/ncbi-disease
NCBI disease corpus - related resources
spyysalo/bc2gm-corpus
Work related to the BioCreative II Gene Mention corpus
spyysalo/pubtator
PubTator tools
spyysalo/genia-pos
GENIA corpus v3.02 part-of-speech annotations (GENIA tagger variant)
spyysalo/sols
soft-matching ontology lookup service
spyysalo/interleave-layer
Special-purpose Keras layer for merging word and dependency vectors for relation classification
spyysalo/crf-test
Keras CRF experiments
spyysalo/multiling-cnn
Simple multi/cross-lingual CNN text classifier
spyysalo/knowtator2standoff
Knowtator to standoff format conversion for CRAFT corpus
spyysalo/add-text-to-conllu
Add "# text = " lines from a text file to CoNLL-U data
spyysalo/conllu-language
Language identification for CoNLL-U data
spyysalo/conllu-matches
Find identically annotated sentences in two sets of CoNLL-U data
spyysalo/consensus-viewer
Consensus annotation viewer
spyysalo/craft-ud-fix
CRAFT corpus Universal Dependencies data fixes
spyysalo/crfsuite-tools
Tools for working with CRFsuite (http://www.chokkan.org/software/crfsuite/)
spyysalo/finer-tools
Tools for working with FiNER data (https://github.com/mpsilfve/finer-data)
spyysalo/finnish-registers
Tools for working with Finnish register data
spyysalo/genia-dependency-trees
Universal Dependencies (v1.0) for the GENIA 1.0 Treebank, along with additional raw abstracts and metadata.
spyysalo/greekc-triage-service
spyysalo/jensenlab-tagger
Copy of https://bitbucket.org/larsjuhljensen/tagger/
spyysalo/jensenlab-tools
Tools for working with JensenLab tools
spyysalo/MutationFinder
Tools for the MutationFinder corpus (http://mutationfinder.sourceforge.net/)
spyysalo/pickanno
spyysalo/pytorch-transformers
👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)
spyysalo/tagger
Named Entity Recognition Tool
spyysalo/tees-xml
Tools for working with TEES XML (EVEX DB format)
spyysalo/tokens-x-mesh
Create cross-product of tokens and MeSH terms from data extracted from PubMed.
spyysalo/tweakconllu
Tools for modifying CoNLL-U data
spyysalo/wordvec-oov
Determine out-of-vocabulary rate for word vectors on text