Research Group of Language Technology, NYTK
Research Group of Language Technology, NYTK
Budapest, Hungary
Pinned Repositories
embedding-demo
visualization for word2vec datasets
emMorph
emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
hadifogoly-adatbazis
A magyar hadifoglyok adatbázisának orosz-magyar transzkripciója
huwn
Hungarian WordNet / Magyar WordNet
huwn.rdf
Hungarian WordNet in RDF format for the Linked Open Data cloud
NYTK-NerKor
The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.
panmorph
Tagsets and description of Hungarian morphological analysers.
quntoken
Hungarian tokenizer.
xtsv
A generic TSV-style format based intermodular communication framework and REST API implemented in Python
Research Group of Language Technology, NYTK's Repositories
nytud/emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
nytud/NYTK-NerKor
The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.
nytud/quntoken
Hungarian tokenizer.
nytud/HuLU
Hungarian Language Understanding Benchmark Kit
nytud/machine-translation
nytud/HunTag3
A sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models
nytud/hunspellpy
Hunspell integrated with the xtsv framework
nytud/anonymizer_hu
The Hungarian anonymization tool for CURLICAT
nytud/bert_coref_hu
nytud/HAPP
nytud/HuCOLA
Hungarian Corpus of Linguistic Acceptability
nytud/HuCoPA
Hungarian Choice of Plausible Alternatives Corpus
nytud/HuSST
Hungarian version of the Stanford Sentiment Treebank
nytud/HuWiC
Hungarian Word-in-Context Corpus
nytud/HuWS
Hungarian Winograd Schemes
nytud/parallelbible
TSV files of the Parallel Bible Reader
nytud/site-data-corpus
Personal site data for Research Group for LangTech @ MTA NYTI
nytud/ae-sentence-embeddings
Sentence embeddings with autoencoders
nytud/emdummy
An example module for emtsv
nytud/HuCommitmentBank
nytud/HuParlaMintII
nytud/HuRTE
Hungarian version of the Recognising Textual Entailment datasets
nytud/HuWNLI
Anaphora resolution datasets for Hungarian as an inference task
nytud/ITK-Transformer-NLP
Bevezetés az NLP-be Transformer-alapú modellekkel
nytud/news-please
news-please - an integrated web crawler and information extractor for news that just works
nytud/nytud.github.io
nytud/ParlaMint
ParlaMint: Comparable Parliamentary Corpora
nytud/pseudo-anonimization
nytud/PWS
nytud/w2v_models
Various models trained on parts of Webcorpus 2.0