Pinned Repositories
NLP-Cube
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
RO-STS
Romanian Semantic Textual Similarity Dataset
RO-WSD
Romanian Word Sense Disambiguation Dataset based on RoWordNet
Romanian-Transformers
This repo is the home of Romanian Transformers.
ronec
Romanian Named Entity Corpus (RONEC) version 2.0
roner
Named Entity Recognition for Romanian, based on transformer models
RoWordNet
Romanian WordNet (Data + API for Python)
sustain-seq2seq
This repo is a playground for seq2seq models with PyTorch
t5x_models
wiki-ro
Romanian Wikipedia dump that is cleaned and pre-processed, for language model capacity and perplexity evaluation.
dumitrescustefan's Repositories
dumitrescustefan/Romanian-Transformers
This repo is the home of Romanian Transformers.
dumitrescustefan/ronec
Romanian Named Entity Corpus (RONEC) version 2.0
dumitrescustefan/RoWordNet
Romanian WordNet (Data + API for Python)
dumitrescustefan/RO-STS
Romanian Semantic Textual Similarity Dataset
dumitrescustefan/roner
Named Entity Recognition for Romanian, based on transformer models
dumitrescustefan/t5x_models
dumitrescustefan/sustain-seq2seq
This repo is a playground for seq2seq models with PyTorch
dumitrescustefan/wiki-ro
Romanian Wikipedia dump that is cleaned and pre-processed, for language model capacity and perplexity evaluation.
dumitrescustefan/RO-WSD
Romanian Word Sense Disambiguation Dataset based on RoWordNet
dumitrescustefan/dcnews-corpus
dumitrescustefan/agerpres-corpus
dumitrescustefan/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
dumitrescustefan/dumitrescustefan
dumitrescustefan/LiroBenchmark.github.io
Romanian benchmark leaderboard
dumitrescustefan/NLP-Cube
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
dumitrescustefan/RO-NLI
Romanian Natural Language Inference Dataset
dumitrescustefan/ro-pos-tagger
dumitrescustefan/shields
Concise, consistent, and legible badges in SVG and raster format
dumitrescustefan/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
dumitrescustefan/UPFMT
Unified Processing Framework for raw Multilingual Text