Pinned Repositories
arpa-paraphrase-corpus
Sentential paraphrase datasets and BERT-based paraphrase detection models for the Armenian language.
babylondigger
Toolkit for text segmentation, part-of-speech tagging, lemmatization and dependency parsing
pioner
Named-entity datasets and GloVe models for the Armenian language
style-change-analysis
Datasets and resources for stylometry-based intrinsic plagiarism detection research for the Armenian language.
word-embeddings-eval-hy
Pre-trained fastText, word2vec, GloVe embeddings for the Armenian language and datasets for their intrinsic and extrinsic evaluation
Ivannikov Lab's Repositories
ivannikov-lab/arpa-paraphrase-corpus
Sentential paraphrase datasets and BERT-based paraphrase detection models for the Armenian language.
ivannikov-lab/style-change-analysis
Datasets and resources for stylometry-based intrinsic plagiarism detection research for the Armenian language.
ivannikov-lab/word-embeddings-eval-hy
Pre-trained fastText, word2vec, GloVe embeddings for the Armenian language and datasets for their intrinsic and extrinsic evaluation
ivannikov-lab/babylondigger
Toolkit for text segmentation, part-of-speech tagging, lemmatization and dependency parsing
ivannikov-lab/pioner
Named-entity datasets and GloVe models for the Armenian language