Pinned Repositories
jLDADMM
A Java package for the LDA and DMM topic models
jPTDP
Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
LFTM
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
RDRPOSTagger
A fast and accurate POS and morphological tagging toolkit (EACL 2014)
BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
datquocnguyen's Repositories
datquocnguyen/LFTM
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
datquocnguyen/jPTDP
Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
datquocnguyen/RDRPOSTagger
A fast and accurate POS and morphological tagging toolkit (EACL 2014)
datquocnguyen/jLDADMM
A Java package for the LDA and DMM topic models
datquocnguyen/RDRsegmenter
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
datquocnguyen/STransE
STransE: a novel embedding model of entities and relationships in knowledge bases (NAACL 2016)
datquocnguyen/PhoW2V
Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese
datquocnguyen/jointRE
End-to-end neural relation extraction using deep biaffine attention (ECIR 2019)
datquocnguyen/BioPosDep
Tokenization, sentence segmentation, POS tagging and dependency parsing for biomedical texts (BMC Bioinformatics 2019)
datquocnguyen/VnDT
VnDT: A Vietnamese Dependency Treebank
datquocnguyen/VnMarMoT
A state-of-the-art pre-trained model for Vietnamese POS tagging (ALTA 2017)
datquocnguyen/MAP4LDA
Improving Topic Coherence with Latent Feature Word Representations in MAP Estimation for Topic Modeling (ALTA 2015)
datquocnguyen/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
datquocnguyen/TransE-NMM
Neighborhood Mixture Model for Knowledge Base Completion (CoNLL 2016)
datquocnguyen/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
datquocnguyen/2025
datquocnguyen/datquocnguyen
datquocnguyen/datquocnguyen.github.io