indolem
"Indonesian Language Evaluation Montage”, a comprehensive dataset encompassing spanning morpho-syntax, semantics, and discourse for Indonesian NLP.
Pinned Repositories
IndoBERTweet
IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)
indolem
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
indolem.github.io
blog & blog theme🤘
sum_liputan6
The first large-scale summarization corpus for the Indonesian language. AACL 2020.
indolem's Repositories
indolem/indolem
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
indolem/IndoBERTweet
IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)
indolem/sum_liputan6
The first large-scale summarization corpus for the Indonesian language. AACL 2020.
indolem/indolem.github.io
blog & blog theme🤘