mbert
There are 40 repositories under mbert topic.
csebuetnlp/banglabert
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
cambridgeltl/ContrastiveBLI
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
lirondos/lazaro
An observatory of anglicism usage in the Spanish press
ishan00/meta-learning-for-multi-task-multilingual
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
Mukaffi28/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
negar-foroutan/multiLMs-lang-neutral-subnets
[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
harikrishnan669/Youtube_summarizer
Your personal study assistant! This bot makes learning easier by converting YouTube videos into summarized notes with key points. Just send a video link, and get a clear PDF summary—perfect for studying, revising, or quickly understanding any topic.
fatemafaria142/MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection
This study introduces MultiBanFakeDetect, a novel multimodal dataset for Bangla fake news detection, combining textual and visual information. It features TextFakeNet for text analysis and MultiFusionFake for integrating multimodal data.
juletx/multilingual-question-answering
Zero-shot and Translation Experiments on XQuAD, MLQA and TyDiQA
BassaniRiccardo/ICEBERT
ICEBERT: Interlingual-Clusters Enhanced BERT. A BERT-like model trained on clusters of monolingual subwords.
DiFronzo/Multilingual-Models
mBERT and XLM-R for encodeing of Scandinavian languages
fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI
This research examines the performance of Large Language Models (GPT-3.5 Turbo and Gemini 1.5 Pro) in Bengali Natural Language Inference, comparing them with state-of-the-art models using the XNLI dataset. It explores zero-shot and few-shot scenarios to evaluate their efficacy in low-resource settings.
elsheikh21/cross-natural-language-inference
ZeroShot XNLI
AditiBagora/Hasoc2021CodeMix
HASOC2021: Subtask 2 a) Codemix Challenge; Contains baselines and hierarchical approach that extracts the relevant context useful for classification of hostile tweets on English-Hindi code-mix data obtained from twitter.
Elijas/lithuanian-text-summarization-model
Deployed model which can summarize Lithuanian language text by leveraging Artificial Neural Networks, Transformers, mBERT.
michaelpeterhoffmann/masterthesis
Multilingual hate speech detection for German, Italian and Spanish Social Media Posts #machine learning #classifier
peterzee-tsien/LING484-COMP599-Final-Projects
By using the hypothesis of historical linguistics, we found a way to improve the performance of multilingual transformers with limited amount of data
sankar-2002/Gendered_Abuse_Detection_In_Indic-Languages
Online gender-based violence limits marginalized voices. Detection in Indic languages is hard due to limited data and linguistic complexity. This work builds better classifiers for improved abuse detection in such settings.
SKG24/VIVARAN_chatbot_Supreme-court-hackathon
It is an ideation of the AI powered chatbot to help in legal understanding of the Indian government and its laws. To reach larger audience it supports all the constitutional languages.
Ali-Mhrez/Stance-Detection-MBERT-Features
This repository contains the code for a Ph.D. research project that focuses on improving the performance of mBERT for fake news stance detection. Our key contribution is the BERT-ESDM methodology, a novel approach that uses convolutional neural networks to enrich mBERT's contextual embeddings.
Ali-Mhrez/Stance-Detection-MLLM
This repository is dedicated to a Ph.D. research project that systematically investigates the effectiveness of multilingual transformer models on the task of stance detection. The goal is to not only benchmark these models but also to analyze their ability to handle linguistic challenges, transfer knowledge, and perform under dataset constraints.
fatemafaria142/BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification
This study presents a hybrid multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
fatemafaria142/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
This study addresses the gap in translating Bangla regional dialects into standard Bangla by creating a large-scale multilingual benchmark dataset of 32,500 sentences in Bangla, Banglish, and English, representing five regional Bangla dialects such as Sylheti, Chittagong, Mymensingh, Noakhali, and Barishal.
jessicasaikia/multilingual-BERT-mBERT
This repository implements a Multilingual BERT (mBERT) model for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
joyou159/SWIZT
Exploring the use of multilingual transformers, specifically mBERT and XLM-RoBERTa, for named entity recognition (NER) in the context of Switzerland’s multi lingual environment.
Koharu24/mBERT_crosslingual_rd
This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms
Manoj632004/Question_Answering_System
The Multilingual QA System is a Flask-based web app that allows users to ask questions in multiple languages (Tamil, English) and receive accurate answers. Using pretrained transformer models for efficient question answering.
MusfiqDehan/Multilingual-Sentence-Alignments-Demo
Align Parallel Sentence of 104 Languages with the help of mBERT and LaBSE
Revanth-Reddy-Pingala/Abusive_Comment_Detector_BERT
Fine tuned BERT, mBERT and XLMRoBERTa for Abusive Comments Detection in Telugu, Code-Mixed Telugu and Telugu-English.
RobinSmits/GPT-3.5-FineTuning
GPT 3.5 FineTuning
ShafakatArnob/Bengali-Misogyny-Identification-Deep-Learning-LIME
Bengali Misogyny Identification with Deep Learning and LIME.
shaitarAn/subword-evenness-crosslingual-transfer
Analysis of subword evenness as a predictor of cross-lingual transfer success in multilingual language models (mBERT, XLM-R, mT5)
Soumyo001/sentiment-emotion_detection_on_bengali_product_reviews
Sentiment and emotion detection using mBERT and XLM-R. It comes with a trained model which you can download and test it. Read below for instructions.
Tanlouie/Gendered_Abuse_Detection_In_Indic-Languages
Online gender-based violence limits marginalized voices. Detection in Indic languages is hard due to limited data and linguistic complexity. This work builds better classifiers for improved abuse detection in such settings.