mbert

There are 40 repositories under mbert topic.

csebuetnlp/banglabert
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
Language:Python245 8 833
cambridgeltl/ContrastiveBLI
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
Language:Python35 6 010
lirondos/lazaro
An observatory of anglicism usage in the Spanish press
Language:Python11 2 02
ishan00/meta-learning-for-multi-task-multilingual
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
Language:Python9 2 12
Mukaffi28/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Language:Jupyter Notebook9 1 05
negar-foroutan/multiLMs-lang-neutral-subnets
[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
Language:Python8 1 31
harikrishnan669/Youtube_summarizer
Your personal study assistant! This bot makes learning easier by converting YouTube videos into summarized notes with key points. Just send a video link, and get a clear PDF summary—perfect for studying, revising, or quickly understanding any topic.
Language:Python6
fatemafaria142/MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection
This study introduces MultiBanFakeDetect, a novel multimodal dataset for Bangla fake news detection, combining textual and visual information. It features TextFakeNet for text analysis and MultiFusionFake for integrating multimodal data.
Language:Jupyter Notebook4 1 02
juletx/multilingual-question-answering
Zero-shot and Translation Experiments on XQuAD, MLQA and TyDiQA
Language:Jupyter Notebook4 1 01
BassaniRiccardo/ICEBERT
ICEBERT: Interlingual-Clusters Enhanced BERT. A BERT-like model trained on clusters of monolingual subwords.
Language:Python3 0 00
DiFronzo/Multilingual-Models
mBERT and XLM-R for encodeing of Scandinavian languages
Language:Python3 2 00
fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI
This research examines the performance of Large Language Models (GPT-3.5 Turbo and Gemini 1.5 Pro) in Bengali Natural Language Inference, comparing them with state-of-the-art models using the XNLI dataset. It explores zero-shot and few-shot scenarios to evaluate their efficacy in low-resource settings.
Language:Jupyter Notebook3 1 01
elsheikh21/cross-natural-language-inference
ZeroShot XNLI
Language:Python2 1 00
AditiBagora/Hasoc2021CodeMix
HASOC2021: Subtask 2 a) Codemix Challenge; Contains baselines and hierarchical approach that extracts the relevant context useful for classification of hostile tweets on English-Hindi code-mix data obtained from twitter.
Language:Jupyter Notebook1 2 01
Elijas/lithuanian-text-summarization-model
Deployed model which can summarize Lithuanian language text by leveraging Artificial Neural Networks, Transformers, mBERT.
Language:Python1 2 00
michaelpeterhoffmann/masterthesis
Multilingual hate speech detection for German, Italian and Spanish Social Media Posts #machine learning #classifier
Language:Jupyter Notebook1 1 00
peterzee-tsien/LING484-COMP599-Final-Projects
By using the hypothesis of historical linguistics, we found a way to improve the performance of multilingual transformers with limited amount of data
Language:Jupyter Notebook1 1 00
sankar-2002/Gendered_Abuse_Detection_In_Indic-Languages
Online gender-based violence limits marginalized voices. Detection in Indic languages is hard due to limited data and linguistic complexity. This work builds better classifiers for improved abuse detection in such settings.
Language:Jupyter Notebook1
SKG24/VIVARAN_chatbot_Supreme-court-hackathon
It is an ideation of the AI powered chatbot to help in legal understanding of the Indian government and its laws. To reach larger audience it supports all the constitutional languages.
1 1 0
Ali-Mhrez/Stance-Detection-MBERT-Features
This repository contains the code for a Ph.D. research project that focuses on improving the performance of mBERT for fake news stance detection. Our key contribution is the BERT-ESDM methodology, a novel approach that uses convolutional neural networks to enrich mBERT's contextual embeddings.
Language:Jupyter Notebook0 1 00
Ali-Mhrez/Stance-Detection-MLLM
This repository is dedicated to a Ph.D. research project that systematically investigates the effectiveness of multilingual transformer models on the task of stance detection. The goal is to not only benchmark these models but also to analyze their ability to handle linguistic challenges, transfer knowledge, and perform under dataset constraints.
Language:Jupyter Notebook0 1 00
fatemafaria142/BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification
This study presents a hybrid multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
Language:Jupyter Notebook0 0 00
fatemafaria142/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
This study addresses the gap in translating Bangla regional dialects into standard Bangla by creating a large-scale multilingual benchmark dataset of 32,500 sentences in Bangla, Banglish, and English, representing five regional Bangla dialects such as Sylheti, Chittagong, Mymensingh, Noakhali, and Barishal.
Language:Jupyter Notebook0 0 00
jessicasaikia/multilingual-BERT-mBERT
This repository implements a Multilingual BERT (mBERT) model for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
Language:Python0 1 00
joyou159/SWIZT
Exploring the use of multilingual transformers, specifically mBERT and XLM-RoBERTa, for named entity recognition (NER) in the context of Switzerland’s multi lingual environment.
Language:Jupyter Notebook00
Koharu24/mBERT_crosslingual_rd
This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms
Language:Python0 0 00
Manoj632004/Question_Answering_System
The Multilingual QA System is a Flask-based web app that allows users to ask questions in multiple languages (Tamil, English) and receive accurate answers. Using pretrained transformer models for efficient question answering.
Language:Jupyter Notebook0 1 00
MusfiqDehan/Multilingual-Sentence-Alignments-Demo
Align Parallel Sentence of 104 Languages with the help of mBERT and LaBSE
Language:Python0 1 00
NasserMohamedEid/Text-AI-Detection
Language:Jupyter Notebook0 1 10
Revanth-Reddy-Pingala/Abusive_Comment_Detector_BERT
Fine tuned BERT, mBERT and XLMRoBERTa for Abusive Comments Detection in Telugu, Code-Mixed Telugu and Telugu-English.
Language:Jupyter Notebook0 2 00
RobinSmits/GPT-3.5-FineTuning
GPT 3.5 FineTuning
Language:Jupyter Notebook0 1 00
ShafakatArnob/Bengali-Misogyny-Identification-Deep-Learning-LIME
Bengali Misogyny Identification with Deep Learning and LIME.
Language:Jupyter Notebook0 1 00
reascr/Cross-Lingual-Transfer-of-Grammatical-Gender
Language:Jupyter Notebook
shaitarAn/subword-evenness-crosslingual-transfer
Analysis of subword evenness as a predictor of cross-lingual transfer success in multilingual language models (mBERT, XLM-R, mT5)
Soumyo001/sentiment-emotion_detection_on_bengali_product_reviews
Sentiment and emotion detection using mBERT and XLM-R. It comes with a trained model which you can download and test it. Read below for instructions.
Language:Jupyter Notebook
Tanlouie/Gendered_Abuse_Detection_In_Indic-Languages
Online gender-based violence limits marginalized voices. Detection in Indic languages is hard due to limited data and linguistic complexity. This work builds better classifiers for improved abuse detection in such settings.
Language:Jupyter Notebook

mbert

csebuetnlp/banglabert

cambridgeltl/ContrastiveBLI

lirondos/lazaro

ishan00/meta-learning-for-multi-task-multilingual

Mukaffi28/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset

negar-foroutan/multiLMs-lang-neutral-subnets

harikrishnan669/Youtube_summarizer

fatemafaria142/MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection

juletx/multilingual-question-answering

BassaniRiccardo/ICEBERT

DiFronzo/Multilingual-Models

fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI

elsheikh21/cross-natural-language-inference

AditiBagora/Hasoc2021CodeMix

Elijas/lithuanian-text-summarization-model

michaelpeterhoffmann/masterthesis

peterzee-tsien/LING484-COMP599-Final-Projects

sankar-2002/Gendered_Abuse_Detection_In_Indic-Languages

SKG24/VIVARAN_chatbot_Supreme-court-hackathon

Ali-Mhrez/Stance-Detection-MBERT-Features

Ali-Mhrez/Stance-Detection-MLLM

fatemafaria142/BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification

fatemafaria142/Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset

jessicasaikia/multilingual-BERT-mBERT

joyou159/SWIZT

Koharu24/mBERT_crosslingual_rd

Manoj632004/Question_Answering_System

MusfiqDehan/Multilingual-Sentence-Alignments-Demo

NasserMohamedEid/Text-AI-Detection

Revanth-Reddy-Pingala/Abusive_Comment_Detector_BERT

RobinSmits/GPT-3.5-FineTuning

ShafakatArnob/Bengali-Misogyny-Identification-Deep-Learning-LIME

reascr/Cross-Lingual-Transfer-of-Grammatical-Gender

shaitarAn/subword-evenness-crosslingual-transfer

Soumyo001/sentiment-emotion_detection_on_bengali_product_reviews

Tanlouie/Gendered_Abuse_Detection_In_Indic-Languages