multilingual-models
There are 49 repositories under multilingual-models topic.
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Astrotomic/laravel-translatable
A Laravel package for multilingual models
MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
frotms/PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
jpWang/LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
AI4Bharat/Indic-BERT-v1
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
backprop-ai/backprop
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
csebuetnlp/banglabert
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
ai-forever/mgpt
Multilingual Generative Pretrained Model
cisnlp/Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
PyThaiNLP/WangChanGLM
WangChanGLM 🐘 - The Multilingual Instruction-Following Model
kaistAI/LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
microsoft/Litmus
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
juletx/self-translate
Do Multilingual Language Models Think Better in English?
JAugusto97/ToLD-Br
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
MarkusSagen/Master-Thesis-Multilingual-Longformer
Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).
arman-aminian/video-search
Video Search with CLIP
OpenNyAI/Jugalbandi-Manager
Jugalbandi (JB) Manager is a full AI-powered conversational chatbot platform. It's platform agnostic and can serve multiple channels such as WhatsApp or custom web interfaces. It can handle conversations in both text and voice across any language. It comes with Bhashini Speech models out of the box and can failover to Azure.
Data-Science-kosta/Long-texts-Sentiment-Analysis-RoBERTa
PyTorch implementation of Sentiment Analysis of the long texts written in Serbian language (which is underused language) using pretrained Multilingual RoBERTa based model (XLM-R) on the small dataset.
Sigil-Wen/TTS
XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate
INK-USC/XCSR
Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"
lwachowiak/Multilingual-Metaphor-Detection
The multilingual language model XLM-R fine-tuned for metaphor detection on a token-level using Huggingface
firojalam/COVID-19-disinformation
Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society
margaritageleta/multilingual-toxicity-detector
NLP deep learning model for multilingual toxicity detection in text 📚
cambridgeltl/prompt4bli
On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
ishan00/meta-learning-for-multi-task-multilingual
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
mobassir94/Multilingual-Speech-to-Speech-Translator
Multilingual Speech to Speech (STS) Translator is the First Ever Code-mixed English-Arabic speech to Bangla-Arabic Speech Translator
vitthal-bhandari/Homophobia-Transphobia-Detection
Code for the shared task on homophobia/transphobia detection at LT-EDI Workshop @ ACL 2022
sitamgithub-MSIT/PicQ
PicQ: Demo for MiniCPM-V 2.6 to answer questions about images using natural language.
KnowledgeDiscovery/MuSES
Code for "Multilingual Sentiment Elicitation System for Social Media Data" @ IEEE Intelligent Systems
esoyeon/Multilingual-StyleCLIP
Multilingual-StyleCLIP is a model that can edit StyleGAN2 's images with a multilingual text prompt
fajri91/Multi_SummEval
Evaluating the Efficacy of Summarization Evaluation across Languages. In Findings of ACL 2021.
SINGHxTUSHAR/ANUVADAK
This Project is based on multilingual Translation by using the Transformer with an encoder-decoder architecture along with the multi-head self-attention layers with the positional encoding and embedding for better result and accuracy. Overall, this model converts the English to French language using various Techniques of NLP and DL.
sitamgithub-MSIT/VidiQA
VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using natural language.
uheal/machine-translation-models
This repository offers an evaluation of machine translation models for healthcare, focusing on languages like Telugu, Hindi, Arabic, and Swahili. It emphasizes accuracy and medical terminology, aiming to enhance medical communication across diverse languages. The dataset used in evaluation is provided.
cambridgeltl/sail-bli
Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.