multilingual-models

There are 49 repositories under multilingual-models topic.

  • linto-ai/whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Language:Python2.1k31158163
  • laravel-translatable

    Astrotomic/laravel-translatable

    A Laravel package for multilingual models

    Language:PHP1.3k20258159
  • MilaNLProc/contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

    Language:Python1.2k17109147
  • frotms/PaddleOCR2Pytorch

    PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

    Language:Python8891688175
  • jpWang/LiLT

    Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

    Language:Python34564641
  • AI4Bharat/Indic-BERT-v1

    Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT

    Language:Python279182941
  • backprop

    backprop-ai/backprop

    Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

    Language:Python24316912
  • csebuetnlp/banglabert

    This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.

    Language:Python2348831
  • ai-forever/mgpt

    Multilingual Generative Pretrained Model

    Language:Jupyter Notebook202121323
  • cisnlp/Glot500

    Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

    Language:Python99883
  • PyThaiNLP/WangChanGLM

    WangChanGLM 🐘 - The Multilingual Instruction-Following Model

    Language:Jupyter Notebook94426
  • kaistAI/LangBridge

    [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision

    Language:Python821187
  • microsoft/Litmus

    AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

    Language:Python45409
  • juletx/self-translate

    Do Multilingual Language Models Think Better in English?

    Language:Jupyter Notebook41225
  • JAugusto97/ToLD-Br

    Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis

    Language:Jupyter Notebook37417
  • MarkusSagen/Master-Thesis-Multilingual-Longformer

    Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).

    Language:Jupyter Notebook32278
  • arman-aminian/video-search

    Video Search with CLIP

    Language:Jupyter Notebook27000
  • OpenNyAI/Jugalbandi-Manager

    Jugalbandi (JB) Manager is a full AI-powered conversational chatbot platform. It's platform agnostic and can serve multiple channels such as WhatsApp or custom web interfaces. It can handle conversations in both text and voice across any language. It comes with Bhashini Speech models out of the box and can failover to Azure.

    Language:Python2754533
  • Data-Science-kosta/Long-texts-Sentiment-Analysis-RoBERTa

    PyTorch implementation of Sentiment Analysis of the long texts written in Serbian language (which is underused language) using pretrained Multilingual RoBERTa based model (XLM-R) on the small dataset.

    Language:Jupyter Notebook26227
  • Sigil-Wen/TTS

    XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate

    Language:Python24117
  • INK-USC/XCSR

    Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

    Language:Python22812
  • lwachowiak/Multilingual-Metaphor-Detection

    The multilingual language model XLM-R fine-tuned for metaphor detection on a token-level using Huggingface

    Language:Jupyter Notebook20105
  • COVID-19-disinformation

    firojalam/COVID-19-disinformation

    Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

    Language:Jupyter Notebook11224
  • margaritageleta/multilingual-toxicity-detector

    NLP deep learning model for multilingual toxicity detection in text 📚

    Language:Jupyter Notebook11211
  • cambridgeltl/prompt4bli

    On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language:Python9702
  • ishan00/meta-learning-for-multi-task-multilingual

    Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021

    Language:Python9212
  • mobassir94/Multilingual-Speech-to-Speech-Translator

    Multilingual Speech to Speech (STS) Translator is the First Ever Code-mixed English-Arabic speech to Bangla-Arabic Speech Translator

    Language:Jupyter Notebook8211
  • Homophobia-Transphobia-Detection

    vitthal-bhandari/Homophobia-Transphobia-Detection

    Code for the shared task on homophobia/transphobia detection at LT-EDI Workshop @ ACL 2022

    Language:Jupyter Notebook5101
  • sitamgithub-MSIT/PicQ

    PicQ: Demo for MiniCPM-V 2.6 to answer questions about images using natural language.

    Language:Python4200
  • KnowledgeDiscovery/MuSES

    Code for "Multilingual Sentiment Elicitation System for Social Media Data" @ IEEE Intelligent Systems

    Language:Python3001
  • esoyeon/Multilingual-StyleCLIP

    Multilingual-StyleCLIP is a model that can edit StyleGAN2 's images with a multilingual text prompt

    Language:Python2101
  • fajri91/Multi_SummEval

    Evaluating the Efficacy of Summarization Evaluation across Languages. In Findings of ACL 2021.

    Language:Jupyter Notebook2201
  • SINGHxTUSHAR/ANUVADAK

    This Project is based on multilingual Translation by using the Transformer with an encoder-decoder architecture along with the multi-head self-attention layers with the positional encoding and embedding for better result and accuracy. Overall, this model converts the English to French language using various Techniques of NLP and DL.

    Language:Jupyter Notebook2100
  • sitamgithub-MSIT/VidiQA

    VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using natural language.

    Language:Python220
  • uheal/machine-translation-models

    This repository offers an evaluation of machine translation models for healthcare, focusing on languages like Telugu, Hindi, Arabic, and Swahili. It emphasizes accuracy and medical terminology, aiming to enhance medical communication across diverse languages. The dataset used in evaluation is provided.

  • cambridgeltl/sail-bli

    Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language:Python1601