multilingual-models

There are 58 repositories under the multilingual-models topic.

  • linto-ai/whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Language: Python
  • Astrotomic/laravel-translatable

    A Laravel package for multilingual models

    Language: PHP
  • MilaNLProc/contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

    Language: Python
  • frotms/PaddleOCR2Pytorch

    PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

    Language: Python
  • jpWang/LiLT

    Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

    Language: Python
  • AI4Bharat/Indic-BERT-v1

    Indic-BERT-v1: BERT-based multilingual model for 11 Indic languages and Indian English. For the latest Indic-BERT v2, see https://github.com/AI4Bharat/IndicBERT

    Language: Python
  • csebuetnlp/banglabert

    This repository contains the official release of the model "BanglaBERT" and the associated downstream fine-tuning code and datasets introduced in the paper "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla", accepted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.

    Language: Python
  • backprop-ai/backprop

    Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

    Language: Python
  • ai-forever/mgpt

    Multilingual Generative Pretrained Model

    Language: Jupyter Notebook
  • cisnlp/Glot500

    Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

    Language: Python
  • PyThaiNLP/WangChanGLM

    WangChanGLM 🐘 - The Multilingual Instruction-Following Model

    Language: Jupyter Notebook
  • kaistAI/LangBridge

    [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision

    Language: Python
  • sail-sg/sailor2

    🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

  • microsoft/Litmus

    AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

    Language: Python
  • joaoaleite/ToLD-Br

    Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis

    Language: Jupyter Notebook
  • juletx/self-translate

    Do Multilingual Language Models Think Better in English?

    Language: Jupyter Notebook
  • floatai/HumanEval-XL

    [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization

    Language: Python
  • OpenNyAI/Jugalbandi-Manager

    Jugalbandi (JB) Manager is a full AI-powered conversational chatbot platform. It is platform-agnostic and can serve multiple channels such as WhatsApp or custom web interfaces. It can handle conversations in both text and voice across any language. It comes with Bhashini Speech models out of the box and can fail over to Azure.

    Language: Python
  • MarkusSagen/Master-Thesis-Multilingual-Longformer

    Master's thesis with code investigating methods for incorporating long-context reasoning in low-resource languages without the need to pre-train from scratch. We investigated whether multilingual models could inherit these properties by converting them into an Efficient Transformer (such as the Longformer architecture).

    Language: Jupyter Notebook
  • arman-aminian/video-search

    Video Search with CLIP

    Language: Jupyter Notebook
  • Data-Science-kosta/Long-texts-Sentiment-Analysis-RoBERTa

    PyTorch implementation of sentiment analysis of long texts written in Serbian (an underused language), using a pretrained multilingual RoBERTa-based model (XLM-R) on a small dataset.

    Language: Jupyter Notebook
  • Sigil-Wen/TTS

    XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate

    Language: Python
  • INK-USC/XCSR

    Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

    Language: Python
  • lwachowiak/Multilingual-Metaphor-Detection

    The multilingual language model XLM-R fine-tuned for token-level metaphor detection using Hugging Face

    Language: Jupyter Notebook
  • firojalam/COVID-19-disinformation

    Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

    Language: Jupyter Notebook
  • margaritageleta/multilingual-toxicity-detector

    NLP deep learning model for multilingual toxicity detection in text 📚

    Language: Jupyter Notebook
  • cambridgeltl/prompt4bli

    On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language: Python
  • ishan00/meta-learning-for-multi-task-multilingual

    Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021

    Language: Python
  • mobassir94/Multilingual-Speech-to-Speech-Translator

    Multilingual Speech to Speech (STS) Translator is the First Ever Code-mixed English-Arabic speech to Bangla-Arabic Speech Translator

    Language: Jupyter Notebook
  • vitthal-bhandari/Homophobia-Transphobia-Detection

    Code for the shared task on homophobia/transphobia detection at LT-EDI Workshop @ ACL 2022

    Language: Jupyter Notebook
  • sitammeur/PicQ

    PicQ: Demo for MiniCPM-o 2.6 to answer questions about images using natural language.

    Language: Python
  • cambridgeltl/sail-bli

    Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language: Python
  • KnowledgeDiscovery/MuSES

    Code for "Multilingual Sentiment Elicitation System for Social Media Data" @ IEEE Intelligent Systems

    Language: Python
  • azminewasi/AyaFestPe

    Developing AyaFestPe, A Multi-lingual and Multi-cultural Festival Exploration Guide

    Language: Jupyter Notebook
  • pixiiidust/semantic-ising

    Can different languages reveal the same underlying meaning space? This tool visualizes how words align across languages as they interact in a dynamic system. Inspired by the Platonic idea of ideal forms, it explores whether universal semantics emerge from linguistic diversity when viewed through the lens of energy and structure. (WIP)

    Language: Python
  • sitammeur/VidiQA

    VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using natural language.

    Language: Python