code-switching
There are 52 repositories under the code-switching topic.
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
microsoft/CodeMixed-Text-Generator
This tool automatically generates grammatically valid synthetic code-mixed data using linguistic theories such as the Equivalence Constraint Theory and the Matrix Language Theory.
microsoft/LID-tool
This code provides a word-level language identification tool that identifies the language of individual words in code-mixed text, e.g. text that includes words from two languages, such as Hindi written in Roman script mixed with English.
audioku/meta-transfer-learning
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
sagorbrur/codeswitch
CodeSwitch is an NLP tool for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
Nativeatom/NaturalLanguageProcessing
Natural Language Processing
gentaiscool/meta-emb
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
andi611/CS-Tacotron-Pytorch
PyTorch implementation of CS-Tacotron, an end-to-end generative TTS model for code-switching speech synthesis.
cisnlp/MaskLID
💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024
amsuhane/ACL20-Code-switching-patterns
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
gentaiscool/multi-task-cs-lm
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning (CALCS 2018, ACL)
sedflix/unsacmt
Unsupervised Sentiment Analysis for Code-mixed Data
ash-shar/Code-Switching-and-Swearing-Patterns-on-Twitter
Repository containing code for abusive tweet detection, location detection, and gender detection
dieuthu/sequencetagging
A sequence tagging model with active learning
feyzaakyurek/newsframing
Code repository for the ACL 2020 paper "Multi-label and Multilingual News Framing Analysis"
javadr/PyTorch-Detect-Code-Switching
Implementation of a deep learning model (BiLSTM) to detect code-switching
PPPI/POSIT
POSIT aims to segment and tag mixed text that contains English and C-like code, so that the user knows both what each token is and what role it serves within the language it is used in, such as an AST tag or PoS tag.
umar1997/propaganda-codeswitched-text
[EMNLP 2023] Official repository of the paper "Detecting Propaganda Techniques in Code-Switched Social Media Text"
mmaguero/josa-corpus
Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus
pika-online/Foreign_Pronunciation_Generator_for_Code-Switch_ASR
A socket script to obtain the Chinese phone sequence for any English word
sophiayk20/covoswitch
Code for "CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units" (Accepted at ACL-SRW 2024) 🇹🇭
ishan00/translation-for-code-switching-acl
Official repository for the paper titled "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" accepted at ACL 2021
vincenthuang75025/chinglish
Chrome extension for translating highlighted English text into Chinglish (a Chinese + English hybrid)
97arushisharma/Hindi-English-Code-Switching
A simple UI to translate text written in romanised Hindi into a fully English sentence
Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone
Japanese Speaking English Speech Dataset
Nexdata-AI/300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
vsoto/crowdsourced_bangor
This repository contains crowdsourced universal part-of-speech tags for the Miami Bangor corpus.
andrianllmm/tagLID
A word-level Language Identification (LID) tool for Tagalog-English (Taglish) text.
ChingtingC/Code-Switching-Sentence-Generation-by-GAN
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)
kjgpta/Code-Switch-Language-Modeling-for-English-and-Malay
Code-switched data generation based on part-of-speech tags, with language modeling of the generated text.
Lidan0241/language-detection
A language detection model for code-switched texts in es/en/zh
Wei-RongRong2/RojakLanguageSentimentAnalysis
This is a machine learning project focused on analysing and classifying sentiments in code-switched and code-mixed text, specifically targeting the unique linguistic characteristics found in Malaysian conversations.
yihao001/singlish-polarity-detection
Modelling code-switching in Singlish for polarity detection
Dharani-S93/MULTILINGUAL-ASR-FOR-INDIAN-LANGUAGES
The "Multilingual Code Switching ASR" project develops Automatic Speech Recognition (ASR) technology for Tamil-English speech in India. Tamil, culturally significant in southern India, is mixed with English, a global language, reflecting India's bilingual nature.
selinah66/NeurotechUSC-Bilingual-Code-Switching
This project aims to use existing open-source eye-tracking data on code-switching in bilingual Chinese-English individuals to train a machine learning model that predicts code-switching.