word-alignment

There are 27 repositories under the word-alignment topic.

  • neulab/awesome-align

    A neural word aligner based on multilingual BERT

    Language: Python
  • THUNLP-MT/Mask-Align

    Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021

    Language: Python
  • cambridgeltl/ContrastiveBLI

    Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

    Language: Python
  • yaoyiran/BLI-Reading-List

    A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.

    Language: Python
  • Heidelberg-NLP/xsrl_mbert_aligner

    X-SRL dataset, including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual BERT embeddings.

    Language: Python
  • cambridgeltl/BLICEr

    Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

    Language: Python
  • sfu-natlang/HMM-Aligner

    An implementation of a word aligner using a Hidden Markov Model

    Language: Python
  • andreabac3/Word_Alignment_BERT

    This project provides an API to perform word alignment

    Language: Python
  • ruoyuxie/noisy_parallel_data_alignment

    Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685

    Language: Python
  • zhangmozhi/iternorm

    Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)

    Language: Python
  • qiyuw/WSPAlign

    WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction, to appear at ACL 2023 main conference.

    Language: Python
  • zhangmozhi/retrofit_clwe

    Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries (ACL 2020)

    Language: Python
  • zouharvi/SlowAlignDisplayer

    Create "pretty" graphs for aligned sentences

    Language: TypeScript
  • pixelneo/parapipeline

    A pipeline for POS tagging, sentence alignment, word alignment, and transliteration of texts in 30+ languages.

    Language: Python
  • lankaraniamir/lyric-source-separation

    Using alignments and posteriorgrams extracted from lyrics as novel input into source separation models

    Language: Jupyter Notebook
  • qiyuw/WSPAlign.InferEval

    Inference library and evaluation script for WSPAlign (https://github.com/qiyuw/WSPAlign)

    Language: Python
  • desilinguist/wordalignui

    Java application for creating bilingual word alignments

    Language: Java
  • kukas/word-alignment-visualization

    Word Alignment Visualization is a Python package for visualizing word alignments between two sentences in a Jupyter notebook. The package provides an interactive widget that displays original and translated sentences with word alignment lines.

    Language: Jupyter Notebook
  • DorinK/Assignment-1-IBM-Models

    Assignment 1: Word Alignment in 'Statistical Machine Translation' course by Dr. Roee Aharoni at Bar-Ilan University.

    Language: Python
  • maxkagamine/word-alignment-demo

    Demonstration of AI/neural word alignment of English & Japanese text using mBERT-based machine learning models.

    Language: Python
  • TajaKuzman/Parlamint-translation

    A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XML and CoNLL-U files, linguistic processing with the Stanza pipeline, and machine translation and word alignment with the Eflomal tool.

    Language: Jupyter Notebook
  • borh-lab/lexi-align

    Word alignment of multilingual sentences using structured generation

    Language: Python
  • npedrazzini/parallelbibles

    Word-alignment models for Bible translations in 100+ historical and contemporary languages

    Language: R
  • zouharvi/LeverageAlign

    Leveraging Almost Black-Box NMT for Word Alignment

    Language: TeX
  • AdityaYadavalli1/IBM-Model1

    A simple replica of IBM Model 1, trained to find word alignments between two Indo-European languages: English and Hindi

    Language: Jupyter Notebook
  • hellomasaya/word-alignment-models

    IBM Model 1

    Language: Jupyter Notebook
  • pkolachi/wordpairings

    Word alignment methods to extract bilingual and multilingual lexica

    Language: Python