bilingual-lexicon-extraction

There are 15 repositories under the bilingual-lexicon-extraction topic.

  • kakaobrain/word2word

    Easy-to-use word-to-word translations for 3,564 language pairs.

    Language: Python
  • kbatsuren/CogNet

    CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

  • cambridgeltl/ContrastiveBLI

    Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

    Language: Python
  • yaoyiran/BLI-Reading-List

    A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.

    Language: Python
  • cambridgeltl/BLICEr

    Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

    Language: Python
  • cambridgeltl/prompt4bli

    On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language: Python
  • zhangmozhi/iternorm

    Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)

    Language: Python
  • THUNLP-MT/UBiLexAT

    An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Adversarial Training

    Language: Python
  • THUNLP-MT/BiLex

    A Bilingual Lexicon Inducer From Non-Parallel Data

    Language: C
  • THUNLP-MT/UBiLexEMD

    An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Earth Mover's Distance Minimization

    Language: Python
  • accurat-toolkit/DEACC

    Lexical dictionary extractor from comparable corpora

    Language: C#
  • cambridgeltl/sail-bli

    Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

    Language: Python
  • fdschmidt93/DynaDict

    Bilingual n-gram Phrase Table Induction with Dynamax-Jaccard

    Language: Python
  • jolivaresc/TSTL

    Temas Selectos de Tecnologías del Lenguaje (Selected Topics in Language Technologies)

    Language: Jupyter Notebook
  • fdschmidt93/procrustes

    Weakly-supervised bilingual lexicon induction

    Language: Python