anoopkunchukuttan
I work on Machine Learning and NLP. I am interested in multilingual computing, Indian language NLP and machine translation.
Microsoft Translator, AI4Bharat, IIT MadrasHyderabad, India
Pinned Repositories
Indic-BERT-v1
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
indicnlp_catalog
A collaborative catalog of NLP resources for Indic languages
crowd-indic-transliteration-data
Xlit-Crowd: Hindi-English Transliteration Corpus
geomm
Geometry-aware Multilingual Embeddings
indic_nlp_library
Resources and tools for Indian language Natural Language Processing
indic_nlp_resources
Resources to go with the Indic NLP Library
indowordnet_parallel
Parallel corpus mined from IndoWordnet synset gloss and examples
mlxlit
A Multilingual Neural Machine Transliteration System
multinmt_tutorial_coling2020
Material for the COLING 2020 Tutorial on Multilingual NMT
transliterator
Unsupervised Transliterator using phonetic features (particularly for Indian languages)
anoopkunchukuttan's Repositories
anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
anoopkunchukuttan/indic_nlp_resources
Resources to go with the Indic NLP Library
anoopkunchukuttan/geomm
Geometry-aware Multilingual Embeddings
anoopkunchukuttan/multinmt_tutorial_coling2020
Material for the COLING 2020 Tutorial on Multilingual NMT
anoopkunchukuttan/indowordnet_parallel
Parallel corpus mined from IndoWordnet synset gloss and examples
anoopkunchukuttan/multilingual_extend_llm
anoopkunchukuttan/moses_job_scripts
A simple experiment management system for Moses
anoopkunchukuttan/news_evaluation_script
NEWS shared task evaluation script (ported to Python 3)
anoopkunchukuttan/DataAugForLRL
Generalized Data Augmentation for Low-Resource Translation
anoopkunchukuttan/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
anoopkunchukuttan/huggingface_notebooks
Notebooks using the Hugging Face libraries 🤗
anoopkunchukuttan/indic_transliteration_analysis
anoopkunchukuttan/mctorch
A manifold optimization library for deep learning
anoopkunchukuttan/OpenNMT-tf
My customizations to OpenNMT-tf
anoopkunchukuttan/sacreBLEU-Indic
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
anoopkunchukuttan/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
anoopkunchukuttan/UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
anoopkunchukuttan/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
anoopkunchukuttan/gpt-MT
anoopkunchukuttan/indicnlp.ai4bharat.org
Archived old website for AI4Bhārat Indic-NLP
anoopkunchukuttan/Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
anoopkunchukuttan/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
anoopkunchukuttan/ml_timeline
Latest developments in LLM space
anoopkunchukuttan/MMQA
MMQA Dataset
anoopkunchukuttan/mRASP
anoopkunchukuttan/MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
anoopkunchukuttan/NER_Open_Data
This repo contains timely updated NER tagged data collected through a-mma NER data collection programme
anoopkunchukuttan/open-instruct
Fork of work in paper: "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources ."
anoopkunchukuttan/The-NLP-Pandect
A comprehensive reference for all topics related to Natural Language Processing
anoopkunchukuttan/yanmtt
Yet Another Neural Machine Translation Toolkit