vyraun

Senior Research Scientist at Microsoft

Microsoft, Redmond

Pinned Repositories

GEM-metrics
Automatic metrics for GEM tasks
Language: Python
61 3 6120
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language: Python
2.9k 51 151592
dlp
Code for "On Dimensional Linguistic Properties of the Word Embedding Space".
Language: Python
8 4 01
DNI-tensorflow
DNI (Decoupled Neural Interfaces using Synthetic Gradients) Implementation with Tensorflow.
Language: Python
28 8 014
Finding-Memo
Code for "Extractive Memorization in Constrained Sequence Generation Tasks"
Language: Python
4 2 00
Half-Size
Code for "Effective Dimensionality Reduction for Word Embeddings".
Language: Python
128 9 424
hallucinations
Code for "The Curious Case of Hallucinations in Neural Machine Translation".
6 6 20
long-tailed
Code for "On Long-Tailed Phenomena in NMT".
Language: Python
10 4 03
Megalodon
Various ML/DL Resources organised at a single place.
185 19 043
seq2set-keras
An Implementation of Seq2Set (Pointer Network) in Keras.
Language: Jupyter Notebook
9 3 01

vyraun's Repositories

vyraun/Megalodon
Various ML/DL Resources organised at a single place.
185 19 043
vyraun/Half-Size
Code for "Effective Dimensionality Reduction for Word Embeddings".
Language: Python
128 9 424
vyraun/long-tailed
Code for "On Long-Tailed Phenomena in NMT".
Language: Python
10 4 03
vyraun/dlp
Code for "On Dimensional Linguistic Properties of the Word Embedding Space".
Language: Python
8 4 01
vyraun/hallucinations
Code for "The Curious Case of Hallucinations in Neural Machine Translation".
6 6 20
vyraun/Finding-Memo
Code for "Extractive Memorization in Constrained Sequence Generation Tasks"
Language: Python
4 2 00
vyraun/literalness
Code for "Do GPTs Produce Less Literal Translations?"
Language: Python
2 2 01
vyraun/blindspots
Seq2Seq Blindspots
Language: PLSQL
1 5 0
vyraun/assignment_2
Low Resource Machine Translation.
Language: Python
5 02
vyraun/awesome-align
A word aligner based on multilingual encoders
Language: Python
2 01
vyraun/bert_score
BERT score for text generation
Language: Jupyter Notebook
2 0
vyraun/biaffine-ner
Named Entity Recognition as Dependency Parsing
Language: Python
2 0
vyraun/BIG-bench
Beyond the Imitation Game collaborative benchmark for enormous language models
Language: Jupyter Notebook
2 0
vyraun/bin
bin files
Language: Python
2 0
vyraun/bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
Language: Python
2 0
vyraun/CMC
pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination"
Language: Python
3 0
vyraun/cookbook
The Unicode Cookbook for Linguists
Language: TeX
2 0
vyraun/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Language: Python
2 0
vyraun/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python
1 0
vyraun/infotabs
Language: HTML
2 0
vyraun/LM_NE_bias
Named Entity Biases in Pre-trained Language Models
Language: Jupyter Notebook
2 0
vyraun/loss_dropper
Language: Python
1 0
vyraun/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Language: Python
2 0
vyraun/TVCaption
PyTorch implementation of MMT on TVCaption dataset
Language: Python
3 0
vyraun/whisper
Language: Jupyter Notebook
1 0
vyraun/Wikilingua
Multilingual abstractive summarization dataset extracted from WikiHow.
2 0
vyraun/wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
2 0
vyraun/wmt-format-tools
Tools for formatting WMT hypothesis and test sets in XML
Language: Python
2 0
vyraun/wmt21-news-systems
Language: Smalltalk
2 0
vyraun/xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
Language: Shell
2 0