vyraun

Senior Research Scientist at Microsoft

MicrosoftRedmond

vyraun's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python71.9k 584 08.5k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.4k 159 1.5k2.3k
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python19.3k 279 3k2.7k
FMInference/FlexiGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language:Python9.2k 112 82548
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python8.9k 62 1.5k567
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language:Python4.6k 82 244371
jeffkaufman/icdiff
improved colored diff
Language:Python4.2k 62 132175
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language:C++3.4k 60 706305
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
Language:Python3.1k 44 297358
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python2.9k 51 151592
libffcv/ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Language:Python2.9k 21 282179
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language:Python2.1k 46 129150
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
Language:Python2k 36 1.1k254
microsoft/mup
maximal update parametrization (µP)
Language:Jupyter Notebook1.4k 29 6295
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Language:C++1.2k 24 20286
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language:Python1k 20 75108
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Language:Python778 23 52196
mayeaux/generate-subtitles
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
Language:JavaScript762 15 27106
dustinvtran/latex-templates
A collection of LaTeX templates used for research, courses, and miscellanea.
Language:TeX747 20 2151
google-research/bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
Language:Python700 13 5185
songhwanjun/Awesome-Noisy-Labels
A Survey
543 5 131
microsoft/gpt-MT
Language:Ruby84 9 58
facebookresearch/mlqe
We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scale 1 to 100) generated though human evaluations that represent the quality of the translations.Paper Title Unsupervised Quality Estimation for Neural Machine Translation
81 16 1014
microsoft/semantic_parsing_with_constrained_lm
Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021).
Language:Python61 9 58
unicode-cookbook/cookbook
The Unicode Cookbook for Linguists
Language:TeX53 8 404
mjpost/bin
bin files
Language:Python13 2 17
vyraun/long-tailed
Code for "On Long-Tailed Phenomena in NMT".
Language:Python10 4 03
srush/aima-arguments
7 2 0
vered1986/LM_NE_bias
Named Entity Biases in Pre-trained Language Models
Language:Jupyter Notebook6 2 01
vyraun/Finding-Memo
Code for "Extractive Memorization in Constrained Sequence Generation Tasks"
Language:Python4 2 00

vyraun

vyraun's Stars

openai/whisper

haotian-liu/LLaVA

huggingface/datasets

FMInference/FlexiGen

voxel51/fiftyone

facebookincubator/AITemplate

jeffkaufman/icdiff

OpenNMT/CTranslate2

alpa-projects/alpa

google/BIG-bench

libffcv/ffcv

huggingface/datatrove

stanford-crfm/helm

microsoft/mup

NVIDIA/MatX

allenai/dolma

GEM-benchmark/NL-Augmenter

mayeaux/generate-subtitles

dustinvtran/latex-templates

google-research/bleurt

songhwanjun/Awesome-Noisy-Labels

microsoft/gpt-MT

facebookresearch/mlqe

microsoft/semantic_parsing_with_constrained_lm

unicode-cookbook/cookbook

mjpost/bin

vyraun/long-tailed

srush/aima-arguments

vered1986/LM_NE_bias

vyraun/Finding-Memo