Pinned Repositories
emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering"
logging
MLPerf™ logging library
Megatron-LM
Ongoing research training transformer models at scale
NeMo
NeMo: a toolkit for conversational AI
policies
General policies for MLPerf™ including submission rules, coding standards, etc.
training
Reference implementations of MLPerf™ training benchmarks
training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
training_results_v3.0
This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.
training_results_v3.1
This repository contains the results and code for the MLPerf™ Training v3.1 benchmark.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit floating point (FP8) precision on Hopper GPUs, to deliver better performance with lower memory utilization in both training and inference.
ShriyaPalsamudram's Repositories
ShriyaPalsamudram/training
Reference implementations of MLPerf™ training benchmarks
ShriyaPalsamudram/emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering"
ShriyaPalsamudram/logging
MLPerf™ logging library
ShriyaPalsamudram/Megatron-LM
Ongoing research training transformer models at scale
ShriyaPalsamudram/NeMo
NeMo: a toolkit for conversational AI
ShriyaPalsamudram/policies
General policies for MLPerf™ including submission rules, coding standards, etc.
ShriyaPalsamudram/training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
ShriyaPalsamudram/training_results_v3.0
This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.
ShriyaPalsamudram/training_results_v3.1
This repository contains the results and code for the MLPerf™ Training v3.1 benchmark.
ShriyaPalsamudram/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit floating point (FP8) precision on Hopper GPUs, to deliver better performance with lower memory utilization in both training and inference.
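As a rough illustration of what the TransformerEngine repository provides, the sketch below runs a forward and backward pass through a te.Linear layer under FP8 autocast. It is a minimal sketch assuming the transformer_engine PyTorch package is installed and an FP8-capable GPU (e.g. Hopper) is available; the layer size and the DelayedScaling recipe settings are illustrative choices, not taken from any repository listed above.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# Illustrative size only; FP8 GEMMs expect dimensions divisible by 16.
hidden_size = 768
layer = te.Linear(hidden_size, hidden_size, bias=True).cuda()
inp = torch.randn(16, hidden_size, device="cuda")

# FP8 scaling recipe (assumed settings; library defaults also work).
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.HYBRID)

# The forward pass runs FP8 GEMMs when the hardware supports them.
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    out = layer(inp)

# Backward also uses FP8 where applicable.
out.sum().backward()
```

In practice te.Linear is used as a drop-in replacement for torch.nn.Linear, so existing training loops only need the fp8_autocast context added around the forward pass.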