Pinned Repositories
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including BERT & GPT-2
transformers-bloom-inference
Fast Inference Solutions for BLOOM
BigCode-Megatron-LM
Ongoing research training transformer models at scale
deep-learning-bias-correction
GPTQ-for-SantaCoder
4-bit quantization of SantaCoder using GPTQ
HybridToD
Papers-books-and-blogs
This repository contains the research papers, white papers, theses, etc. that I love.
pseudo-code-instructions
Pseudo-code Instructions dataset
real-time-visual-respiration-rate-estimation-with-dynamic-scene-adaptation
VRAG
mayank31398's Repositories
mayank31398/GPTQ-for-SantaCoder
4-bit quantization of SantaCoder using GPTQ
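For illustration, a minimal sketch of 4-bit GPTQ quantization using Hugging Face transformers' built-in GPTQ integration rather than this repo's own scripts; the model id and calibration dataset are assumptions:

```python
# Sketch: 4-bit GPTQ quantization via transformers' GPTQ integration
# (not this repository's scripts). Model id and calibration set are
# illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "bigcode/santacoder"  # assumed SantaCoder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
quant_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)

# Weights are quantized to 4 bits while loading (requires a GPU).
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", quantization_config=quant_config
)
```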
mayank31398/pseudo-code-instructions
Pseudo-code Instructions dataset
mayank31398/Papers-books-and-blogs
This repository contains the research papers, white papers, theses, etc. that I love.
mayank31398/VRAG
mayank31398/BigCode-Megatron-LM
Ongoing research training transformer models at scale
mayank31398/HybridToD
mayank31398/blog
Public repo for HF blog posts
mayank31398/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
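A minimal sketch of wrapping a model with deepspeed.initialize; the config values are illustrative and the script is meant to run under the deepspeed launcher:

```python
# Sketch: initializing a model with DeepSpeed ZeRO stage 2.
# Config values are illustrative; launch with `deepspeed script.py`.
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)
ds_config = {
    "train_batch_size": 8,
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 2},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
```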
mayank31398/dotfiles
mayank31398/flash-attention
Fast and memory-efficient exact attention
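A minimal sketch of calling the attention kernel directly; shapes follow the package's documented (batch, seqlen, nheads, headdim) convention, and a CUDA GPU with fp16/bf16 inputs is assumed:

```python
# Sketch: direct use of flash_attn_func (assumes flash-attn is
# installed and a CUDA device is available).
import torch
from flash_attn import flash_attn_func

# q, k, v: (batch, seqlen, nheads, headdim), fp16/bf16, on GPU.
q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True)  # same shape as q
```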
mayank31398/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
mayank31398/gym-text
mayank31398/ibm-fm-base-images
mayank31398/IBM-fms-fsdp
Demonstrates the throughput of PyTorch FSDP
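A minimal sketch of the underlying PyTorch FSDP wrapping (the generic torch.distributed API, not this repo's training scripts); it assumes launch via torchrun:

```python
# Sketch: sharding a model with PyTorch FSDP. Run under torchrun so
# the process-group environment variables are set.
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = torch.nn.Transformer().cuda()
model = FSDP(model)  # parameters, gradients, optimizer state sharded

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
```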
mayank31398/kernel-hyperdrive
A bunch of kernels to make stuff slower 😉
mayank31398/lm-evaluation-harness
A framework for few-shot evaluation of language models.
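A minimal sketch of invoking the harness programmatically (the v0.4+ Python API); the model and task names are illustrative:

```python
# Sketch: few-shot evaluation with lm-eval; model and task names
# are illustrative.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=gpt2",
    tasks=["hellaswag"],
    num_fewshot=5,
)
print(results["results"]["hellaswag"])
```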
mayank31398/lqvae
mayank31398/mayank.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
mayank31398/mayank31398
mayank31398/mayank31398.github.io
mayank31398/MIPS-verilog
This repository contains code for a single-cycle MIPS architecture written in Verilog.
mayank31398/mixed-communication-runtime
mayank31398/optimum
🏎️ Accelerate training and inference of 🤗 Transformers with easy-to-use hardware optimization tools
mayank31398/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
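A minimal sketch of attaching LoRA adapters with PEFT; the base model and hyperparameters are illustrative:

```python
# Sketch: LoRA fine-tuning setup with PEFT. Base model and LoRA
# hyperparameters are illustrative.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")
lora_config = LoraConfig(r=8, lora_alpha=16, target_modules=["c_attn"])
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only adapter weights are trainable
```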
mayank31398/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
mayank31398/rocm-apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
mayank31398/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
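For context, a plain-PyTorch sketch of the top-k sparse MoE routing that scattermoe accelerates with Triton kernels (this shows the general technique, not scattermoe's API):

```python
# Sketch: naive top-k Mixture-of-Experts routing in plain PyTorch,
# shown only to illustrate the computation scattermoe speeds up.
import torch

def topk_moe(x, experts, router, k=2):
    # x: (tokens, dim). Route each token to its top-k experts.
    weights, idx = torch.topk(router(x).softmax(-1), k)
    weights = weights / weights.sum(-1, keepdim=True)  # renormalize
    out = torch.zeros_like(x)
    for e, expert in enumerate(experts):
        hit = (idx == e).any(-1)  # tokens routed to expert e
        if hit.any():
            w = weights[hit][idx[hit] == e].unsqueeze(-1)
            out[hit] += w * expert(x[hit])
    return out
```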
mayank31398/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
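A minimal usage sketch with the pipeline API; the model is illustrative:

```python
# Sketch: text generation with the transformers pipeline API.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
print(generator("Hello, world", max_new_tokens=20)[0]["generated_text"])
```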
mayank31398/vscode-icons
Icons for Visual Studio Code
mayank31398/vscode-settings