sanyalsunny111
I am a PhD student at UT Austin. I am trying to make LLM pre-training more efficient.
The University of Texas at AustinAustin, Texas
Pinned Repositories
Analyzing_Reddit_trolls_using_machine_learning_and_networkscience
This is a big data analysis problem that we have solved using Random forests and network science.
Cream
This is a collection of our NAS and Vision Transformer work.
Data-Science-Lab-EE-460J-
Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
ECE-380L-Term-Project-Fall-2021
Feature Engineering and Extraction for Channel Estimation and Tracking Using Deep Neural Networks
Federated_Learning_with_Differentiable_Architecture_Compression_ECE381V
This is a class project for EE381V Advanced Computer Vision taught by Prof. Atlas Wang.
FLOW_finetuning
Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
LLM-Inheritune
This is the official repository for Inheritune.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
sanyalsunny111's Repositories
sanyalsunny111/LLM-Inheritune
This is the official repository for Inheritune.
sanyalsunny111/Early_Weight_Avg
[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
sanyalsunny111/FLOW_finetuning
Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting
sanyalsunny111/Analyzing_Reddit_trolls_using_machine_learning_and_networkscience
This is a big data analysis problem that we have solved using Random forests and network science.
sanyalsunny111/Cream
This is a collection of our NAS and Vision Transformer work.
sanyalsunny111/Data-Science-Lab-EE-460J-
sanyalsunny111/ECE-380L-Term-Project-Fall-2021
Feature Engineering and Extraction for Channel Estimation and Tracking Using Deep Neural Networks
sanyalsunny111/Federated_Learning_with_Differentiable_Architecture_Compression_ECE381V
This is a class project for EE381V Advanced Computer Vision taught by Prof. Atlas Wang.
sanyalsunny111/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
sanyalsunny111/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
sanyalsunny111/MaskRCNN_and_Inpainting_Videos
This is my final project for UT's Digital Video class. It takes a video, listens for a 'magic word', and then attempts to detect inpaint different objects. This gives the illusion of the object 'disappearing' from the video.
sanyalsunny111/open_lm
A repository for research on medium sized language models.
sanyalsunny111/resume
This is latest resume.