Rohith-Rongali's Stars
amitrajaraman/notes
Notes galore
lchizat/2023-BAFU
Code for the paper L. Chizat, P. Netrapalli (2023). "Steering Deep Feature Learning with Backward Aligned Feature Updates".
aradha/lin-RFM
Code for lin-RFM used for sparse recovery tasks
bGhorbani/linearized_neural_networks
The code for the paper "When do neural networks outperform kernel methods"
modestyachts/neural_kernels_code
tml-epfl/sgd-sparse-features
SGD with large step sizes learns sparse features [ICML 2023]
LiyuanLucasLiu/RAdam
On the Variance of the Adaptive Learning Rate and Beyond
dit/dit
Python package for information theory.
locuslab/edge-of-stability
djsutherland/html-talk
Base for my talks using reveal.js, with bonus nice features (including browser/editor sync!)
steveazzolin/gdl_tutorial_turinginst
Material for the hands-on tutorial on Graph Deep Learning held at the Alan Turing Institute
LeoGrin/tabular-benchmark
pilancilab/convex_nn
aleximmer/Laplace
Laplace approximations for Deep Learning.
namlede/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
eugeneyan/open-llms
đź“‹ A list of open LLMs available for commercial use.
state-spaces/mamba
Mamba SSM architecture
mariuslindegaard/Intermediate_Neural_Collapse
(ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code
bobby-he/simplified_transformers
anthropics/toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
google-research/jaxpruner
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
EleutherAI/math-lm
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
cleverhans-lab/cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
ResearchDaniel/NeuralActivationPatterns
princeton-nlp/LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
aradha/deep_neural_feature_ansatz
Code for verifying deep neural feature ansatz