JordanAsh

Pinned Repositories

acb
A PyTorch implementation of the Anti-concentrated Confidence Bonus (ACB) for promoting exploration in deep reinforcement learning.
Language:Python1 1 00
badge
An implementation of the BADGE batch active learning algorithm.
Language:Python193 6 1632
boostresnet
A PyTorch implementation of BoostResNet
Language:Python5 3 44
jordanash.github.io
Language:HTML0 1 00
sharpening
Language:Python00
warm_start
Code corresponding to 'On Warm-Starting Neural Network Training'
Language:Python9 1 11
memoryrl
combining memory and rl
Language:Python3 5 02
laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Language:Python361 22 2126

JordanAsh's Repositories

JordanAsh/badge
An implementation of the BADGE batch active learning algorithm.
Language:Python193 6 1632
JordanAsh/warm_start
Code corresponding to 'On Warm-Starting Neural Network Training'
Language:Python9 1 11
JordanAsh/boostresnet
A PyTorch implementation of BoostResNet
Language:Python5 3 44
JordanAsh/acb
A PyTorch implementation of the Anti-concentrated Confidence Bonus (ACB) for promoting exploration in deep reinforcement learning.
Language:Python1 1 00
JordanAsh/jordanash.github.io
Language:HTML0 1 00
JordanAsh/sharpening
Language:Python00