Pinned Repositories
RL4LMs
A modular RL library to fine-tune language models to human preferences
dashifyML
A lightweight tool to manage and track your large scale machine leaning experiments
fluidml
FluidML is a lightweight framework for developing machine learning pipelines.
google-word2vec-demo
A demo of using google's pre-trained word2vec model
minimalistic-ml
A collection of basic ML programs
nlp-gym
NLPGym - A toolkit to develop RL agents to solve NLP tasks.
novelty-guided-rl
pytorch-optimize
A simple black-box optimization framework to train your pytorch models for optimizing non-differentiable objectives
Reacher-2D
To train an agent to reach an object using geometrical solution
spsa-optimization
Repository to implement SPSA
rajcscw's Repositories
rajcscw/nlp-gym
NLPGym - A toolkit to develop RL agents to solve NLP tasks.
rajcscw/pytorch-optimize
A simple black-box optimization framework to train your pytorch models for optimizing non-differentiable objectives
rajcscw/spsa-optimization
Repository to implement SPSA
rajcscw/google-word2vec-demo
A demo of using google's pre-trained word2vec model
rajcscw/minimalistic-ml
A collection of basic ML programs
rajcscw/novelty-guided-rl
rajcscw/Reacher-2D
To train an agent to reach an object using geometrical solution
rajcscw/spsa-policy-rl
rajcscw/blog
Public repo for HF blog posts
rajcscw/datastack
a stream-based file storage solution for machine learning datasets.
rajcscw/gym
A toolkit for developing and comparing reinforcement learning algorithms.
rajcscw/axolotl
Go ahead and axolotl questions
rajcscw/esn-for-crypto
Repository to implement and reproduce "Using Echo State Networks For Cryptography"
rajcscw/rl-exploration
Reinforcement Learning papers on exploration methods.
rajcscw/RL4LMs
A modular RL library to fine-tune language models to human preferences