Pinned Repositories
agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
csci-566-assignment2
CSCI566_final_project
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
sawyTeleCont
Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
garage
A toolkit for reproducible reinforcement learning research.
avnishn's Repositories
avnishn/sawyTeleCont
avnishn/agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
avnishn/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
avnishn/csci-566-assignment2
avnishn/CSCI566_final_project
avnishn/gym
A toolkit for developing and comparing reinforcement learning algorithms.
avnishn/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
avnishn/meta-world.github.io
avnishn/mnist_digit_tracker_public
avnishn/parallel-ray-tracing
avnishn/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
avnishn/ray-llm
RayLLM - LLMs on Ray
avnishn/rl-experiments
Keeping track of RL experiments
avnishn/rlkit
Collection of reinforcement learning algorithms
avnishn/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
avnishn/spinningup
An educational resource to help anyone learn deep reinforcement learning.
avnishn/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.