/RLXtreme

RLXtreme是一个轻量级且高效的Python强化学习算法包,旨在提供极致性能和灵活业务应用的强化学习解决方案。

Primary LanguagePythonApache License 2.0Apache-2.0

RLXtreme

RLXtreme is a powerful and efficient Python package for reinforcement learning algorithms, designed to provide state-of-the-art performance and tackle challenging problems. It integrates a wide range of classical and modern reinforcement learning algorithms, including Deep Q-Networks (DQN), Proximal Policy Optimization (PPO), Soft Actor-Critic (SAC), and more, to meet the demands of various tasks and applications.

Table of Contents

Features

  • High-performance reinforcement learning algorithms
  • Efficient training and decision-making processes
  • Support for popular algorithms such as (C)MAB, DQN, PPO, SAC, etc.
  • Advanced optimization techniques for scalability
  • Tools for data collection, model evaluation, and result visualization
  • User-friendly API for easy model development and experimentation
  • Distributed computing and parallel training capabilities

Installation

To install RLXtreme, you can use pip:

pip install.

Usage

Here's a simple example that demonstrates how to train a DQN agent using RLXtreme:

import rlxtreme

# Create environment
env = rlxtreme.make_env("CartPole-v1")

# Create DQN agent
agent = rlxtreme.DQN()

# Train the agent
agent.train(env)

# Evaluate the trained agent
rewards = agent.evaluate(env)
print("Average reward:", sum(rewards) / len(rewards))