/aws_deepracer_worksheet

Worksheet and Utilities for AWS DeepRacer – one of the most exciting ways of building strong skills in reinforcement learning and through a hands-on approach. This repository offers: 1) Functionally-rich and flexible reward function 2) Utilities with Jupiter notes for Racing Line calculation and visualisation of track 3) Scripts to parse RoboMaker training and evaluation logs to CSV file 4) Sample Excel file for car behaviour analysis as well as designing and planning new reward curves 5) Coordinates and AWS DeepRacer tracks and images.

Primary LanguagePythonMIT LicenseMIT

AWS Deep Racer Worksheet

Reward Function

Rewards calculation is based on a number of conditional points, which are all configurabTHatle. Sum of all conditional points contributes to the final score. Conditional points also have sensitivity parameter to control rewards curve.

The points are:

  1. Rewards the agent for being on or close enough (configurable) to the racing line.
  2. Rewards the agent for taking an action which gets the car closer to the next waypoint on the racing line.
  3. Reward the agent on having high steps-to-progress ratio. Simply saying: reward on good progress.
  4. Reward the agent on moving in high speed (regardless of track conditions).
  5. Reward the agent on moving straight, rather than making turns (regardless of track conditions).

Sample reward weights

REWARD_WEIGHT_PROG_STEP = 30
REWARD_WEIGHT_MAX_SPEED = 25
REWARD_WEIGHT_MIN_STEER = 20
REWARD_WEIGHT_DIR_STEER = 15
REWARD_WEIGHT_ON_TRACK = 10

In addition to contributing criteria points, there are 4 types of penalty indicators which reduce the total calculates score. The effectiveness of every penalty indicator is configurable in percentage terms, expect for the "Wheels Off Track" penalty. It has penalty weight of 100% percent and decay over number of steps (configurable).

The penalties are:

  1. Penalty to overall score in percentage for being off or far from racing line.
  2. Penalty to taking turn (having steering) which takes the car further away from the next waypoint on the racing line.
  3. Penalty for taking sharp turn / having high steering (regardless of track conditions).
  4. Penalty for having at least one wheel off track.

Sample penalty weights

MAX_STEPS_TO_DECAY_PENALTY = 5      # Value of zero or below disables penalty for having wheels off track

TOTAL_PENALTY_ON_OFF_TRACK = 0.999999  # Maximum penalty in percentage of total reward for being off track
TOTAL_PENALTY_ON_OFF_DIR_STEER = 0.50  # Maximum penalty in percentage of total reward for off directional steering
TOTAL_PENALTY_ON_HIGH_STEERING = 0.25  # Maximum penalty in percentage of total reward for high steering

Racing line calculation and visualisation

The objective is to smooth central line of a given track. This is achieved by minimising the distance between each pair of two closes waypoints. Selected closes waypoints are not always the nearest waypoints, such as between a pair of selected waypoints, there might be others waypoints located closer. However, these are still close enough to be used. This is controlled by skipp_step parameter, which allows acceleration of the algorithm. The optimisation algorithm takes into account track's inner and outer borders to ensure that new calculated waypoints do not touch the borders or go beyond the borders or located too close to the borders. This is controlled by max_offset parameter.

This algorithm does not search for an optimal arc of corners to race at the maximum possible speed for a given steering.

Sample Results

Track Name Original Track Racing Line v1 Racing Line v2
Vivalas Loop
Vivalas Speedway
Expedition Loop
Expedition Super Loop
Playa Super Raceway
Playa Raceway
Hot Rod Super Speedway
Hot Rod Speedway
Baja Highway
Baja Turnpike
Kuei Raceway
Kuei Super Raceway
Cosmic Circuit
Cosmic Loop
Lars Circuit
Lars Loop
Po-Chun Speedway
Po-Chun Super Speedway
Baadal Track

Full list of available tracks' data can be checked here: https://github.com/dp770/aws_deepracer_worksheet/tree/main/tracks.

Other Useful Repositories to look at

  1. AWS DeepRacer League (https://aws.amazon.com/deepracer/league/)
  2. DeepRacer Logs Analysis tools (https://github.com/aws-deepracer-community/deepracer-analysis)
  3. AWS Deepracer Workshop Lab Github (https://github.com/aws-samples/aws-deepracer-workshops)
  4. DeepRacer K1999 Racing Line Algo (https://github.com/cdthompson/deepracer-k1999-race-lines)
  5. DeepRacer Utils CLI commands (https://github.com/aws-deepracer-community/deepracer-utils)

License

MIT License

Copyright (c) 2021 AWS DeepRacer Worksheet contributors community

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.