cpnota/autonomous-learning-library

Change SAC test sampling

Closed this issue · 0 comments

It seems like sampling from the policy produces better test results than choosing the mean action, so use this as the default behavior.