ResourceAllocationReinforcementLearning