... for humans to program robots by using reinforcement learning and supplying a ... Supplied. Control. Policy. Environment. Phase Two. A. R. O 'Best' possible ...
Hierarchical Domain Decomposition for Probabilistic Planning. Construct ... Solve for macro operators. Plan for new goals in time logarithmic in plan length ...