Reinforcement Learning
We have implemented a few learning examples.
Ant
data:image/s3,"s3://crabby-images/6e957/6e957b277bcf0d7d1d1240c401e66c847ab14ceb" alt=""
Policy optimization is performed using the reinforcement-learning algorithm augmented random search (ARS) to optimize static linear policies for locomotion. The insect-like robot has rewards on forward velocity and survival and costs on control usage and contact forces.
Quadruped
data:image/s3,"s3://crabby-images/b5eb3/b5eb3f0fc7745960d6e70ee11431ac1a2400c44a" alt=""
A very basic random-sampling algorithm is used to find parameters for the periodic gait of a quadruped.
Cartpole
data:image/s3,"s3://crabby-images/22322/22322615aba32ada1f4c71fdb6e7fea54ed1a6c8" alt=""
We have modified the cartpole example in the ReinforcementLearning
package to use Dojo
's dynamics. This allows us to combine advanced learning algorithms with accurate dynamics simulation.