---
library_name: sample-factory
tags:
- deep-reinforcement-learning
- reinforcement-learning
- sample-factory
model-index:
- name: APPO
  results:
  - metrics:
    - type: mean_reward
      value: 9350.13 +/- 1.31
      name: mean_reward
    task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: mujoco_doublependulum
      type: mujoco_doublependulum
---

An **APPO** model trained on the **mujoco_doublependulum** environment.

This model was trained using Sample Factory 2.0: https://github.com/alex-petrenko/sample-factory
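
The `mean_reward` entry in the metadata is stored as a single string of the form `mean +/- spread` (presumably the mean and standard deviation of episode rewards, though the card does not say so). As a minimal sketch for anyone who wants to compare this number programmatically, the string can be split into two floats using only the Python standard library. The helper name `parse_mean_reward` is hypothetical and is not part of Sample Factory.

```python
import re
from typing import Tuple


def parse_mean_reward(value: str) -> Tuple[float, float]:
    """Split a 'mean +/- spread' string into a (mean, spread) pair of floats.

    Hypothetical helper for illustration; not a Sample Factory API.
    """
    pattern = r"\s*(-?\d+(?:\.\d+)?)\s*\+/-\s*(\d+(?:\.\d+)?)\s*"
    match = re.fullmatch(pattern, value)
    if match is None:
        raise ValueError(f"unexpected mean_reward format: {value!r}")
    return float(match.group(1)), float(match.group(2))


# The value reported in this card's metadata.
mean, spread = parse_mean_reward("9350.13 +/- 1.31")
print(f"mean={mean}, spread={spread}")
```

Strings that do not match the expected shape raise `ValueError` rather than being silently coerced, which seems the safer choice when the metadata format is not guaranteed.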