---
library_name: sample-factory
tags:
- deep-reinforcement-learning
- reinforcement-learning
- sample-factory
model-index:
- name: APPO
  results:
  - metrics:
    - type: mean_reward
      value: 9350.13 +/- 1.31
      name: mean_reward
    task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: mujoco_doublependulum
      type: mujoco_doublependulum
---

An **APPO** model trained on the **mujoco_doublependulum** environment.
This model was trained using Sample Factory 2.0: https://github.com/alex-petrenko/sample-factory
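
The card's metadata records the mean reward as a single string, `9350.13 +/- 1.31`. Below is a minimal sketch of splitting that string into two numbers for comparison against other runs. It assumes the format is always `<mean> +/- <spread>`. The helper name `parse_mean_reward` is hypothetical and is not part of Sample Factory, and what the second number measures (for example a standard deviation) is not stated in this card.

```python
import re


def parse_mean_reward(value: str) -> tuple[float, float]:
    """Split a '<mean> +/- <spread>' string into two floats.

    Hypothetical helper: the format is assumed from this card's metadata.
    """
    pattern = r"\s*(-?\d+(?:\.\d+)?)\s*\+/-\s*(\d+(?:\.\d+)?)\s*"
    match = re.fullmatch(pattern, value)
    if match is None:
        raise ValueError(f"unexpected mean_reward format: {value!r}")
    return float(match.group(1)), float(match.group(2))


# Value copied from the model-index metadata of this card.
mean, spread = parse_mean_reward("9350.13 +/- 1.31")
print(f"mean={mean}, spread={spread}")
```

Strings that do not match the assumed format raise a `ValueError` rather than returning a partial result.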