Deep RL Course documentation

Quiz

Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

Quiz

The best way to learn and to avoid the illusion of competence is to test yourself. This will help you to find where you need to reinforce your knowledge.

Q1: Chose the option which fits better when comparing different types of multi-agent environments

  • Your agents aim to maximize common benefits in ____ environments
  • Your agents aim to maximize common benefits while minimizing opponent’s in ____ environments

Q2: Which of the following statements are true about decentralized learning?

Q3: Which of the following statements are true about centralized learning?

Q4: Explain in your own words what is the Self-Play approach

Solution

Self-play is an approach to instantiate copies of agents with the same policy as your as opponents, so that your agent learns from agents with same training level.

Q5: When configuring Self-play , several parameters are important. Could you identify, by their definition, which parameter are we talking about?

  • The probability of playing against the current self vs an opponent from a pool
  • Variety (dispersion) of training levels of the opponents you can face
  • The number of training steps before spawning a new opponent
  • Opponent change rate

Q6: What are the main motivations to use a ELO rating Score?

Congrats on finishing this Quiz 🥳, if you missed some elements, take time to read the chapter again to reinforce (😏) your knowledge.