This has been the best build so far. For more information, see: https://www.lesswrong.com/posts/x5ySDLEsJdtdmR7nX/rllmv10-experiment