LunarLander โ€” 394 params, 95% full landings

A tiny neural network (394 parameters) solving LunarLander-v2. Weights: 1.5 KB.

Performance

Metric Value
Params 394
Full landings (reward >= 200) 95/100
Average reward 258.8
Median reward 265.8
Best reward 310.9
Crashes (reward < 100) 3/100
Weights size 1.5 KB

Demo

Architecture

Params: 8x30 + 30 + 30x4 + 4 = 394

Usage

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading