ppo-LunarLander-v2 / robot_1 /policy.optimizer.pth

Commit History

basic PPO model trained in colab, deep-rl course unit
7f32894

jgerbscheid commited on