lunar_lander_v2_ppo_4 / bguan_ppo_lunarlander4

Commit History

lunar lander model #4, using PPO trained with learning rate 0.0005 for 500K timesteps
0e6fc9b

bguan commited on