Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
bguan
/
lunar_lander_v2_ppo_5
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
main
lunar_lander_v2_ppo_5
1 contributor
History:
3 commits
bguan
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
57e96c5
over 2 years ago
bguan_ppo_lunarlander5
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
.gitattributes
1.22 kB
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
README.md
677 Bytes
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
bguan_ppo_lunarlander5.zip
pickle
145 kB
LFS
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
config.json
15 kB
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
replay.mp4
247 kB
LFS
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago
results.json
163 Bytes
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
over 2 years ago