ppo_zephyr_vllm_1e-6_kl_0.02_num_mini_batches_1 / model.safetensors.index.json

Commit History

End of training
d474caa
verified

vwxyzjn commited on