strategy for training this model

by gd1m3y - opened Jan 28, 2023

gd1m3y

Jan 28, 2023

I was curious what kind of data was used to train this for ppo also what strategy was used for deciding reward

trl internal testing org Jan 28, 2023

Hi @gd1m3y
Thanks for your interest
This model is not intended to be trained or used of out the box but only for testing purposes

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment