Rui Yang

Ray2333

AI & ML interests

Deep Reinforcement Learning

Organizations

Ray2333's activity

New activity in Ray2333/GRM-llama3-8B-sftreg 25 days ago
New activity in Ray2333/gpt2-large-harmless-reward_model about 2 months ago

a bug when loading model

1
#2 opened about 2 months ago by ssmmzz
New activity in Ray2333/gpt2-large-harmless-reward_model 6 months ago

How to train the model

1
#1 opened 6 months ago by mike2000