PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer

Commit History

End of training
b9dabf6
verified

khongtrunght commited on

Model save
e965a13
verified

khongtrunght commited on

Training in progress, step 1091
29a177d
verified

khongtrunght commited on

Training in progress, step 1000
56aaf0f
verified

khongtrunght commited on

Training in progress, step 900
7aaa116
verified

khongtrunght commited on

Training in progress, step 800
dbfd53b
verified

khongtrunght commited on

Training in progress, step 700
1d53b28
verified

khongtrunght commited on

Training in progress, step 600
28da3dc
verified

khongtrunght commited on

Training in progress, step 500
a518eb4
verified

khongtrunght commited on

Training in progress, step 400
823cf2b
verified

khongtrunght commited on

Training in progress, step 300
43938b3
verified

khongtrunght commited on

Training in progress, step 200
318a297
verified

khongtrunght commited on

Training in progress, step 100
393c04d
verified

khongtrunght commited on

initial commit
4bfb6f3
verified

khongtrunght commited on