PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
Qwen2-7B-Instruct-SPPO-Function-call-v2.12 / adapter_model.safetensors

Commit History

Training in progress, step 1091
29a177d
verified

khongtrunght commited on

Training in progress, step 1000
56aaf0f
verified

khongtrunght commited on

Training in progress, step 900
7aaa116
verified

khongtrunght commited on

Training in progress, step 800
dbfd53b
verified

khongtrunght commited on

Training in progress, step 700
1d53b28
verified

khongtrunght commited on

Training in progress, step 600
28da3dc
verified

khongtrunght commited on

Training in progress, step 500
a518eb4
verified

khongtrunght commited on

Training in progress, step 400
823cf2b
verified

khongtrunght commited on

Training in progress, step 300
43938b3
verified

khongtrunght commited on

Training in progress, step 200
318a297
verified

khongtrunght commited on

Training in progress, step 100
393c04d
verified

khongtrunght commited on