PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
File size: 129 Bytes
393c04d
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:021e306e612835437f3809fcb7fbebb4f9a78228a0f387ec4473d85531fd636d
size 6264