PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
File size: 134 Bytes
393c04d
29a177d
393c04d
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:24c1d32e97c484b8d24d078bf5f2604954890673a18db449f5370d99ff1e5a8e
size 323014168