Hugging Face
nteku1/llama-7b-qlora-ultrachat_2-DPO
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License: apache-2.0
main
Commit History
0b117f7 (verified): End of training. nteku1 committed 18 days ago
96ca8ab (verified): End of training. nteku1 committed 20 days ago
543c0df (verified): initial commit. nteku1 committed 20 days ago