DUAL-GPO/7b-kto-10-40-i1
Tags: PEFT, TensorBoard, Safetensors, HuggingFaceH4/ultrafeedback_binarized, mistral, alignment-handbook, Generated from Trainer, trl, dpo
Commit History (branch: main)
c62da01: End of training (BraylonDash, verified, committed 1 day ago)
1c4a63e: Model save (BraylonDash, verified, committed 1 day ago)
deab481: Training in progress, step 220 (BraylonDash, verified, committed 1 day ago)
2e5b4a1: Training in progress, step 200 (BraylonDash, verified, committed 1 day ago)
69f5447: Training in progress, step 180 (BraylonDash, verified, committed 1 day ago)
5108870: Training in progress, step 160 (BraylonDash, verified, committed 1 day ago)
41cc517: Training in progress, step 140 (BraylonDash, verified, committed 1 day ago)
87cb5cc: Training in progress, step 120 (BraylonDash, verified, committed 1 day ago)
a9c95c4: Training in progress, step 100 (BraylonDash, verified, committed 1 day ago)
6ffddc7: Training in progress, step 80 (BraylonDash, verified, committed 1 day ago)
e44c8fa: Training in progress, step 60 (BraylonDash, verified, committed 1 day ago)
b8dd1b1: Training in progress, step 20 (BraylonDash, verified, committed 1 day ago)
8a210a9: initial commit (BraylonDash, verified, committed 1 day ago)