rungao2001
commited on
Commit
•
e2f4984
1
Parent(s):
c117982
Update README.md
Browse files
README.md
CHANGED
@@ -34,8 +34,7 @@ argilla/dpo-mix-7k
|
|
34 |
|
35 |
DPO
|
36 |
|
37 |
-
Notice:
|
38 |
-
Notice: `chat_template` was modified because the original vicuna1.1 format can not be used in trl.DPOTrainer. The \"Conversation roles must alternate user/assistant/user/assistant/...\" Error was removed, and system message is output only when loop.index0 == 0 and role == \'user\'.
|
39 |
|
40 |
#### Training Hyperparameters
|
41 |
|
|
|
34 |
|
35 |
DPO
|
36 |
|
37 |
+
Notice: The chat_template was modified because the original vicuna1.1 format cannot be used in trl.DPOTrainer. The error \"Conversation roles must alternate user/assistant/user/assistant/...\" was removed, and the system message is output only when loop.index0 == 0 and role == 'user'.
|
|
|
38 |
|
39 |
#### Training Hyperparameters
|
40 |
|