Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ The model has underwent a post-training process that incorporates both **supervi
 - [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized)
 - [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs)
 - [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
-- [jondurbin/py-dpo-v0.1](https://huggingface.co/datasets/
+- [jondurbin/py-dpo-v0.1](https://huggingface.co/datasets/jondurbin/py-dpo-v0.1)
 
 ## How to use
 ### Chat Format