v000000 commited on
Commit
e21cb98
1 Parent(s): 3debd31

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -52,6 +52,8 @@ Trained [Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
52
 
53
  * Merged all <b>DPO checkpoints</b> and <b>SLERP</b> variations with <b>MODEL_STOCK</b> to analyze geometric properties and get the most *performant* aspects of all runs/merges. *Model Stock* was chosen due to the similarity between the merged models.
54
 
 
 
55
  ## Recipe
56
 
57
  ```yaml
 
52
 
53
  * Merged all <b>DPO checkpoints</b> and <b>SLERP</b> variations with <b>MODEL_STOCK</b> to analyze geometric properties and get the most *performant* aspects of all runs/merges. *Model Stock* was chosen due to the similarity between the merged models.
54
 
55
+ * This was chosen due to the fact that evaluation for *ORPO* is unclear, so it's hard to know which runs are the best.
56
+
57
  ## Recipe
58
 
59
  ```yaml