Text Generation
Transformers
Safetensors
English
llama
llama-factory
Not-For-All-Audiences
conversational
text-generation-inference
Inference Endpoints
aaronday3 committed on
Commit 6eface1
1 Parent(s): d37ed12

Update README.md

Files changed (1)
  1. README.md +5 -7
README.md CHANGED
@@ -158,16 +158,14 @@ When convenient, say screenplay phrases like "cut to"
  The split was as follows:
 
  - **2K rows from r/WritingPrompts**
- - **2K rows from r/DirtyWritingPrompts**
- - **2K rows from Opus Instruct 15K (specifically the 6.5K jsonl)**
+ - **1.1K rows from r/DirtyWritingPrompts**
+ - **1.4K rows from Opus Instruct 15K (specifically the 6.5K jsonl)**
  - **2K rows from c2 logs cleaned**
 
  While we did train on all system prompts from c2 logs, we also have our own system prompts.
  <details>
  <summary>List of trained system prompts. Note: c2 logs system prompts and char cards were also included.</summary>
 
- Here's the information in a markdown table:
-
  | Dataset | System Prompt |
  |--------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  | reddit_dirty_writing_prompts.jsonl | "You are a short story writer. Write a story based on prompt provided by user below. Mode: NSFW" |
@@ -191,7 +189,7 @@ We think there is too much secrecy around what data is being used, and different
  We found that increasing the LoRA rank from 64 to 256 reduced repetition but also led to the language used resembling Claude more than the 64 rank version. No worries, it's still far enough from Claude.
  <br>It also led to increased coherency but reduced instruction following, likely because the model started diverging further from L3 8B Instruct.
  <br>**The model is uncensored for RP. For Instruct it needs 2-3 words of prefill for the first message.**
- <br>We found that increasing the amount of data from 1K to 8K rows reduced repetition as well.
+ <br>We found that increasing the amount of data from 1K to 6.5K rows reduced repetition as well.
  <br>**The prose is much better than other synthetic data generations. The model also demonstrates increased style-copying abilities, likely a result of the long samples and varying writing styles found in WritingPrompts.**
  <br>**The model is exceptional at being creative in roleplaying**, knows different personas, and even a single character will change persona in different contexts; persona is tied to the last few messages rather than the system message or character card. **This is great as it often means the model can do impressive things without you needing to explicitly specify.**
@@ -206,8 +204,8 @@ Grad norm kept increasing throughout the run which is concerning, albeit it coul
 
  ## Graphs
  Colors:
- <p style="color: #F0B899;">256 rank on 8K rows</p>
- <p style="color: #5BC5DB;">64 rank on 8K rows</p>
+ <p style="color: #F0B899;">256 rank on 6.5K rows</p>
+ <p style="color: #5BC5DB;">64 rank on 6.5K rows</p>
  <p style="color: #5387DD;">64 rank on 1K rows</p>
 
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf5d14ca0a22768bbe10c/y9hC4bGq-Lt7sDQ23q5db.png)
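The split listed above is just per-source row counts sampled from separate jsonl files and concatenated into one training set. Below is a minimal sketch of that assembly step; the file names, paths, and seed are assumptions for illustration, not the exact preprocessing used for this model.

```python
import json
import random

# Hypothetical per-source row counts matching the split described above.
# File names are placeholders, not the actual dataset files.
MIX = {
    "reddit_writing_prompts.jsonl": 2000,
    "reddit_dirty_writing_prompts.jsonl": 1100,
    "opus_instruct_6.5k.jsonl": 1400,
    "c2_logs_cleaned.jsonl": 2000,
}

random.seed(42)  # assumed seed, only for reproducibility of the sample
mixed = []
for path, n_rows in MIX.items():
    with open(path, encoding="utf-8") as f:
        rows = [json.loads(line) for line in f if line.strip()]
    # Sample without replacement, capped at the file size.
    mixed.extend(random.sample(rows, min(n_rows, len(rows))))

random.shuffle(mixed)
with open("mixed_train.jsonl", "w", encoding="utf-8") as f:
    for row in mixed:
        f.write(json.dumps(row, ensure_ascii=False) + "\n")
```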
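The 64 vs 256 comparison above refers to the LoRA rank. As a rough illustration only (the run was done with LLaMA-Factory, and the alpha, dropout, and target modules below are assumed values, not this model's real recipe), the two settings would correspond to PEFT configs like:

```python
from peft import LoraConfig

# Illustrative sketch: r is the dimension of the low-rank update matrices.
lora_256 = LoraConfig(
    r=256,                # higher rank: less repetition, prose drifts closer to Claude
    lora_alpha=256,       # assumed
    lora_dropout=0.05,    # assumed
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)

lora_64 = LoraConfig(
    r=64,                 # lower rank: stays closer to L3 8B Instruct
    lora_alpha=64,        # assumed
    lora_dropout=0.05,    # assumed
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)
```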
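The prefill note above means starting the assistant's first Instruct reply yourself with 2-3 words. A minimal sketch with transformers (the model id, messages, and prefill text are placeholders, and this is one common way to prefill, not necessarily how the authors run it) is to render the chat prompt and append the opening words before generating:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "your-org/your-model"  # placeholder, not the actual repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Write a short scene set in a lighthouse."},
]

# Render the prompt up to the assistant turn, then append a 2-3 word prefill
# so the model continues from it instead of writing a refusal.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
prompt += "Sure, here is"

inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to(model.device)
output = model.generate(**inputs, max_new_tokens=300, do_sample=True, temperature=0.8)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```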
 