datasciguy
/

TinyLlama-1.1B-Chat-v1.0-Unfiltered

Model card Files Files and versions Community

datasciguy commited on 2 days ago

Commit

4801cca

•

1 Parent(s): 39576c2

added model card

Files changed (1) hide show

README.md +77 -3

README.md CHANGED Viewed

@@ -1,3 +1,77 @@
----
-license: mit
----

+---
+license: mit
+---
+### Model Card: **TinyLlama-1.1B-Chat-v1.0-Unfiltered**
+---
+**Model Name**: TinyLlama-1.1B-Chat-v1.0-Unfiltered
+**Model Type**: Conversational AI Model
+**Architecture**: Based on a 1.1B parameter TinyLlama architecture
+**Training Data**:
+- Fine-tuned on the "dan_remixed" dataset (2.7MB).
+- The dataset improves spelling, grammar, and consistency while replacing references to violent crimes with non-violent activities and removes self-censorship from explicatives.
+**Training Time**: Approximately 30-45 minutes. Each validation epoch takes ~322 seconds.
+**Hardware**: Trained on GPU (specific GPU details not provided).
+---
+**Training Performance**:
+- **Epoch Losses**:
+  - Epoch 1: 0.7209
+  - Epoch 2: 0.4441
+  - Epoch 3: 0.3683
+  - Epoch 4: 0.3358
+  - Epoch 5: 0.3145
+- **Final Training Loss (Epoch 5)**: 0.3145
+---
+**Validation Performance** (5 Epochs):
+- **Epoch 1**:
+  - Training Loss: 0.2921
+  - Validation Loss: 0.7962
+  - Perplexity: 2.22
+  - Epoch completed in 321.64 seconds
+- **Epoch 2**:
+  - Training Loss: 0.2872
+  - Validation Loss: 0.7672
+  - Perplexity: 2.15
+  - Epoch completed in 321.91 seconds
+- **Epoch 3**:
+  - Training Loss: 0.2874
+  - Validation Loss: 0.7821
+  - Perplexity: 2.19
+  - Epoch completed in 321.94 seconds
+- **Epoch 4**:
+  - Training Loss: 0.2864
+  - Validation Loss: 0.7796
+  - Perplexity: 2.18
+  - Epoch completed in 322.01 seconds
+- **Epoch 5**:
+  - Training Loss: 0.2831
+  - Validation Loss: 0.8017
+  - Perplexity: 2.23
+  - Epoch completed in 322.01 seconds
+---
+**Optimizer**: AdamW, learning rate: 1e-5
+**Loss Function**: Cross-Entropy Loss, ignoring padding tokens (ignore_index=-100)
+**Use Case**: Conversational AI designed for general, unrestricted conversation, with no filtering on the nature of responses, provided the content is non-violent.
+---
+**Limitations**:
+- Due to the small fine-tuning dataset size (2.7MB), the model may be prone to **overfitting** and **bias**.
+- The dataset has been modified to avoid violent language, but the model might still exhibit strong or explicit responses.
+**Metrics**:
+- Loss and perplexity have been tracked, and more conversational metrics (like BLEU, ROUGE, or human evaluation) could be explored.