luluw committed
Commit f857431
1 Parent(s): dd6dbb1

Update README.md

Files changed (1):
  1. README.md +37 -12
README.md CHANGED
@@ -8,26 +8,55 @@ tags:
 model-index:
 - name: llama2-7B-finetuned-chat-guanaco
   results: []
 ---
 
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-
-# llama2-7B-finetuned-chat-guanaco
 
-This model is a fine-tuned version of [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf) on an unknown dataset.
 
 ## Model description
 
-More information needed
 
 ## Intended uses & limitations
 
 More information needed
 
-## Training and evaluation data
 
-More information needed
 
 ## Training procedure
 
@@ -44,10 +73,6 @@ The following hyperparameters were used during training:
 - num_epochs: 3
 - mixed_precision_training: Native AMP
 
-### Training results
-
-
-
 ### Framework versions
 
 - PEFT 0.12.0
 
 model-index:
 - name: llama2-7B-finetuned-chat-guanaco
   results: []
+license: mit
 ---
+base_model: NousResearch/Llama-2-7b-chat-hf
+library_name: peft
+tags:
+- trl
+- sft
+- generated_from_trainer
+model-index:
+- name: llama2-7B-finetuned-chat-guanaco
+  results: []
 
+---
 
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->
 
 ## Model description
 
+The __`llama2-7B-finetuned-chat-guanaco`__ model is a fine-tuned version of the [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf) base model, a variant of LLaMA (Large Language Model Meta AI) designed for chat applications and optimized for conversational understanding and generation.
+
+## Dataset used
+
+[mlabonne/guanaco-llama2-1k](https://huggingface.co/mlabonne/guanaco-llama2-1k)
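For orientation, `mlabonne/guanaco-llama2-1k` is generally described as Guanaco conversations repacked into a single `text` field in the Llama-2 chat template. A minimal formatter sketch under that assumption (the field name and exact spacing are assumptions, not taken from this card):

```python
def format_llama2_chat(user_message: str, assistant_reply: str) -> str:
    """Wrap one exchange in the Llama-2 chat template (assumed layout)."""
    return f"<s>[INST] {user_message} [/INST] {assistant_reply} </s>"

# Build one training example in the shape the dataset is assumed to use.
example = {"text": format_llama2_chat(
    "What is supervised fine-tuning?",
    "Training a base model further on labeled prompt/response pairs.",
)}
print(example["text"])
```

Samples stored this way can be fed to a causal-LM trainer as plain text, with no extra prompt templating at training time.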
 
 ## Intended uses & limitations
 
 More information needed
 
+### Training results
 
+The training loss over steps is as follows:
+
+| Step | Training Loss |
+|------|---------------|
+| 25   | 1.823         |
+| 50   | 2.056         |
+| 75   | 1.829         |
+| 100  | 1.744         |
+| 125  | 1.717         |
+| 150  | 1.412         |
+| 175  | 1.506         |
+| 200  | 1.446         |
+| 225  | 1.499         |
+| 250  | 1.432         |
+| 275  | 1.281         |
+| 300  | 1.341         |
+| 325  | 1.345         |
+| 350  | 1.391         |
+| 375  | 1.388         |
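The downward trend in the table can be checked with a few lines of arithmetic over the logged values (the numbers below are copied from the table; the five-point averaging window is an arbitrary choice):

```python
# Training-loss log from the table above, as (step, loss) pairs.
loss_log = [
    (25, 1.823), (50, 2.056), (75, 1.829), (100, 1.744), (125, 1.717),
    (150, 1.412), (175, 1.506), (200, 1.446), (225, 1.499), (250, 1.432),
    (275, 1.281), (300, 1.341), (325, 1.345), (350, 1.391), (375, 1.388),
]

# Compare the average of the first and last five logged losses.
first_five = sum(loss for _, loss in loss_log[:5]) / 5
last_five = sum(loss for _, loss in loss_log[-5:]) / 5
print(f"{first_five:.3f}")  # → 1.834
print(f"{last_five:.3f}")   # → 1.349
```

So the loss drops by roughly 0.5 over the run, though the curve is noisy (e.g. the bump at step 50), which is typical for a small 1k-sample SFT dataset.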
 
 ## Training procedure
 
 
 - num_epochs: 3
 - mixed_precision_training: Native AMP
 
 ### Framework versions
 
 - PEFT 0.12.0
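Since the card lists PEFT as the training library, using the model presumably means loading the Llama-2 chat base model and applying this adapter on top. A hedged sketch (the `adapter_id` default is a placeholder, not a confirmed Hub repo id; imports are deferred so the snippet stays importable without the libraries installed):

```python
def load_finetuned(adapter_id: str = "llama2-7B-finetuned-chat-guanaco"):
    """Load the base model and apply the PEFT adapter on top.

    `adapter_id` is a placeholder -- substitute the actual Hub repo id
    of this adapter. Calling this downloads both sets of weights.
    """
    # Local imports: the function can be defined without transformers/peft
    # installed; they are only required when it is actually called.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base_id = "NousResearch/Llama-2-7b-chat-hf"  # base model from this card
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return tokenizer, model
```

`PeftModel.from_pretrained` keeps the adapter weights separate from the base model; `model.merge_and_unload()` can fold them in afterwards if a standalone checkpoint is wanted.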