bala3040/DPB_buster

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1587
 ## Model description
@@ -44,15 +44,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8033        | 1.0   | 175  | 1.1305          |
-| 0.4034        | 2.0   | 350  | 1.1587          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7201
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.4537        | 1.0   | 175  | 0.6512          |
+| 0.1981        | 2.0   | 350  | 0.6483          |
+| 0.1697        | 3.0   | 525  | 0.6851          |
+| 0.1532        | 4.0   | 700  | 0.7201          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51d5199221be9afed5db52ce53b5a06b804fc46e72359bf0cd075b11370aaac8
 size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c46540d3d4d4d9bd0f1988f096c6da19a0c46d0ab241f56935c8663d70726a7
 size 8397056

runs/Sep29_13-30-01_6c53b4b1abd0/events.out.tfevents.1727616603.6c53b4b1abd0.1477.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b59cc29dedf68ebaf315dd4d2291bda56bae196ca2311cc4d5435da3326d7248
+size 7853

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b2973f658e68e28309b256887f099f4d78c3800411cf5e5bd707b98405a55508
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:e81a41f7831702cb109661f6b3731c2f081cab55ffea5d4fe2ccb9e910c47269
 size 5176