Commit • 4df1f5f, committed by thespicynoodle
1 parent: dee7008
thespicynoodle/tutorialone
README.md
CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.8978
 
 ## Model description
 
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-|
-|
-|
-| 2.
-| 2.
-|
-|
-| 1.
-|
-| 1.
+| 3.4004 | 0.92 | 9 | 2.6415 |
+| 2.5356 | 1.95 | 19 | 2.3921 |
+| 2.2429 | 2.97 | 29 | 2.2271 |
+| 2.0182 | 4.0 | 39 | 2.0984 |
+| 2.0963 | 4.92 | 48 | 2.0157 |
+| 1.7926 | 5.95 | 58 | 1.9555 |
+| 1.7221 | 6.97 | 68 | 1.9214 |
+| 1.6816 | 8.0 | 78 | 1.9057 |
+| 1.8335 | 8.92 | 87 | 1.8983 |
+| 1.2605 | 9.23 | 90 | 1.8978 |
 
 
 ### Framework versions
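As a quick sanity check on the README.md change, the added table rows can be parsed to confirm that the last validation loss is the value reported as the evaluation loss. This is only a sketch: the rows are copied by hand from the added side of the diff, and `parse_rows` is a hypothetical helper, not part of any training tooling.

```python
# Rows copied from the added (+) side of the README.md diff.
TABLE = """\
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 3.4004 | 0.92 | 9 | 2.6415 |
| 2.5356 | 1.95 | 19 | 2.3921 |
| 2.2429 | 2.97 | 29 | 2.2271 |
| 2.0182 | 4.0 | 39 | 2.0984 |
| 2.0963 | 4.92 | 48 | 2.0157 |
| 1.7926 | 5.95 | 58 | 1.9555 |
| 1.7221 | 6.97 | 68 | 1.9214 |
| 1.6816 | 8.0 | 78 | 1.9057 |
| 1.8335 | 8.92 | 87 | 1.8983 |
| 1.2605 | 9.23 | 90 | 1.8978 |
"""


def parse_rows(table: str) -> list[dict]:
    """Parse the markdown table into one dict per data row (hypothetical helper)."""
    rows = []
    # Skip the header row and the alignment row.
    for line in table.strip().splitlines()[2:]:
        train, epoch, step, val = (c.strip() for c in line.strip().strip("|").split("|"))
        rows.append({
            "train_loss": float(train),
            "epoch": float(epoch),
            "step": int(step),
            "val_loss": float(val),
        })
    return rows


rows = parse_rows(TABLE)
final = rows[-1]
best = min(rows, key=lambda r: r["val_loss"])
# True when every evaluation improved on the previous one.
val_always_improved = all(a["val_loss"] > b["val_loss"] for a, b in zip(rows, rows[1:]))

print(final["step"], final["val_loss"], best is final, val_always_improved)
```

In these rows the validation loss falls at every evaluation, so the final row is also the best one, while the training loss is noisier from step to step.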
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:e009a73fb7ebb83da74894d987e0dd86db6d4b3d5473885c96de883e9c6b1d8e
 size 8397056
runs/Mar13_06-06-53_6588bd6c37be/events.out.tfevents.1710310017.6588bd6c37be.8932.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ab31ace57299f1cee1a161230a2c13d49e3872b58249740e30e99f2735b0676f
+size 10286
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:67ea94c8e185c4ebe0aaa73759a895700b2e040cfdc0112bf5f184f0d9ad1203
 size 4856
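The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. The sketch below shows one way a downloaded file could be checked against such a pointer. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the whole file is read into memory for simplicity.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest named by the pointer."""
    data = Path(path).read_bytes()
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer text copied from the adapter_model.safetensors diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e009a73fb7ebb83da74894d987e0dd86db6d4b3d5473885c96de883e9c6b1d8e
size 8397056
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Assuming a local copy of the adapter exists, `matches_pointer("adapter_model.safetensors", pointer)` would report whether it is the object this commit points to. Note that the adapter's size is 8397056 bytes both before and after the change, so only the digest distinguishes the two versions.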