Oysiyl committed on
Commit 0417558
1 Parent(s): 7b2fdc9

End of training

README.md CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 4.1552
- - Bleu: 0.0813
- - Gen Len: 16.4792
+ - Loss: 3.9119
+ - Bleu: 0.1206
+ - Gen Len: 16.6875
 
  ## Model description
 
@@ -45,18 +45,23 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 10
  - mixed_precision_training: Native AMP
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
- | 4.9485 | 1.0 | 144 | 4.4460 | 0.0 | 16.875 |
- | 4.515 | 2.0 | 288 | 4.2735 | 0.0 | 16.625 |
- | 4.3579 | 3.0 | 432 | 4.1977 | 0.0 | 16.7014 |
- | 4.3095 | 4.0 | 576 | 4.1644 | 0.0818 | 16.5417 |
- | 4.2744 | 5.0 | 720 | 4.1552 | 0.0813 | 16.4792 |
+ | 4.9376 | 1.0 | 144 | 4.3320 | 0.1122 | 16.9514 |
+ | 4.5062 | 2.0 | 288 | 4.1559 | 0.1191 | 15.9653 |
+ | 4.3624 | 3.0 | 432 | 4.0699 | 0.1118 | 16.7361 |
+ | 4.2832 | 4.0 | 576 | 4.0222 | 0.1474 | 16.3125 |
+ | 4.2286 | 5.0 | 720 | 3.9854 | 0.1421 | 16.2778 |
+ | 4.1762 | 6.0 | 864 | 3.9550 | 0.1107 | 16.9583 |
+ | 4.14 | 7.0 | 1008 | 3.9386 | 0.1215 | 16.1944 |
+ | 4.087 | 8.0 | 1152 | 3.9221 | 0.1232 | 16.7153 |
+ | 4.0988 | 9.0 | 1296 | 3.9154 | 0.1206 | 16.6875 |
+ | 4.0738 | 10.0 | 1440 | 3.9119 | 0.1206 | 16.6875 |
 
 
  ### Framework versions
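
The Step column in the README table advances by 144 per epoch, so raising `num_epochs` from 5 to 10 doubles the run from 720 to 1440 optimizer steps. The sketch below illustrates what `lr_scheduler_type: linear` would imply over those two horizons. It assumes no warmup and uses a hypothetical peak learning rate, since this diff shows neither the learning rate nor any warmup setting; `total_steps` and `linear_lr` are illustrative names, not library functions.

```python
STEPS_PER_EPOCH = 144  # read off the Step column: 144 optimizer steps per epoch


def total_steps(num_epochs: int, steps_per_epoch: int = STEPS_PER_EPOCH) -> int:
    """Total optimizer steps for a run of `num_epochs` epochs."""
    return num_epochs * steps_per_epoch


def linear_lr(step: int, total: int, peak_lr: float) -> float:
    """Linear decay from peak_lr at step 0 to 0 at step `total`.

    Assumption: no warmup phase (the diff does not show one).
    """
    remaining = max(0, total - step)
    return peak_lr * remaining / total


if __name__ == "__main__":
    HYPOTHETICAL_PEAK_LR = 1e-3  # assumption for illustration only
    for epochs in (5, 10):
        total = total_steps(epochs)
        lr_at_720 = linear_lr(720, total, HYPOTHETICAL_PEAK_LR)
        print(f"epochs={epochs} total_steps={total} lr_at_step_720={lr_at_720}")
```

Under these assumptions, the 5-epoch schedule has decayed to zero by step 720, while the 10-epoch schedule is still at half its peak at the same step.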
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ef0a51adef99bb0c335a98548f2911f9d3bd1b2bb9d0b40953248c994da60fc1
+ oid sha256:74018eca5257836ae7351a0ca339b40176188456b0943543e4e6dcd6c66336f9
  size 242041896
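
The `model.safetensors` entry is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, the SHA-256 object id, and the size in bytes. Only the `oid` changes in this commit; the size stays at 242041896 bytes. A minimal sketch of reading such a pointer follows, where `parse_lfs_pointer` is a hypothetical helper name and the parsing is deliberately simplified to the three fields shown here.

```python
# The new pointer contents, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:74018eca5257836ae7351a0ca339b40176188456b0943543e4e6dcd6c66336f9
size 242041896
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


if __name__ == "__main__":
    print(parse_lfs_pointer(NEW_POINTER))
```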
runs/Sep17_19-36-17_ip-10-192-12-112/events.out.tfevents.1726601782.ip-10-192-12-112.1319.18 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a3dc4c6a07078d557dca62396e88d8b84360096651ed1606852dd9ca499c1b21
+ size 9208
runs/Sep17_19-38-51_ip-10-192-12-112/events.out.tfevents.1726601939.ip-10-192-12-112.1319.19 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cd6462613bd8080f6d13de659383a8d395a5e6618127b3d4987a415fc50d0688
+ size 6884
runs/Sep22_09-15-13_ip-10-192-12-209/events.out.tfevents.1726996524.ip-10-192-12-209.1556.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:37ea71638cbcb010b9342e571eee1ab1b69a930e7b9b1f46091420917b6a8a37
+ size 6884
runs/Sep22_09-16-22_ip-10-192-12-209/events.out.tfevents.1726996587.ip-10-192-12-209.1556.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9d59cfa20037380d85a02e2e6aeb6a82739ecd05135ea78c77ca67fd822729d4
+ size 9208
runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726996817.ip-10-192-12-209.6211.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b7581d39ec1a1f77e3900c64f120278ef60b66bdba2d417d81d395344f7e4246
+ size 9210
runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726997064.ip-10-192-12-209.6211.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c0a7508158d184765b0a7d418e910af45a218820036289dcf735d10595c9c82d
+ size 9238
runs/Sep22_09-30-50_ip-10-192-12-209/events.out.tfevents.1726997458.ip-10-192-12-209.15554.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:60e748fd26f91e4a4b8880b90a5e266dc4c45908fe1b9cf0ae90400be15b6508
+ size 9251
runs/Sep22_10-00-02_ip-10-192-12-209/events.out.tfevents.1726999203.ip-10-192-12-209.43955.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:69a0c83f69fd3b6005422e897d5f71414e4070e74028c54757e56ec719831960
+ size 12114
tokenizer.json CHANGED
@@ -1,6 +1,11 @@
  {
    "version": "1.0",
-   "truncation": null,
+   "truncation": {
+     "direction": "Right",
+     "max_length": 128,
+     "strategy": "LongestFirst",
+     "stride": 0
+   },
    "padding": null,
    "added_tokens": [
      {
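
The new `truncation` block caps tokenized inputs at 128 tokens, cutting from the right, with a stride of 0. The sketch below mimics that effect on a plain Python list, for illustration only. It is not the `tokenizers` library implementation: it handles a single sequence, ignores special tokens, and does not model the `LongestFirst` pair strategy. `truncate_right` is a hypothetical name.

```python
from typing import List, Tuple

MAX_LENGTH = 128  # "max_length" from the new truncation block


def truncate_right(
    ids: List[int], max_length: int = MAX_LENGTH
) -> Tuple[List[int], List[int]]:
    """Keep the first `max_length` ids and drop the rest from the right.

    Returns (kept, dropped). Simplification: a single sequence with no
    special tokens counted against the limit.
    """
    if max_length < 0:
        raise ValueError("max_length must be non-negative")
    return ids[:max_length], ids[max_length:]


if __name__ == "__main__":
    kept, dropped = truncate_right(list(range(200)))
    print(len(kept), len(dropped))
```

Inputs that are already at or under the limit pass through unchanged, with an empty `dropped` list.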
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d0182e1228d08bf24a4de1d1df8964f45fa2a8a5c19b929a6b7c93ef082f1ebc
+ oid sha256:d98df7c46aa141f4b035a0b5676c4b4bae82cec1f793a6427a332f37b21643e5
  size 5368