Oysiyl commited on
Commit
7b2fdc9
1 Parent(s): 8879485

End of training

Browse files
README.md CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 4.4268
22
- - Bleu: 0.0
23
- - Gen Len: 16.7639
24
 
25
  ## Model description
26
 
@@ -40,19 +40,23 @@ More information needed
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 2e-05
43
- - train_batch_size: 1
44
- - eval_batch_size: 1
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
- - num_epochs: 1
49
  - mixed_precision_training: Native AMP
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
- |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
55
- | 4.7866 | 1.0 | 576 | 4.4268 | 0.0 | 16.7639 |
 
 
 
 
56
 
57
 
58
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 4.1552
22
+ - Bleu: 0.0813
23
+ - Gen Len: 16.4792
24
 
25
  ## Model description
26
 
 
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 2e-05
43
+ - train_batch_size: 4
44
+ - eval_batch_size: 4
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 5
49
  - mixed_precision_training: Native AMP
50
 
51
  ### Training results
52
 
53
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
55
+ | 4.9485 | 1.0 | 144 | 4.4460 | 0.0 | 16.875 |
56
+ | 4.515 | 2.0 | 288 | 4.2735 | 0.0 | 16.625 |
57
+ | 4.3579 | 3.0 | 432 | 4.1977 | 0.0 | 16.7014 |
58
+ | 4.3095 | 4.0 | 576 | 4.1644 | 0.0818 | 16.5417 |
59
+ | 4.2744 | 5.0 | 720 | 4.1552 | 0.0813 | 16.4792 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8fa26e8a238bf71b7e6616afaeeaac09bdc2647f8521d45fb1070e1f3cfabe5c
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef0a51adef99bb0c335a98548f2911f9d3bd1b2bb9d0b40953248c994da60fc1
3
  size 242041896
runs/Sep17_19-06-04_ip-10-192-12-112/events.out.tfevents.1726599970.ip-10-192-12-112.1319.9 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3a4082b6c65546b8ae48cfdf920a351e6947871dadf38a0d9d0b3f2c9fdf2d7b
3
+ size 6883
runs/Sep17_19-10-17_ip-10-192-12-112/events.out.tfevents.1726600222.ip-10-192-12-112.1319.10 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:969eaf27eafc6749abea2746ec355b2a9e677601e08aab7736f7a42fcb831bb8
3
+ size 6159
runs/Sep17_19-12-26_ip-10-192-12-112/events.out.tfevents.1726600351.ip-10-192-12-112.1319.11 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce01b0863e76b937d6e5c5bbe7207332120d68149ae17f1c2b8c5cccd6e0d7ce
3
+ size 6160
runs/Sep17_19-21-44_ip-10-192-12-112/events.out.tfevents.1726600908.ip-10-192-12-112.1319.12 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3dc293ceccc054f6e29f1b60143509718e12c4ad9e2fe7fdcc2bf5c142dfe788
3
+ size 6161
runs/Sep17_19-24-59_ip-10-192-12-112/events.out.tfevents.1726601104.ip-10-192-12-112.1319.13 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df1d8d2f00aa162654f8675c7839ef1269b74a7e6e08096b14df3a165cdc3240
3
+ size 6161
runs/Sep17_19-26-18_ip-10-192-12-112/events.out.tfevents.1726601183.ip-10-192-12-112.1319.14 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:82fdbdad9fd512f7c6be8c9ab63ed4dff6862ec73a51b64726d05c6447b20512
3
+ size 7465
runs/Sep17_19-29-03_ip-10-192-12-112/events.out.tfevents.1726601349.ip-10-192-12-112.1319.15 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4bdd2d6245130e7014688617a43d655ac25ec0720fb1ad9f40593a8bc98f0a6
3
+ size 6161
runs/Sep17_19-29-30_ip-10-192-12-112/events.out.tfevents.1726601373.ip-10-192-12-112.1319.16 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:80f2ba2efca47533f703af195cfe726437eac47c040910f7fcc277c4727f8b7d
3
+ size 7465
runs/Sep17_19-31-19_ip-10-192-12-112/events.out.tfevents.1726601486.ip-10-192-12-112.1319.17 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb99c20945da1555d4c8cf92ce006968a74fcdc2a596e1296156e9fff9aff13c
3
+ size 9208
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
1
  {
2
  "version": "1.0",
3
- "truncation": {
4
- "direction": "Right",
5
- "max_length": 128,
6
- "strategy": "LongestFirst",
7
- "stride": 0
8
- },
9
  "padding": null,
10
  "added_tokens": [
11
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4c878264c39ad34daf60faf954168f50e239eed30765cb20a2e2a0dc4611bacd
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0182e1228d08bf24a4de1d1df8964f45fa2a8a5c19b929a6b7c93ef082f1ebc
3
  size 5368