oSabre commited on
Commit
92fc183
1 Parent(s): 8f2f1bc

End of training

Browse files
README.md CHANGED
@@ -1,4 +1,6 @@
1
  ---
 
 
2
  tags:
3
  - generated_from_trainer
4
  datasets:
@@ -20,7 +22,7 @@ model-index:
20
  metrics:
21
  - name: Bleu
22
  type: bleu
23
- value: 1.1563
24
  ---
25
 
26
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -28,11 +30,11 @@ should probably proofread and complete it, then remove this comment. -->
28
 
29
  # opus_books_es_pt
30
 
31
- This model was trained from scratch on the opus_books dataset.
32
  It achieves the following results on the evaluation set:
33
- - Loss: 2.0979
34
- - Bleu: 1.1563
35
- - Gen Len: 18.5414
36
 
37
  ## Model description
38
 
@@ -52,28 +54,38 @@ More information needed
52
 
53
  The following hyperparameters were used during training:
54
  - learning_rate: 2e-05
55
- - train_batch_size: 10
56
- - eval_batch_size: 10
57
  - seed: 42
58
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
59
  - lr_scheduler_type: linear
60
- - num_epochs: 10
61
  - mixed_precision_training: Native AMP
62
 
63
  ### Training results
64
 
65
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
66
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
67
- | No log | 1.0 | 107 | 2.1971 | 1.0497 | 18.5414 |
68
- | No log | 2.0 | 214 | 2.1675 | 1.041 | 18.4774 |
69
- | No log | 3.0 | 321 | 2.1505 | 1.146 | 18.5414 |
70
- | No log | 4.0 | 428 | 2.1328 | 1.0819 | 18.4662 |
71
- | 2.1692 | 5.0 | 535 | 2.1244 | 1.0903 | 18.5075 |
72
- | 2.1692 | 6.0 | 642 | 2.1093 | 1.1418 | 18.5677 |
73
- | 2.1692 | 7.0 | 749 | 2.1045 | 1.1429 | 18.5414 |
74
- | 2.1692 | 8.0 | 856 | 2.0997 | 1.1875 | 18.5301 |
75
- | 2.1692 | 9.0 | 963 | 2.0991 | 1.1846 | 18.5414 |
76
- | 2.0458 | 10.0 | 1070 | 2.0979 | 1.1563 | 18.5414 |
 
 
 
 
 
 
 
 
 
 
77
 
78
 
79
  ### Framework versions
 
1
  ---
2
+ license: apache-2.0
3
+ base_model: t5-base
4
  tags:
5
  - generated_from_trainer
6
  datasets:
 
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
+ value: 1.2169
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
30
 
31
  # opus_books_es_pt
32
 
33
+ This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the opus_books dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 2.0763
36
+ - Bleu: 1.2169
37
+ - Gen Len: 18.5038
38
 
39
  ## Model description
40
 
 
54
 
55
  The following hyperparameters were used during training:
56
  - learning_rate: 2e-05
57
+ - train_batch_size: 8
58
+ - eval_batch_size: 8
59
  - seed: 42
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
62
+ - num_epochs: 20
63
  - mixed_precision_training: Native AMP
64
 
65
  ### Training results
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
68
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
69
+ | No log | 1.0 | 133 | 2.5227 | 0.5795 | 18.5789 |
70
+ | No log | 2.0 | 266 | 2.3918 | 0.6703 | 18.5451 |
71
+ | No log | 3.0 | 399 | 2.3166 | 0.8471 | 18.5301 |
72
+ | 2.6664 | 4.0 | 532 | 2.2665 | 0.8914 | 18.4737 |
73
+ | 2.6664 | 5.0 | 665 | 2.2319 | 0.928 | 18.4549 |
74
+ | 2.6664 | 6.0 | 798 | 2.2025 | 1.0067 | 18.5113 |
75
+ | 2.6664 | 7.0 | 931 | 2.1784 | 1.0162 | 18.515 |
76
+ | 2.2503 | 8.0 | 1064 | 2.1580 | 1.1102 | 18.5113 |
77
+ | 2.2503 | 9.0 | 1197 | 2.1420 | 1.0638 | 18.515 |
78
+ | 2.2503 | 10.0 | 1330 | 2.1257 | 1.1149 | 18.5113 |
79
+ | 2.2503 | 11.0 | 1463 | 2.1142 | 1.1334 | 18.4474 |
80
+ | 2.1172 | 12.0 | 1596 | 2.1091 | 1.1308 | 18.4925 |
81
+ | 2.1172 | 13.0 | 1729 | 2.0980 | 1.1655 | 18.5075 |
82
+ | 2.1172 | 14.0 | 1862 | 2.0950 | 1.1464 | 18.4925 |
83
+ | 2.1172 | 15.0 | 1995 | 2.0890 | 1.1383 | 18.5038 |
84
+ | 2.0185 | 16.0 | 2128 | 2.0833 | 1.1671 | 18.5 |
85
+ | 2.0185 | 17.0 | 2261 | 2.0806 | 1.1555 | 18.5038 |
86
+ | 2.0185 | 18.0 | 2394 | 2.0777 | 1.15 | 18.5113 |
87
+ | 1.9882 | 19.0 | 2527 | 2.0770 | 1.2252 | 18.5113 |
88
+ | 1.9882 | 20.0 | 2660 | 2.0763 | 1.2169 | 18.5038 |
89
 
90
 
91
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7259c01a45a834e7f60d6b3f90577887c4fba9975edb38fae2525d5660300101
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f29f309604d3ac42303211c48a8bc028b86019598403b1e98b9dc105ed608183
3
  size 891644712
runs/Dec17_15-34-49_222b4dc5c326/events.out.tfevents.1702827291.222b4dc5c326.42.5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e5458a5d95899b839304264be592d81deea2708174445b5bde1db98ba3d209f5
3
- size 12813
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ade819ff78a83d6029a2bc8a478422e554283570f887882ac88a7960fa5e579d
3
+ size 13907