oSabre
/

opus_books_es_pt

@@ -1,4 +1,6 @@
 ---
 tags:
 - generated_from_trainer
 datasets:
@@ -20,7 +22,7 @@ model-index:
     metrics:
     - name: Bleu
       type: bleu
-      value: 1.1563
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -28,11 +30,11 @@ should probably proofread and complete it, then remove this comment. -->
 # opus_books_es_pt
-This model was trained from scratch on the opus_books dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0979
-- Bleu: 1.1563
-- Gen Len: 18.5414
 ## Model description
@@ -52,28 +54,38 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 10
-- eval_batch_size: 10
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| No log        | 1.0   | 107  | 2.1971          | 1.0497 | 18.5414 |
-| No log        | 2.0   | 214  | 2.1675          | 1.041  | 18.4774 |
-| No log        | 3.0   | 321  | 2.1505          | 1.146  | 18.5414 |
-| No log        | 4.0   | 428  | 2.1328          | 1.0819 | 18.4662 |
-| 2.1692        | 5.0   | 535  | 2.1244          | 1.0903 | 18.5075 |
-| 2.1692        | 6.0   | 642  | 2.1093          | 1.1418 | 18.5677 |
-| 2.1692        | 7.0   | 749  | 2.1045          | 1.1429 | 18.5414 |
-| 2.1692        | 8.0   | 856  | 2.0997          | 1.1875 | 18.5301 |
-| 2.1692        | 9.0   | 963  | 2.0991          | 1.1846 | 18.5414 |
-| 2.0458        | 10.0  | 1070 | 2.0979          | 1.1563 | 18.5414 |
 ### Framework versions

 ---
+license: apache-2.0
+base_model: t5-base
 tags:
 - generated_from_trainer
 datasets:
     metrics:
     - name: Bleu
       type: bleu
+      value: 1.2169
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # opus_books_es_pt
+This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the opus_books dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0763
+- Bleu: 1.2169
+- Gen Len: 18.5038
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+| No log        | 1.0   | 133  | 2.5227          | 0.5795 | 18.5789 |
+| No log        | 2.0   | 266  | 2.3918          | 0.6703 | 18.5451 |
+| No log        | 3.0   | 399  | 2.3166          | 0.8471 | 18.5301 |
+| 2.6664        | 4.0   | 532  | 2.2665          | 0.8914 | 18.4737 |
+| 2.6664        | 5.0   | 665  | 2.2319          | 0.928  | 18.4549 |
+| 2.6664        | 6.0   | 798  | 2.2025          | 1.0067 | 18.5113 |
+| 2.6664        | 7.0   | 931  | 2.1784          | 1.0162 | 18.515  |
+| 2.2503        | 8.0   | 1064 | 2.1580          | 1.1102 | 18.5113 |
+| 2.2503        | 9.0   | 1197 | 2.1420          | 1.0638 | 18.515  |
+| 2.2503        | 10.0  | 1330 | 2.1257          | 1.1149 | 18.5113 |
+| 2.2503        | 11.0  | 1463 | 2.1142          | 1.1334 | 18.4474 |
+| 2.1172        | 12.0  | 1596 | 2.1091          | 1.1308 | 18.4925 |
+| 2.1172        | 13.0  | 1729 | 2.0980          | 1.1655 | 18.5075 |
+| 2.1172        | 14.0  | 1862 | 2.0950          | 1.1464 | 18.4925 |
+| 2.1172        | 15.0  | 1995 | 2.0890          | 1.1383 | 18.5038 |
+| 2.0185        | 16.0  | 2128 | 2.0833          | 1.1671 | 18.5    |
+| 2.0185        | 17.0  | 2261 | 2.0806          | 1.1555 | 18.5038 |
+| 2.0185        | 18.0  | 2394 | 2.0777          | 1.15   | 18.5113 |
+| 1.9882        | 19.0  | 2527 | 2.0770          | 1.2252 | 18.5113 |
+| 1.9882        | 20.0  | 2660 | 2.0763          | 1.2169 | 18.5038 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7259c01a45a834e7f60d6b3f90577887c4fba9975edb38fae2525d5660300101
 size 891644712

 version https://git-lfs.github.com/spec/v1
+oid sha256:f29f309604d3ac42303211c48a8bc028b86019598403b1e98b9dc105ed608183
 size 891644712

runs/Dec17_15-34-49_222b4dc5c326/events.out.tfevents.1702827291.222b4dc5c326.42.5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e5458a5d95899b839304264be592d81deea2708174445b5bde1db98ba3d209f5
-size 12813

 version https://git-lfs.github.com/spec/v1
+oid sha256:ade819ff78a83d6029a2bc8a478422e554283570f887882ac88a7960fa5e579d
+size 13907