--- license: apache-2.0 base_model: t5-base tags: - generated_from_trainer metrics: - bleu model-index: - name: t5-base-finetuned-c_zh-to-m_zh results: [] --- # t5-base-finetuned-c_zh-to-m_zh This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on an unknown dataset. It achieves the following results on the evaluation set: - Loss: 0.1259 - Bleu: 86.7906 - Gen Len: 7.5595 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 2e-05 - train_batch_size: 16 - eval_batch_size: 16 - seed: 42 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lr_scheduler_type: linear - num_epochs: 15 - mixed_precision_training: Native AMP ### Training results | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len | |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:| | No log | 1.0 | 234 | 0.1445 | 86.8464 | 7.5788 | | No log | 2.0 | 468 | 0.1357 | 86.5266 | 7.6088 | | 0.194 | 3.0 | 702 | 0.1317 | 86.8389 | 7.5756 | | 0.194 | 4.0 | 936 | 0.1302 | 87.0248 | 7.5648 | | 0.1428 | 5.0 | 1170 | 0.1372 | 85.5082 | 7.686 | | 0.1428 | 6.0 | 1404 | 0.1297 | 86.3936 | 7.612 | | 0.1328 | 7.0 | 1638 | 0.1273 | 86.7919 | 7.5745 | | 0.1328 | 8.0 | 1872 | 0.1266 | 86.7919 | 7.5745 | | 0.129 | 9.0 | 2106 | 0.1262 | 86.9787 | 7.5606 | | 0.129 | 10.0 | 2340 | 0.1256 | 86.882 | 7.5616 | | 0.1262 | 11.0 | 2574 | 0.1259 | 86.9757 | 7.5616 | | 0.1262 | 12.0 | 2808 | 0.1255 | 86.8843 | 7.5595 | | 0.1262 | 13.0 | 3042 | 0.1257 | 86.9322 | 7.5584 | | 0.1262 | 14.0 | 3276 | 0.1256 | 87.0674 | 7.5563 | | 0.1238 | 15.0 | 3510 | 0.1259 | 86.7906 | 7.5595 | ### Framework versions - Transformers 4.41.2 - Pytorch 2.3.0+cu121 - Datasets 2.20.0 - Tokenizers 0.19.1