---
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_keras_callback
model-index:
  - name: pijarcandra22/t5Indo2Sunda
    results: []
---

# pijarcandra22/t5Indo2Sunda

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset. It achieves the following results on the evaluation set (a minimal usage sketch follows the list):

- Train Loss: 2.7225
- Validation Loss: 2.5299
- Epoch: 27

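The snippet below is a minimal, untested sketch of how a checkpoint like this can be loaded and queried with the Transformers TensorFlow classes. The repository id `pijarcandra22/t5Indo2Sunda` comes from this card; the example input sentence and the absence of a T5 task prefix are assumptions, since the training setup is not documented here.

```python
# Minimal sketch (assumptions noted above): load the fine-tuned checkpoint
# with the TensorFlow classes and translate one Indonesian sentence.
from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM

repo_id = "pijarcandra22/t5Indo2Sunda"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = TFAutoModelForSeq2SeqLM.from_pretrained(repo_id)

# Hypothetical Indonesian input; whether a task prefix is required depends on
# how the training data was preprocessed, which this card does not record.
inputs = tokenizer("Saya sedang belajar bahasa Sunda.", return_tensors="tf")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```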
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a reconstruction sketch in Keras follows the list):

- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32

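The optimizer dictionary above maps almost one-to-one onto the `AdamWeightDecay` optimizer shipped with Transformers. The sketch below is an assumption-laden reconstruction, not the original training script: the dataset, batch size, and preprocessing are not recorded in this card.

```python
# Sketch: rebuild the recorded optimizer settings with the Transformers TF
# utilities (the surrounding data pipeline is assumed, not documented).
from transformers import TFAutoModelForSeq2SeqLM, AdamWeightDecay

optimizer = AdamWeightDecay(
    learning_rate=2e-5,      # 'learning_rate' from the card
    weight_decay_rate=0.01,  # 'weight_decay_rate'
    beta_1=0.9,
    beta_2=0.999,
    epsilon=1e-07,
)

model = TFAutoModelForSeq2SeqLM.from_pretrained("t5-small")
# When compiled without an explicit loss, the Transformers Keras model
# falls back to its internal seq2seq loss, trained here in float32.
model.compile(optimizer=optimizer)
# model.fit(train_dataset, validation_data=val_dataset, epochs=28)  # datasets not documented
```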
### Training results

| Train Loss | Validation Loss | Epoch |
|:----------:|:---------------:|:-----:|
| 4.3724     | 3.9124          | 0     |
| 3.9887     | 3.6348          | 1     |
| 3.7534     | 3.4215          | 2     |
| 3.5819     | 3.2847          | 3     |
| 3.4632     | 3.1902          | 4     |
| 3.3751     | 3.1139          | 5     |
| 3.3039     | 3.0493          | 6     |
| 3.2447     | 2.9955          | 7     |
| 3.1911     | 2.9481          | 8     |
| 3.1455     | 2.9082          | 9     |
| 3.1068     | 2.8718          | 10    |
| 3.0697     | 2.8387          | 11    |
| 3.0381     | 2.8105          | 12    |
| 3.0050     | 2.7825          | 13    |
| 2.9796     | 2.7568          | 14    |
| 2.9510     | 2.7350          | 15    |
| 2.9259     | 2.7096          | 16    |
| 2.9053     | 2.6881          | 17    |
| 2.8833     | 2.6696          | 18    |
| 2.8599     | 2.6510          | 19    |
| 2.8403     | 2.6328          | 20    |
| 2.8207     | 2.6171          | 21    |
| 2.8046     | 2.5999          | 22    |
| 2.7861     | 2.5857          | 23    |
| 2.7715     | 2.5699          | 24    |
| 2.7557     | 2.5542          | 25    |
| 2.7387     | 2.5420          | 26    |
| 2.7225     | 2.5299          | 27    |

### Framework versions

- Transformers 4.35.2
- TensorFlow 2.14.0
- Datasets 2.15.0
- Tokenizers 0.15.0