NMTIndoBaliBART / README.md
pijarcandra22's picture
Training in progress epoch 82
79c79c1
|
raw
history blame
No virus
4.71 kB
metadata
license: apache-2.0
base_model: facebook/bart-base
tags:
  - generated_from_keras_callback
model-index:
  - name: pijarcandra22/NMTIndoBaliBART
    results: []

pijarcandra22/NMTIndoBaliBART

This model is a fine-tuned version of facebook/bart-base on an unknown dataset. It achieves the following results on the evaluation set:

  • Train Loss: 5.5354
  • Validation Loss: 5.5570
  • Epoch: 82

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 0.02, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
  • training_precision: float32

Training results

Train Loss Validation Loss Epoch
9.7885 5.6003 0
5.5737 5.5523 1
5.5346 5.5361 2
5.5189 5.5283 3
5.5149 5.5252 4
5.5123 5.5233 5
5.5116 5.5485 6
5.5095 5.5314 7
5.5120 5.5569 8
5.5137 5.5239 9
5.5170 5.5289 10
5.5180 5.5298 11
5.5217 5.5513 12
5.5219 5.5344 13
5.5248 5.5366 14
5.5268 5.5493 15
5.5260 5.5313 16
5.5290 5.5462 17
5.5299 5.5570 18
5.5293 5.5480 19
5.5378 5.5524 20
5.5317 5.5740 21
5.5328 5.5543 22
5.5327 5.5537 23
5.5330 5.5356 24
5.5304 5.5492 25
5.5355 5.5388 26
5.5337 5.5812 27
5.5355 5.5598 28
5.5348 5.5489 29
5.5373 5.5526 30
5.5357 5.5575 31
5.5377 5.5439 32
5.5404 5.5367 33
5.5383 5.5819 34
5.5359 5.5815 35
5.5370 5.5499 36
5.5340 5.5622 37
5.5373 5.5667 38
5.5360 5.5548 39
5.5327 5.5555 40
5.5365 5.5642 41
5.5375 5.5496 42
5.5336 5.5424 43
5.5359 5.5761 44
5.5360 5.5821 45
5.5362 5.5742 46
5.5352 5.5635 47
5.5335 5.5507 48
5.5340 5.5613 49
5.5368 5.5599 50
5.5375 5.5541 51
5.5368 5.5536 52
5.5366 5.5438 53
5.5363 5.5497 54
5.5364 5.5721 55
5.5388 5.5493 56
5.5361 5.5719 57
5.5372 5.5920 58
5.5346 5.5534 59
5.5354 5.5526 60
5.5357 5.5788 61
5.5370 5.5531 62
5.5374 5.5613 63
5.5366 5.5585 64
5.5370 5.5652 65
5.5354 5.5463 66
5.5354 5.5689 67
5.5355 5.5508 68
5.5350 5.5522 69
5.5334 5.5574 70
5.5354 5.5555 71
5.5354 5.5503 72
5.5368 5.5562 73
5.5373 5.5524 74
5.5356 5.5544 75
5.5365 5.5508 76
5.5357 5.5650 77
5.5355 5.5665 78
5.5365 5.5471 79
5.5356 5.5535 80
5.5357 5.5801 81
5.5354 5.5570 82

Framework versions

  • Transformers 4.40.2
  • TensorFlow 2.15.0
  • Datasets 2.19.1
  • Tokenizers 0.19.1