Impact of Weight Decay on MBart-large-50 for EN-ES
Collection
5 items
•
Updated
This model is a fine-tuned version of facebook/mbart-large-50 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Bleu | Rouge |
---|---|---|---|---|---|
1.4627 | 1.0 | 4500 | 1.0255 | 42.1880 | {'rouge1': 0.6725633216905762, 'rouge2': 0.48605402524493657, 'rougeL': 0.6498853764470456, 'rougeLsum': 0.6501981166312041} |
0.8878 | 2.0 | 9000 | 0.9572 | 44.1734 | {'rouge1': 0.6912686406245903, 'rouge2': 0.5093695171345348, 'rougeL': 0.6701896043455414, 'rougeLsum': 0.6703473419504804} |
0.7125 | 3.0 | 13500 | 0.9414 | 44.8709 | {'rouge1': 0.7051197958532004, 'rouge2': 0.5210482863677958, 'rougeL': 0.6843075431636916, 'rougeLsum': 0.6846265298079588} |
0.6092 | 4.0 | 18000 | 0.9549 | 45.0821 | {'rouge1': 0.7047932899349161, 'rouge2': 0.523739339466653, 'rougeL': 0.6840127607742443, 'rougeLsum': 0.684202100852132} |
Base model
facebook/mbart-large-50