Venkatesh4342 commited on
Commit
357cd6e
1 Parent(s): 9ea12a8

End of training

Browse files
Files changed (1) hide show
  1. README.md +10 -8
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.2597
21
- - Bleu: 0.03
22
- - Gen Len: 17.8329
23
 
24
  ## Model description
25
 
@@ -39,9 +39,11 @@ More information needed
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 2e-05
42
- - train_batch_size: 16
43
- - eval_batch_size: 16
44
  - seed: 42
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - num_epochs: 1
@@ -49,9 +51,9 @@ The following hyperparameters were used during training:
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
- |:-------------:|:-----:|:-----:|:---------------:|:----:|:-------:|
54
- | 0.2379 | 1.0 | 58095 | 0.2597 | 0.03 | 17.8329 |
55
 
56
 
57
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.2581
21
+ - Bleu: 0.0425
22
+ - Gen Len: 17.7806
23
 
24
  ## Model description
25
 
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 2e-05
42
+ - train_batch_size: 6
43
+ - eval_batch_size: 6
44
  - seed: 42
45
+ - gradient_accumulation_steps: 3
46
+ - total_train_batch_size: 18
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
  - num_epochs: 1
 
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
55
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
56
+ | 0.2411 | 1.0 | 51639 | 0.2581 | 0.0425 | 17.7806 |
57
 
58
 
59
  ### Framework versions