t5_cause_classifier / README.md
metadata
license: apache-2.0
base_model: google/flan-t5-base
tags:
  - generated_from_trainer
metrics:
  - f1
  - accuracy
model-index:
  - name: t5_cause_classifier
    results: []

t5_cause_classifier

This model is a fine-tuned version of google/flan-t5-base; the training dataset is not specified. After the final epoch, it achieves the following results on the evaluation set:

  • Loss: 0.2104
  • F1: 0.8112
  • Accuracy: 0.3196

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 32
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20
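As a rough illustration of what these settings imply, the sketch below collects them in a dictionary and computes the learning rate a linear schedule would produce at a given step. The key names mirror the Hugging Face `TrainingArguments` fields these values most likely correspond to, which is an assumption. The card lists no warmup, so `warmup_steps=0` is also assumed, and `linear_lr` is a hypothetical helper, not code from the original training run. The figure of 124 steps per epoch is taken from the training results table.

```python
# Hyperparameters as listed in the card. Key names follow the
# TrainingArguments fields they most likely map to (an assumption).
HPARAMS = {
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 32,
    "per_device_eval_batch_size": 8,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 20,
}

STEPS_PER_EPOCH = 124  # read off the training results table
TOTAL_STEPS = STEPS_PER_EPOCH * HPARAMS["num_train_epochs"]


def linear_lr(step, base_lr=HPARAMS["learning_rate"],
              total_steps=TOTAL_STEPS, warmup_steps=0):
    """Hypothetical helper: linear warmup, then linear decay to zero.

    warmup_steps=0 is an assumption; the card does not list a warmup.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 620, 1240, 1860, 2480):
        print(f"step {step:4d}: lr = {linear_lr(step):.2e}")
```

Under these assumptions the learning rate falls to half its initial value at the end of epoch 10 and reaches zero at the last step, which is consistent with the validation loss flattening out in the later epochs.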

Training results

| Training Loss | Epoch | Step | Validation Loss | F1     | Accuracy |
|---------------|-------|------|-----------------|--------|----------|
| No log        | 1.0   | 124  | 0.3976          | 0.5740 | 0.0      |
| No log        | 2.0   | 248  | 0.2948          | 0.7116 | 0.0907   |
| No log        | 3.0   | 372  | 0.2657          | 0.7396 | 0.1200   |
| No log        | 4.0   | 496  | 0.2514          | 0.7599 | 0.1724   |
| 0.3526        | 5.0   | 620  | 0.2406          | 0.7706 | 0.2006   |
| 0.3526        | 6.0   | 744  | 0.2318          | 0.7809 | 0.2147   |
| 0.3526        | 7.0   | 868  | 0.2267          | 0.7910 | 0.2591   |
| 0.3526        | 8.0   | 992  | 0.2227          | 0.7949 | 0.2742   |
| 0.2263        | 9.0   | 1116 | 0.2178          | 0.7993 | 0.2772   |
| 0.2263        | 10.0  | 1240 | 0.2161          | 0.8028 | 0.2923   |
| 0.2263        | 11.0  | 1364 | 0.2158          | 0.8024 | 0.2873   |
| 0.2263        | 12.0  | 1488 | 0.2144          | 0.8033 | 0.2984   |
| 0.2005        | 13.0  | 1612 | 0.2125          | 0.8074 | 0.3054   |
| 0.2005        | 14.0  | 1736 | 0.2118          | 0.8076 | 0.3206   |
| 0.2005        | 15.0  | 1860 | 0.2120          | 0.8093 | 0.3095   |
| 0.2005        | 16.0  | 1984 | 0.2122          | 0.8088 | 0.3196   |
| 0.1877        | 17.0  | 2108 | 0.2109          | 0.8096 | 0.3155   |
| 0.1877        | 18.0  | 2232 | 0.2106          | 0.8103 | 0.3135   |
| 0.1877        | 19.0  | 2356 | 0.2109          | 0.8099 | 0.3155   |
| 0.1877        | 20.0  | 2480 | 0.2104          | 0.8112 | 0.3196   |
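The table can be checked with a few lines of arithmetic. The sketch below transcribes the per-epoch evaluation metrics, finds the best epoch for each metric, and bounds the training set size from the 124 steps per epoch. The size bound assumes a single device, no gradient accumulation, and a partial final batch that is kept rather than dropped; none of these is stated in the card. The variable names are illustrative.

```python
# (epoch, validation_loss, f1, accuracy), transcribed from the table.
ROWS = [
    (1, 0.3976, 0.5740, 0.0),    (2, 0.2948, 0.7116, 0.0907),
    (3, 0.2657, 0.7396, 0.1200), (4, 0.2514, 0.7599, 0.1724),
    (5, 0.2406, 0.7706, 0.2006), (6, 0.2318, 0.7809, 0.2147),
    (7, 0.2267, 0.7910, 0.2591), (8, 0.2227, 0.7949, 0.2742),
    (9, 0.2178, 0.7993, 0.2772), (10, 0.2161, 0.8028, 0.2923),
    (11, 0.2158, 0.8024, 0.2873), (12, 0.2144, 0.8033, 0.2984),
    (13, 0.2125, 0.8074, 0.3054), (14, 0.2118, 0.8076, 0.3206),
    (15, 0.2120, 0.8093, 0.3095), (16, 0.2122, 0.8088, 0.3196),
    (17, 0.2109, 0.8096, 0.3155), (18, 0.2106, 0.8103, 0.3135),
    (19, 0.2109, 0.8099, 0.3155), (20, 0.2104, 0.8112, 0.3196),
]

best_loss = min(ROWS, key=lambda r: r[1])  # lowest validation loss
best_f1 = max(ROWS, key=lambda r: r[2])    # highest F1
best_acc = max(ROWS, key=lambda r: r[3])   # highest accuracy

# Bounds on the number of training examples, assuming one device,
# no gradient accumulation, and a kept partial final batch.
STEPS_PER_EPOCH = 124
TRAIN_BATCH_SIZE = 32
min_examples = (STEPS_PER_EPOCH - 1) * TRAIN_BATCH_SIZE + 1
max_examples = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE

if __name__ == "__main__":
    print("best validation loss: epoch", best_loss[0], best_loss[1])
    print("best F1:              epoch", best_f1[0], best_f1[2])
    print("best accuracy:        epoch", best_acc[0], best_acc[3])
    print("training examples between", min_examples, "and", max_examples)
```

Validation loss and F1 are both best at epoch 20, while accuracy peaks slightly earlier, at epoch 14 (0.3206 versus 0.3196 at the end of training).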

Framework versions

  • Transformers 4.41.1
  • PyTorch 1.13.1+cu117
  • Datasets 2.19.1
  • Tokenizers 0.19.1