autoevaluator's picture
Add evaluation results on the 3.0.0 config and test split of cnn_dailymail
f067a07
|
raw
history blame
3.35 kB
metadata
language:
  - it
tags:
  - summarization
datasets:
  - ARTeLab/ilpost
metrics:
  - rouge
base_model: gsarti/it5-base
model-index:
  - name: summarization_ilpost
    results:
      - task:
          type: summarization
          name: Summarization
        dataset:
          name: cnn_dailymail
          type: cnn_dailymail
          config: 3.0.0
          split: test
        metrics:
          - type: rouge
            value: 11.9368
            name: ROUGE-1
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMTc3NjJhNTQ2ZDFhODRlZTY2NzkwY2ZlYTg4ODA4NTljYjgzNDIxNjNiYmNhMGU4NGM3ODAyYmViNmE0ODc5MSIsInZlcnNpb24iOjF9.1CCU4dBPTTvC8R2HlTwBBkoxepw_C8m0DaghQwgaMX7pE5Q1G5jU7Pzjmd65HCHrIyam_zrkSuOb4Ka0ctoJCg
          - type: rouge
            value: 3.5366
            name: ROUGE-2
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYTE3NTFkYmFlYWM4NGI4ZDJlYzg3MzBjMTdlN2E5NTUwNjA3MDRlOGMyYTc0ZjkzNTk4MmU5MTgyOWIwZjVmYSIsInZlcnNpb24iOjF9.hxB8DNJYZZAmGhh4ivlsv2yV8p6SmvLQiCHAUIG1Um0XfRIexI9UahNw_MR2RBa_V8VdsInJXwiVuYALlx13DQ
          - type: rouge
            value: 9.9609
            name: ROUGE-L
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjFkNTI1MmQxMjkxN2ZlOWVkYjc1ZjlhMGQ5ZGQ3YmYzZDVlZDU0YTJiNmNhODdmY2MxY2IxNWYwMzE1NTdmNCIsInZlcnNpb24iOjF9.KpFReVOXMkjIhfcPrVO8CkmJwCtHTBvCh2Mx19Fw-aMH96bPcTgPM4GHa4gXOiyUym-olmrY5WV8rsg6ueX9AA
          - type: rouge
            value: 11.2114
            name: ROUGE-LSUM
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjRkMjg3ZDc2ZTAwNDk5ZjE3NTIyNGI4MmI3ZjllZWFkYTczYTFkN2NmYmE5ZmU0Y2E2OGM3OGIxNjA0MzllMSIsInZlcnNpb24iOjF9.NmJtZHXij-oV1E3pkR0GwRwVY5R_eGlJ0emLuY9HfK7iGCESwzMR6fk7Gh1w_15iEdCfaxgh6uE9oOLgFpE_BA
          - type: loss
            value: 8.635225296020508
            name: loss
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGYyZjYzZWZiYzIzNmI0NTEzNGYxNDljODFkMjE1Yzk1NzI2ZDcwYjYzNGVlZjc4NDc0NGM5ZjM2OTgwY2ExMSIsInZlcnNpb24iOjF9.D-g1NamsmrDUgcA20CQ57Mj9tHdQ5bpIjWuGtIy5ZQh_GBN5UN9wWslzm7mYEuPNWwzITR8fMtdBOLJv8xVABA
          - type: gen_len
            value: 18.9985
            name: gen_len
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzcyMmY4ODg3ZTdmOTc4NmRjYTc4MGI0NjBhYmFhMDc4NWUyNWVkMzI1NjU0MzE5MGNhODEyM2IyM2UxNzU4NSIsInZlcnNpb24iOjF9.mWqGs_wVk9CyBkYczl5sJp6YURbGzHE6tx_KNjpRIaF4B-8YfyM9pjrl_Q8kfPGrnPgLrJrURGC26Bza9kYUCw

summarization_ilpost

This model is a fine-tuned version of gsarti/it5-base on IlPost dataset for Abstractive Summarization.

It achieves the following results:

  • Loss: 1.6020
  • Rouge1: 33.7802
  • Rouge2: 16.2953
  • Rougel: 27.4797
  • Rougelsum: 30.2273
  • Gen Len: 45.3175

Usage

from transformers import T5Tokenizer, T5ForConditionalGeneration
tokenizer = T5Tokenizer.from_pretrained("ARTeLab/it5-summarization-ilpost")
model = T5ForConditionalGeneration.from_pretrained("ARTeLab/it5-summarization-ilpost")

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 6
  • eval_batch_size: 6
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 4.0

Framework versions

  • Transformers 4.12.0.dev0
  • Pytorch 1.9.1+cu102
  • Datasets 1.12.1
  • Tokenizers 0.10.3