Edit model card

This is the base version (274MB) of the summarization model for the Spanish language presented in the SEMANTiCS 2022 conference (paper entitled "esT5s: A Spanish Model for Text Summarization"). This model was created in 17 hours (using a single GPU, specifically an NVIDIA v100 16GB) from the multilingual T5 model using the XL-Sum dataset. It achieves a ROUGE-1 value of 25.30 (mT5 achieves 26.21 after a 96h training using 4 GPUs), ROUGE-2 8.21 (mT5 achieves 8.74), and ROUGE-l 20.29 (mT5 achieves 21.06).

Downloads last month
5
Safetensors
Model size
238M params
Tensor type
F32
·
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for oeg/esT5s-base

Finetunes
1 model