Edit model card

Model Card for Model ID

This is a Text-to-Text model for generating headlines and tags from news articles.

Model Details

Model Description

This model is a Text-to-Text model for generating headlines and tags from news articles. This is a mT5 model finetuned on the XL-HeadTags multilingual dataset (read more).

  • Model type: T5
  • Finetuned from model : mT5

Model Sources [optional]

Citation [optional]

@inproceedings{shohan-etal-2024-xl,
    title = "{XL}-{H}ead{T}ags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags",
    author = "Shohan, Faisal  and
      Nayeem, Mir Tafseer  and
      Islam, Samsul  and
      Akash, Abu Ubaida  and
      Joty, Shafiq",
    editor = "Ku, Lun-Wei  and
      Martins, Andre  and
      Srikumar, Vivek",
    booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand and virtual meeting",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-acl.771",
    pages = "12991--13024"
}

Model Card Authors [optional]

Model Card Contact

Faisal Tareque Shohan

Downloads last month
3
Safetensors
Model size
390M params
Tensor type
F32
·
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train faisaltareque/XL-HeadTags-mT5-c5