Edit model card

Contributed by:

  • Shashwath P
  • Shashank Ashok
  • Akilan Yohendiran

Total downloads all time - 1126

Model Card for Model ID

The following model is an experimental fine tuned model of the IDEFIC 9B version, for medical Visual Question Answering. It uses a dataset combined from SLAKE and VQARAD. Check the following repository for the notebooks of training,merging and inference. https://github.com/Shashwathp/Idefic_medical_vqa

Model Description

This is the model card of a 馃 transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: [@Shashwath01,@Akill19,@Shashank91097 ]
  • Model type: [Multimodal, Visual Question Answering]
  • Language(s) (NLP): [English]
  • License: [Apache - 2.0]
  • Finetuned from model [optional]: [IDEFIC 9B]

Dataset

https://huggingface.co/datasets/Shashwath01/VQARAD_SLAKE

Model Sources

How to Get Started with the Model

Check the below link to get started with inferencing. https://github.com/Shashwathp/Idefic_medical_vqa/blob/main/inference.ipynb

Citation

[1] S. Punneshetty, S. Ashok, M. Niranjanamurthy, and S. V. N. Murthy, "Fine Tuning Idefic 9b With LORA for Multimodal Medical VQA," in Proceedings of the 2024 International Conference on Knowledge Engineering and Communication Systems (ICKECS), India, Apr. 2024, pp. 1-8. DOI: 10.1109/ICKECS61492.2024.10616779.

Downloads last month
13
Safetensors
Model size
4.73B params
Tensor type
F32
FP16
U8
Inference API
Unable to determine this model鈥檚 pipeline type. Check the docs .

Spaces using Shashwath01/Idefic_medical_VQA_merged_4bit 3