How can i fine-tune the Qwen2-VL model to make it completely become OCR model

#1
by summon1d - opened

The model used for document information extraction however depends a lot on how to ask questions. So I wondered if there was a way to turn it into an OCR model or if there was a prompt structure to extract the best information..?

Sign up or log in to comment