Analyse any image with Llama3.2
a tutorial for rmbg 1.4
Try PaliGemma on document understanding tasks
4M: Massively Multimodal Masked Modeling