Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
699
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/paligemma-3b-ft-gqa-224
Image-Text-to-Text
•
Updated
Jul 19
google/paligemma-3b-ft-refcoco-seg-224
Image-Text-to-Text
•
Updated
Jul 19
•
406
google/paligemma-3b-ft-scicap-448
Image-Text-to-Text
•
Updated
Jul 19
•
913
google/paligemma-3b-ft-coco35l-224
Image-Text-to-Text
•
Updated
Jul 19
•
598
•
1
google/paligemma-3b-ft-science-qa-448
Image-Text-to-Text
•
Updated
Jul 19
•
13
•
1
google/paligemma-3b-ft-aokvqa-da-224
Image-Text-to-Text
•
Updated
Jul 19
google/paligemma-3b-ft-stvqa-224
Image-Text-to-Text
•
Updated
Jul 19
•
25
google/paligemma-3b-ft-textvqa-896
Image-Text-to-Text
•
Updated
Jul 19
•
23
•
1
google/paligemma-3b-ft-stvqa-448
Image-Text-to-Text
•
Updated
Jul 19
google/paligemma-3b-ft-widgetcap-224
Image-Text-to-Text
•
Updated
Jul 19
•
1
Zery/MV-LLaVA-7B
Image-Text-to-Text
•
Updated
Jun 4
•
13
google/paligemma-3b-ft-ai2d-448
Image-Text-to-Text
•
Updated
Jul 18
•
29
tinyllava/TinyLLaVA-Phi-2-SigLIP-3.1B
Image-Text-to-Text
•
Updated
May 18
•
2.61k
•
12
Trelis/idefics2-8b-chatty-bf16
Image-Text-to-Text
•
Updated
May 15
•
71
•
1
mlx-community/paligemma-3b-mix-224-8bit
Image-Text-to-Text
•
Updated
May 15
•
10
•
3
mlx-community/paligemma-3b-mix-448-8bit
Image-Text-to-Text
•
Updated
May 24
•
15
•
7
tinyllava/TinyLLaVA-Gemma-SigLIP-2.4B
Image-Text-to-Text
•
Updated
May 18
•
838
•
1
gokaygokay/paligemma-docci-transformers
Image-Text-to-Text
•
Updated
May 16
•
4
•
1
gokaygokay/paligemma-rich-captions
Image-Text-to-Text
•
Updated
Jun 15
•
115
•
8
leo009/paligemma-3b-pt-224
Image-Text-to-Text
•
Updated
May 18
•
19
leo009/paligemma-3b-mix-224
Image-Text-to-Text
•
Updated
May 17
•
29
•
1
RichardLuo/Shotluck-Holmes-1.5
Image-Text-to-Text
•
Updated
May 18
•
9
•
2
Xenova/tiny-random-PaliGemmaForConditionalGeneration
Image-Text-to-Text
•
Updated
May 19
•
27
stanrom/ShareGPT4V-7B
Image-Text-to-Text
•
Updated
May 20
•
1
aloobun/F18
Image-Text-to-Text
•
Updated
May 20
ayoubkirouane/moondream2-image-captcha
Image-Text-to-Text
•
Updated
May 22
•
24
•
2
ayoubkirouane/llava-phi3-instruct-Lora
Image-Text-to-Text
•
Updated
May 22
•
4
lamm-mit/Cephalo-Phi-3-vision-128k-4b-alpha
Image-Text-to-Text
•
Updated
Jun 2
•
29
•
6
ayoubkirouane/Idefics2-8b-Finetuned-Lora
Image-Text-to-Text
•
Updated
Jun 13
abhi-8/Age-gender-predictor
Image-Text-to-Text
•
Updated
May 23
•
1
Previous
1
...
12
13
14
15
16
...
24
Next