Edit Models filters

Multimodal

Image-Text-to-Text

Visual Question Answering

Document Question Answering

Computer Vision

Depth Estimation

Image Classification

Object Detection

Image Segmentation

Unconditional Image Generation

Video Classification

Zero-Shot Image Classification

Mask Generation

Zero-Shot Object Detection

Image Feature Extraction

Natural Language Processing

Text Classification

Token Classification

Table Question Answering

Question Answering

Zero-Shot Classification

Feature Extraction

Text Generation

Text2Text Generation

Sentence Similarity

Audio

Automatic Speech Recognition

Audio Classification

Voice Activity Detection

Tabular

Tabular Classification

Tabular Regression

Time Series Forecasting

Reinforcement Learning

Reinforcement Learning

Other

Graph Machine Learning

Models

402

Full-text search

Active filters: image-text-to-text

microsoft/Florence-2-large

Image-Text-to-Text • Updated 8 days ago • 95.9k • 783

qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • Updated 2 days ago • 473 • 61

OpenGVLab/InternVL2-26B

Image-Text-to-Text • Updated about 18 hours ago • 1.41k • 48

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated about 18 hours ago • 1.3k • 20

microsoft/Florence-2-large-ft

Image-Text-to-Text • Updated 8 days ago • 31.5k • 230

OpenGVLab/InternVL2-2B

Image-Text-to-Text • Updated about 18 hours ago • 1.65k • 11

OpenGVLab/InternVL2-40B

Image-Text-to-Text • Updated about 18 hours ago • 14 • 9

llava-hf/llava-v1.6-mistral-7b-hf

Image-Text-to-Text • Updated 11 days ago • 3.36M • 171

vikhyatk/moondream2

Image-Text-to-Text • Updated May 22 • 77.5k • 503

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated about 18 hours ago • 32.9k • 377

HuggingFaceM4/Florence-2-DocVQA

Image-Text-to-Text • Updated about 19 hours ago • 781 • 28

microsoft/Florence-2-base

Image-Text-to-Text • Updated 8 days ago • 35.2k • 100

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • Updated May 30 • 484k • • 535

microsoft/Florence-2-base-ft

Image-Text-to-Text • Updated 8 days ago • 26.1k • 69

OpenGVLab/InternVL2-4B

Image-Text-to-Text • Updated about 18 hours ago • 526 • 6

fal/moondream2-docci-instruct

Image-Text-to-Text • Updated May 10 • 25 • 4

google/paligemma-3b-pt-224

Image-Text-to-Text • Updated 12 days ago • 56.8k • 194

gokaygokay/Florence-2-SD3-Captioner

Image-Text-to-Text • Updated 14 days ago • 747 • 5

FreedomIntelligence/HuatuoGPT-Vision-34B

Image-Text-to-Text • Updated 6 days ago • 5 • 9

liuhaotian/llava-v1.6-34b

Image-Text-to-Text • Updated May 9 • 42.7k • 294

deepseek-ai/deepseek-vl-7b-chat

Image-Text-to-Text • Updated Mar 15 • 5.28k • 205

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • Updated May 30 • 16.5k • • 77

google/paligemma-3b-ft-ocrvqa-896

Image-Text-to-Text • Updated 12 days ago • 774 • 8

google/paligemma-3b-mix-224

Image-Text-to-Text • Updated 12 days ago • 168k • 45

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated about 18 hours ago • 22.9k • 51

openvla/openvla-7b

Image-Text-to-Text • Updated 25 days ago • 18.9k • 44

AIDC-AI/Ovis-Clip-Llama3-8B

Image-Text-to-Text • Updated 24 days ago • 54 • 5

AIDC-AI/Ovis-Clip-Qwen1_5-7B

Image-Text-to-Text • Updated 24 days ago • 31 • 2

AIDC-AI/Ovis-Clip-Qwen1_5-14B

Image-Text-to-Text • Updated 24 days ago • 22 • 3

onnx-community/Florence-2-base-ft

Image-Text-to-Text • Updated 8 days ago • 68 • 11