Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
multimodal
Inference Endpoints
AutoTrain Compatible
custom_code
Merge
text-generation-inference
4-bit precision
text-embeddings-inference
Eval Results
Mixture of Experts
Other with no match
8-bit precision
Carbon Emissions
Apply filters
Models
107
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
qnguyen3/nanoLLaVA-1.5
Image-Text-to-Text
•
Updated
2 days ago
•
829
•
64
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
May 30
•
535k
•
•
536
qnguyen3/nanoLLaVA
Text Generation
•
Updated
10 days ago
•
2.39k
•
129
Isaak-Carter/J.O.S.I.E.v4o
Updated
4 days ago
•
12
openvla/openvla-7b
Image-Text-to-Text
•
Updated
25 days ago
•
20.2k
•
44
onnx-community/nanoLLaVA-1.5
Image-Text-to-Text
•
Updated
1 day ago
•
1
•
2
imageomics/bioclip
Zero-Shot Image Classification
•
Updated
May 17
•
293k
•
30
HuggingFaceM4/idefics-9b
Text Generation
•
Updated
Oct 12, 2023
•
4.88k
•
46
sshh12/Mistral-7B-LoRA-ImageBind-LLAVA
Text Generation
•
Updated
Nov 2, 2023
•
15
•
10
sshh12/Mistral-7B-LoRA-AudioCLAP
Updated
Dec 13, 2023
•
5
•
5
osamaifti/NEXTGPT
Text Generation
•
Updated
Mar 29
•
3
GeorgeBredis/ruIdefics2-ruLLaVA-merged
Image-Text-to-Text
•
Updated
Apr 29
•
281
•
9
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
•
Updated
May 30
•
16.2k
•
•
77
TIGER-Lab/Mantis-8B-siglip-llama3
Updated
May 23
•
7.55k
•
24
openvla/openvla-v01-7b
Image-Text-to-Text
•
Updated
25 days ago
•
97
•
9
chenjoya/videollm-online-8b-v1plus
Updated
14 days ago
•
517
•
5
sujitpal/clip-imageclef
Zero-Shot Image Classification
•
Updated
Oct 31, 2023
•
19
•
3
waybarrios/guidance-based-video-grounding
Updated
Apr 1, 2023
MonoHime/mosei-senti-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
2
MonoHime/mosei-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
1
MonoHime/iemocap-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
2
MonoHime/mosi-senti-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
2
MonoHime/meld-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
2
HuggingFaceM4/idefics-80b
Text Generation
•
Updated
Oct 12, 2023
•
1.06k
•
64
HuggingFaceM4/idefics-9b-instruct
Text Generation
•
Updated
Oct 12, 2023
•
8.18k
•
•
99
HuggingFaceM4/idefics-80b-instruct
Text Generation
•
Updated
Oct 12, 2023
•
1.87k
•
•
177
typeof/idefics-9b
Text Generation
•
Updated
Oct 13, 2023
•
1
sshh12/Mistral-7B-LoRA-VisionCLIP-LLAVA
Text Generation
•
Updated
Oct 28, 2023
•
18
•
9
sshh12/Mistral-7B-LoRA-DocumentGTE-260K-x128
Text Generation
•
Updated
Nov 4, 2023
•
3
•
3
NousResearch/Nous-Hermes-2-Vision-Alpha
Text Generation
•
Updated
Dec 3, 2023
•
3.06k
•
301
Previous
1
2
3
4
Next