Phan Hoang's picture

Phan Hoang

phanhoang

·

AI & ML interests

None yet

Organizations

None yet

phanhoang's activity

upvoted an article 4 days ago

Article

SmolLM - blazingly fast and remarkably powerful

6 days ago

• 150

upvoted 2 collections 10 days ago

Florence

9 items • Updated 11 days ago • 138

MGM

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3 • 46

upvoted an article 10 days ago

Article

Preference Optimization for Vision Language Models

12 days ago

• 18

upvoted an article 12 days ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

17 days ago

• 41

upvoted a collection 12 days ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 29 items • Updated Jun 6 • 257

upvoted a collection 14 days ago

LLaVA - Visual Question Answering

14 items • Updated 3 days ago • 5

upvoted an article 21 days ago

Article

Breaking resolution curse of vision-language models

By

•

Feb 24

• 6

upvoted 2 articles 23 days ago

Article

Vision Language Models Explained

Apr 11

• 138

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

28 days ago

• 142

upvoted a paper 6 months ago

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19 • 54

upvoted a paper 7 months ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 178