Introducing IDEFICS: An Open Reproduction of State-of-the-Art Visual Language Model — Aug 22, 2023
We release Idefics2-chatty, the chatbot-optimized version of Idefics2: HuggingFaceM4/idefics2-8b-chatty. Idefics2-chatty is better at following instructions and at chain-of-thought reasoning. We also release a paper containing many findings on how to build an efficient and performant vision-language model: What matters when building vision-language models? (2405.02246). How are you going to use the model, or what data are you going to fine-tune it on?
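As a minimal sketch of how one might prompt the model: the typed role/content message layout below follows the chat format accepted by the Idefics2 processor in 🤗 Transformers, but the `build_user_turn` helper is hypothetical, and the commented-out loading/generation lines assume the standard `AutoProcessor`/`AutoModelForVision2Seq` API.

```python
# Sketch: building a chat-format request for Idefics2-chatty.
# `build_user_turn` is a hypothetical helper, not part of the library;
# the message structure (role + typed content parts) matches the
# Idefics2 processor's chat template format.

def build_user_turn(text, n_images=1):
    """Build one user turn: n image placeholders followed by a text part."""
    content = [{"type": "image"} for _ in range(n_images)]
    content.append({"type": "text", "text": text})
    return {"role": "user", "content": content}

messages = [build_user_turn("What is in this image?", n_images=1)]

# In practice (requires downloading the 8B checkpoint):
# from transformers import AutoProcessor, AutoModelForVision2Seq
# processor = AutoProcessor.from_pretrained("HuggingFaceM4/idefics2-8b-chatty")
# model = AutoModelForVision2Seq.from_pretrained("HuggingFaceM4/idefics2-8b-chatty")
# prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
# inputs = processor(text=prompt, images=[image], return_tensors="pt")
# generated = model.generate(**inputs, max_new_tokens=128)
```

Each image in a turn gets its own `{"type": "image"}` placeholder, which is how the model accepts an arbitrary number of images per prompt.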
Idefics2 is trained mostly on OBELICS, our open interleaved image-text document dataset. Training on interleaved data is crucial for reaching high performance on VQA tasks, taking an arbitrary number of images as input, and doing in-context learning.
Dataset: HuggingFaceM4/OBELICS
Nomic visualization: https://atlas.nomic.ai/map/f2fba2aa-3647-4f49-a0f3-9347daeee499/ee4a84bd-f125-4bcc-a683-1b4e231cb10f
Link to OBELICS thread: https://twitter.com/HugoLaurencon/status/1694005892839006301
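A sketch of what "interleaved" means here, assuming an OBELICS-style row schema of parallel `images`/`texts` lists where one slot is `None` wherever the other holds a value; the `flatten_document` helper is ours for illustration, not part of the dataset tooling.

```python
# Sketch: flattening one interleaved image-text document into an
# ordered sequence of (kind, value) parts. Assumes parallel `images`
# and `texts` lists, with None marking the absent slot at each position.

def flatten_document(row):
    parts = []
    for image, text in zip(row["images"], row["texts"]):
        if image is not None:
            parts.append(("image", image))
        if text is not None:
            parts.append(("text", text))
    return parts

# Toy document: text, then an image, then more text.
doc = {
    "images": [None, "https://example.com/cat.jpg", None],
    "texts": ["A cat sat on a mat.", None, "It was a tabby."],
}
parts = flatten_document(doc)

# In practice one would stream the real dataset, e.g.:
# from datasets import load_dataset
# ds = load_dataset("HuggingFaceM4/OBELICS", streaming=True, split="train")
```

Preserving this document order is what lets a model learn from text that refers back and forth to nearby images, rather than from isolated caption pairs.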
Datasets: VLM Preferences
- openbmb/RLAIF-V-Dataset
- MMInstruction/VLFeedback
- zhiqings/LLaVA-Human-Preference-10K
- sqrti/SPA-VL