Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • about 14 hours ago • 4
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 10 days ago • 73
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 12 days ago • 124
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 115
Article Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data By frimelle • Jun 3 • 12
Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Mar 15 • 5
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Paper • 2403.09029 • Published Mar 14 • 54