Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models • 11 items • Updated 9 days ago • 322
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 16 days ago • 220
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 16 days ago • 201
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16 • 96
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models • 21 items • Updated 8 days ago • 35
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated 8 days ago • 51
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 9 days ago • 586
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 79
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned variants in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 16 days ago • 340
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20 • 33
Aya 23: Open Weight Releases to Further Multilingual Progress Paper • 2405.15032 • Published May 23 • 26
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published May 24 • 53
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Paper • 2405.17428 • Published May 27 • 16
Article Training and Finetuning Embedding Models with Sentence Transformers v3 • May 28 • 148
Creación de corpus en comunidad Collection Collaborative efforts to build high-quality Spanish-language corpora. Any Spanish speaker can contribute :) • 7 items • Updated Jul 17 • 6
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 45
Critique Models (CM) on the 🤗 Hub Collection This collection gathers Critique Models (CM) for LLM evaluation available on the Hugging Face Hub • 5 items • Updated Sep 2 • 3
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 114
SystemChat Preferences Collection The results of extending `abacusai/SystemChat-1.1` into a preference dataset • 12 items • Updated Apr 30 • 1
Capybara Preferences Collection The results of extending `LDJnr/Capybara` into a preference dataset, using 7B LLMs • 8 items • Updated Apr 17 • 1
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29 • 68
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data Paper • 2404.14367 • Published Apr 22 • 1
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent • Apr 22 • 78
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 251
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study Paper • 2404.10719 • Published Apr 16 • 3
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11 • 29
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Paper • 2404.08801 • Published Apr 12 • 62
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 9 days ago • 676
Zephyr ORPO Collection Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12 • 16
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders Paper • 2404.05961 • Published Apr 9 • 64
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2 • 104
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 16 days ago • 206
Foundation AI Papers Collection A curated list of must-reads on LLM reasoning from the Temus AI team • 135 items • Updated Jun 15 • 25
About ORPO Collection Notes and experiments on fine-tuning LLMs with 🤗 `trl.ORPOTrainer` • 8 items • Updated Sep 2 • 5
ORPO Collection This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12 • 10
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 60
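The ORPO entries above (the paper and the `trl.ORPOTrainer` collection) center on a single idea: instead of training a separate reference model as in DPO, ORPO adds an odds-ratio penalty to the plain SFT loss. A minimal sketch of that relative-preference term, assuming `p_chosen` and `p_rejected` stand for the model's length-normalized sequence probabilities for the preferred and rejected responses (the function name here is illustrative, not from the paper or TRL):

```python
import math

def orpo_odds_ratio_loss(p_chosen: float, p_rejected: float) -> float:
    """Relative-preference term of the ORPO objective (arXiv:2403.07691).

    The odds of a response are p / (1 - p); the term is
    -log sigmoid(log odds(chosen) - log odds(rejected)), which pushes the
    chosen response's odds above the rejected one's without needing a
    frozen reference model.
    """
    odds_chosen = p_chosen / (1.0 - p_chosen)
    odds_rejected = p_rejected / (1.0 - p_rejected)
    log_odds_ratio = math.log(odds_chosen) - math.log(odds_rejected)
    sigmoid = 1.0 / (1.0 + math.exp(-log_odds_ratio))
    return -math.log(sigmoid)

# The full ORPO loss adds this term, scaled by a weight lambda, to the
# ordinary negative log-likelihood (SFT) loss on the chosen response.
```

When the two probabilities are equal the term is log 2 and provides no preference signal; it shrinks toward zero as the chosen response becomes more likely than the rejected one, which is why ORPO can fold alignment into a single SFT-style training run.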