Edit Models filters

arxiv: 2405.07863

Inference Endpoints

AutoTrain Compatible

text-generation-inference

Other with no match

4-bit precision

text-embeddings-inference

8-bit precision

Carbon Emissions

Mixture of Experts

Models

17

Full-text search

Active filters: 2405.07863

RLHFlow/pair-preference-model-LLaMA3-8B

Text Generation • Updated May 24 • 9.7k • 29

Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R

Text Generation • Updated Jun 12 • 224 • 73

qwp4w3hyb/SFR-Iterative-DPO-LLaMA-3-8B-R-iMat-GGUF

Text Generation • Updated May 16 • 195 • 2

sirovub/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF

Text Generation • Updated May 26 • 76 • 1

thesven/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF

Updated 14 days ago • 158 • 1

sirovub/LLaMA3-iterative-DPO-final-GGUF

Text Generation • Updated May 26 • 94 • 1

sfairXC/FsfairX-Gemma2-RM-v0.1

Text Classification • Updated 13 days ago • 65 • 3

Salesforce/LLaMA-3-8B-SFR-SFT-R

Text Generation • Updated May 31 • 24 • 7

Salesforce/LLaMA-3-8B-SFR-RM-R

Text Classification • Updated May 31 • 10 • 9

RLHFlow/LLaMA3-iterative-DPO-final

Text Generation • Updated Jun 12 • 4.6k • 37

RLHFlow/LLaMA3-SFT

Text Generation • Updated May 23 • 4.79k • 5

TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R

Text Generation • Updated May 24 • 24 • 1

Apel-sin/llama-3-8B-iterative-DPO-final-exl2

Updated May 25 • 1

QuantFactory/pair-preference-model-LLaMA3-8B-GGUF

Text Generation • Updated May 26 • 75

OpenRLHF/Llama-3-8b-sft-mixture

Text Generation • Updated Jun 14 • 3.17k

QuantFactory/LLaMA-3-8B-SFR-Iterative-DPO-R-GGUF

Text Generation • Updated Jun 19 • 1.69k • 1

QuantFactory/LLaMA-3-8B-SFR-SFT-R-GGUF

Text Generation • Updated Jun 19 • 431 • 1