Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
arxiv:
2405.07863
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Other with no match
Eval Results
4-bit precision
Merge
text-embeddings-inference
custom_code
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
17
Full-text search
Edit filters
Sort: Trending
Active filters:
2405.07863
Clear all
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
Updated
May 24
•
9.7k
•
29
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
Updated
Jun 12
•
224
•
73
qwp4w3hyb/SFR-Iterative-DPO-LLaMA-3-8B-R-iMat-GGUF
Text Generation
•
Updated
May 16
•
195
•
2
sirovub/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF
Text Generation
•
Updated
May 26
•
76
•
1
thesven/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF
Updated
14 days ago
•
158
•
1
sirovub/LLaMA3-iterative-DPO-final-GGUF
Text Generation
•
Updated
May 26
•
94
•
1
sfairXC/FsfairX-Gemma2-RM-v0.1
Text Classification
•
Updated
13 days ago
•
65
•
3
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
Updated
May 31
•
24
•
7
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
Updated
May 31
•
10
•
9
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
Updated
Jun 12
•
4.6k
•
37
RLHFlow/LLaMA3-SFT
Text Generation
•
Updated
May 23
•
4.79k
•
5
TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R
Text Generation
•
Updated
May 24
•
24
•
1
Apel-sin/llama-3-8B-iterative-DPO-final-exl2
Updated
May 25
•
1
QuantFactory/pair-preference-model-LLaMA3-8B-GGUF
Text Generation
•
Updated
May 26
•
75
OpenRLHF/Llama-3-8b-sft-mixture
Text Generation
•
Updated
Jun 14
•
3.17k
QuantFactory/LLaMA-3-8B-SFR-Iterative-DPO-R-GGUF
Text Generation
•
Updated
Jun 19
•
1.69k
•
1
QuantFactory/LLaMA-3-8B-SFR-SFT-R-GGUF
Text Generation
•
Updated
Jun 19
•
431
•
1