UCLA-AGI's Collections
zephyr-7b-sft-full-SPIN
datasets-SPIN
SPIN-Diffusion
SPPO
SPPO
Updated 6 days ago
Self-Play Preference Optimization
UCLA-AGI/Mistral7B-PairRM-SPPO • Text Generation • Updated May 7 • 4 • 6
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1 • Text Generation • Updated May 6 • 3 • 1
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2 • Text Generation • Updated May 6 • 3 • 1
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3 • Text Generation • Updated May 7 • 6 • 5
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1 • Text Generation • Updated 11 days ago • 10
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2 • Text Generation • Updated 11 days ago • 1
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3 • Text Generation • Updated 7 days ago • 1.53k • 54
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 • Text Generation • Updated 4 days ago • 342 • 51
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2 • Text Generation • Updated 4 days ago • 5 • 2
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1 • Text Generation • Updated 4 days ago • 32 • 2