argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 3.42k • 108
cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental Viewer • Updated about 20 hours ago • 29.2k • 172 • 15
argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 20 • 5
NickyNicky/neovalle_H4rmony_dpo_translated_English_to_Spanish Viewer • Updated May 17 • 2.02k • 13 • 4
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 4 • 4
Mitsuki-Sakamoto/hh-rlhf-reward-model-deberta-v3-large-v2-helpful-2-original_mix_50_random_seed_2 Viewer • Updated Jun 8 • 46.2k • 18 • 1
vwxyzjn/summarize_from_feedback_oai_preprocessing_1706381144 Viewer • Updated Jan 27 • 179k • 2.24k • 1
insub/imdb_prefix20_forDPO_gpt2-large-imdb-FT_siebert_sentiment-roberta-large-english Viewer • Updated Oct 22, 2023 • 50k • 39 • 1