Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
2
Arash Ahmadian
ArashAhmadian
Follow
shuyuej's profile picture
asusevski's profile picture
GangGreenTemperTatumCohere's profile picture
10 followers
·
0 following
aahmadian_
AI & ML interests
None yet
Articles
Putting RL back in RLHF
24 days ago
•
51
Organizations
Papers
3
arxiv:
2406.01660
arxiv:
2402.14740
arxiv:
2309.05444
models
12
Sort: Recently updated
ArashAhmadian/rloo_1B_tldr
Text Generation
•
Updated
25 days ago
•
4
ArashAhmadian/rloo_tldr_final
Text Generation
•
Updated
26 days ago
•
1
ArashAhmadian/rloo_tldr
Text Generation
•
Updated
26 days ago
•
122
ArashAhmadian/ppo_6.9b_new
Text Generation
•
Updated
28 days ago
•
1
ArashAhmadian/rloo_6.9b_new
Text Generation
•
Updated
28 days ago
•
1
ArashAhmadian/rloo_7b_f
Feature Extraction
•
Updated
30 days ago
•
3
ArashAhmadian/ppo_rloo_bp_7b
Feature Extraction
•
Updated
30 days ago
•
8
ArashAhmadian/rloo_tldr_6.9b_defaultclip_512bs_05kl
Text Generation
•
Updated
Jun 4
•
1
ArashAhmadian/rloo_tldr_6.9b_noratioclip
Text Generation
•
Updated
Jun 1
•
1
ArashAhmadian/rloo_tldr_6.9b_ds2
Text Generation
•
Updated
May 30
•
1
Expand 12 models
datasets
None public yet