Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
BiXie
/
next
like
0
License:
apache-2.0
Model card
Files
Files and versions
Community
main
next
/
trl
/
trainer
/
__pycache__
1 contributor
History:
1 commit
BiXie
Upload 204 files
252711e
verified
about 1 month ago
__init__.cpython-39.pyc
1.06 kB
Upload 204 files
about 1 month ago
base.cpython-39.pyc
1.79 kB
Upload 204 files
about 1 month ago
ddpo_config.cpython-39.pyc
3.29 kB
Upload 204 files
about 1 month ago
ddpo_trainer.cpython-39.pyc
18.5 kB
Upload 204 files
about 1 month ago
dpo_trainer.cpython-39.pyc
36.8 kB
Upload 204 files
about 1 month ago
iterative_sft_trainer.cpython-39.pyc
12.3 kB
Upload 204 files
about 1 month ago
model_config.cpython-39.pyc
2.88 kB
Upload 204 files
about 1 month ago
ppo_config.cpython-39.pyc
4.28 kB
Upload 204 files
about 1 month ago
ppo_trainer.cpython-39.pyc
42.9 kB
Upload 204 files
about 1 month ago
reward_config.cpython-39.pyc
1.25 kB
Upload 204 files
about 1 month ago
reward_trainer.cpython-39.pyc
9.76 kB
Upload 204 files
about 1 month ago
sft_trainer.cpython-39.pyc
16.6 kB
Upload 204 files
about 1 month ago
utils.cpython-39.pyc
23.8 kB
Upload 204 files
about 1 month ago