DarshanDeshpande
/

gemma_2b_oasst1_ppo_model

Reinforcement Learning

Inference Endpoints

Model card Files Files and versions Community

gemma_2b_oasst1_ppo_model

1 contributor

History: 3 commits

DarshanDeshpande's picture

DarshanDeshpande

Push model using huggingface_hub.

342c126 verified 7 months ago

.gitattributes

1.57 kB

Push model using huggingface_hub. 7 months ago
README.md

1.37 kB

Push model using huggingface_hub. 7 months ago
adapter_config.json

610 Bytes

Push model using huggingface_hub. 7 months ago
adapter_model.safetensors

3.7 MB
LFS

Push model using huggingface_hub. 7 months ago
config.json

1.27 kB

Push model using huggingface_hub. 7 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
What is a pickle import?
9.72 kB
LFS

Push model using huggingface_hub. 7 months ago
special_tokens_map.json

441 Bytes

Push model using huggingface_hub. 7 months ago
tokenizer.json
17.5 MB
LFS

Push model using huggingface_hub. 7 months ago
tokenizer.model

4.24 MB
LFS

Push model using huggingface_hub. 7 months ago
tokenizer_config.json

1.13 kB

Push model using huggingface_hub. 7 months ago