Edit model card

ProofGPT-v0.1

Model Description

ProofGPT-v0.1 is a 1.3B parameter language model based on the GPT-NeoX architecture and trained on the proof-pile (v1.1). We initiailized training with pythia-1.3b weights, a precursor to the pythia-1.4b model that has roughly equivalent performance.

Detailed evaluations coming soon :)

Note: Commit 3bcdc4e replaced the weights with a model trained on proof-pile v1.1, as opposed to previous commits which were trained on v1.0. Commit 9695b51 updated the tokenizer to have bos, eos, and unk tokens.

Downloads last month
1,127
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train hoskinson-center/proofGPT-v0.1