Training in progress, step 100
- README.md +5 -10
- adapter_model.safetensors +1 -1
README.md CHANGED

@@ -2,7 +2,6 @@
 license: apache-2.0
 library_name: peft
 tags:
-- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -69,10 +68,10 @@ lora_target_modules:
 - o_proj
 
 
-
-
-
-
+wandb_project: function-call
+wandb_name: mixtral-instruct-lora--v1
+wandb_log_model: end
+hub_model_id: dyang415/mixtral-pb-20e
 
 
 gradient_accumulation_steps: 2
@@ -111,7 +110,7 @@ fsdp_config:
 
 # mixtral-pb-20e
 
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on an unknown dataset.
 
 ## Model description
 
@@ -157,10 +156,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 20
 
-### Training results
-
-
-
 ### Framework versions
 
 - PEFT 0.7.0
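The `gradient_accumulation_steps: 2` setting in the config above fixes the effective batch size per optimizer step. A minimal sketch of that arithmetic — the micro-batch size of 4 and the GPU count are hypothetical illustrations, not values shown in this diff:

```python
def effective_batch_size(micro_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_gpus: int = 1) -> int:
    # Gradients from several micro-batches are accumulated before each
    # weight update; data-parallel replicas multiply the total again.
    return micro_batch_size * gradient_accumulation_steps * num_gpus

# Assuming a micro-batch of 4 on one GPU, with the accumulation of 2 above:
print(effective_batch_size(4, 2))  # -> 8
```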
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:87ba511a3f83c87a3c172e1dd73a924b4cf84def31d05067854b0ce64ddcb551
 size 27297032
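The `adapter_model.safetensors` entry above is a git-lfs pointer file, not the weights themselves: it records only the blob's sha256 `oid` and its byte `size`. A minimal sketch of parsing such a pointer and checking a downloaded blob against it — the helper names here are illustrative, not part of any library API:

```python
import hashlib

def parse_lfs_pointer(text: str) -> dict:
    """Parse a git-lfs pointer file into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check a downloaded blob against the pointer's size and sha256 oid."""
    if len(data) != int(pointer["size"]):
        return False
    digest = hashlib.sha256(data).hexdigest()
    return pointer["oid"] == f"sha256:{digest}"

# The pointer content from the new side of the diff above:
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:87ba511a3f83c87a3c172e1dd73a924b4cf84def31d05067854b0ce64ddcb551
size 27297032
"""

pointer = parse_lfs_pointer(pointer_text)
```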