---
license: mit
datasets:
- netcat420/MFANN
language:
- en
pipeline_tag: text-generation
---

Task vector optimization checkpoint, ready for merging.

Trained on MFANN for 12,000 steps. Because of a slightly higher training loss, I'm going to merge this model with the last version and retrain. The goal was to use DARE-TIES to reduce the number of parameters used per task vector. This model will now be merged, using TIES alone, with the last model from before DARE was applied, and the result will subsequently be retrained.
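For readers unfamiliar with the terms, here is a minimal sketch of the two ideas behind DARE-TIES on toy lists of numbers. It is an illustration under stated assumptions, not the merge tooling or settings used for this model: the function names (`task_vector`, `dare`, `ties_merge`), the drop rate, and the toy values are all hypothetical, and the magnitude-trimming step of plain TIES is left out for brevity.

```python
import random


def task_vector(finetuned, base):
    """Task vector: element-wise difference between fine-tuned and base weights."""
    return [f - b for f, b in zip(finetuned, base)]


def dare(delta, drop_rate, rng):
    """DARE-style sparsification (sketch): randomly drop a fraction of the
    task-vector entries and rescale the survivors by 1 / (1 - drop_rate)."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    return [0.0 if rng.random() < drop_rate else d * scale for d in delta]


def ties_merge(deltas):
    """TIES-style merge (sketch, trimming omitted): for each parameter, elect
    the sign of the summed values, then average only the non-zero entries
    that agree with that sign."""
    merged = []
    for values in zip(*deltas):
        elected_positive = sum(values) >= 0
        agreeing = [v for v in values if v != 0 and (v > 0) == elected_positive]
        merged.append(sum(agreeing) / len(agreeing) if agreeing else 0.0)
    return merged


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the toy run is repeatable
    base = [0.0, 0.0, 0.0, 0.0]  # hypothetical base weights
    model_a = [0.4, -0.2, 0.1, 0.0]  # hypothetical fine-tuned weights
    model_b = [0.2, 0.3, -0.1, 0.5]
    sparse = [dare(task_vector(m, base), 0.5, rng) for m in (model_a, model_b)]
    merged_delta = ties_merge(sparse)
    merged = [b + d for b, d in zip(base, merged_delta)]
    print(merged)
```

In this sketch, DARE is what reduces the number of parameters each task vector touches, while TIES only resolves sign conflicts between vectors. Merging "using TIES alone" corresponds to calling `ties_merge` on the unsparsified task vectors.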