license: apache-2.0 | |
MiniSymp2 is A retrain of my MiniSymposium model attempt except with some more data and better practices. | |
- added EOS tokens where they belong | |
- made the prompt formats more diverse in the data so you could experiment / play with prompt format in context | |
- added some new examples | |
- measured loss curve to make sure I wasn't overfitting | |
- used 8-bit lora instead of 4-bit qlora |