Safetensors
llama
AuriAetherwiing's picture
Update README.md
28e06c0 verified
metadata
license: apache-2.0
datasets:
  - AuriAetherwiing/Allura
  - kalomaze/Opus_Instruct_25k
base_model:
  - AuriAetherwiing/Yi-1.5-9B-32K-tokfix

EVA Yi 1.5 9B v1

A RP/storywriting focused model, full-parameter finetune of Yi-1.5-9B-32K on mixture of synthetic and natural data.
A continuation of nothingiisreal's Celeste 1.x series, made to improve stability and versatility, without losing unique, diverse writing style of Celeste.

Quants: (GGUF is not recommended, lcpp breaks tokenizer fix)

We recommend using original BFloat16 weights, quantization seems to affect Yi significantly more than other model architectures.

Prompt format is ChatML.

Recommended sampler values:

  • Temperature: 1
  • Min-P: 0.05

Recommended SillyTavern presets (via CalamitousFelicitousness):


Training data:

  • Celeste 70B 0.1 data mixture minus Opus Instruct subset. See that model's card for details.
  • Kalomaze's Opus_Instruct_25k dataset, filtered for refusals.

Hardware used:

  • 4x3090Ti for 5 days.

Model was trained by Kearm and Auri.

Special thanks:

  • to Lemmy, Gryphe, Kalomaze and Nopm for the data
  • to ALK, Fizz and CalamitousFelicitousness for Yi tokenizer fix
  • and to InfermaticAI's community for their continued support for our endeavors