ShortCircuit-MoE / mergekit_moe_config.yml

My greatest creation. Soon I will be working on a new company project, so I will have to put you on hold. This model has a personality.

338362c verified 2 months ago

	base_model: TroyDoesAI/ShortCircuit
	gate_mode: hidden # one of "hidden", "cheap_embed", or "random"

	# There are three methods for populating the MoE gates implemented.

	# "hidden"
	# Uses the hidden state representations of the positive/negative prompts for MoE gate parameters. Best quality and most effective option; the default. Requires evaluating each prompt using the base model so you might not be able to use this on constrained hardware (depending on the model). You can use --load-in-8bit or --load-in-4bit to reduce VRAM usage.

	# "cheap_embed"
	# Uses only the raw token embedding of the prompts, using the same gate parameters for every layer. Distinctly less effective than "hidden". Can be run on much, much lower end hardware.

	# "random"
	# Randomly initializes the MoE gates. Good if you plan to fine-tune the model afterwards, or maybe if you want something a little unhinged? I won't judge.

	experts:
	  - source_model: TroyDoesAI/ShortCircuit
	    positive_prompts:
	      - "hello"
	      - "you"
	      - "assistant"
	      - "what"
	      - "when"
	      - "where"
	      - "why"
	      - "how"

	  - source_model: TroyDoesAI/Undo-Woke-Mind-Virus
	    positive_prompts:
	      - "how"
	      - "who"
	      - "politic"
	      - "conservative"
	      - "!woke"
	      - "!toxic"

	  - source_model: TroyDoesAI/BlackSheep-RP-8B
	    positive_prompts:
	      - "think"
	      - "story"

	  - source_model: TroyDoesAI/Unbiased-Personality
	    positive_prompts:
	      - "describe"
	      - "tell"
	      - "say"
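
A note on the "!woke" and "!toxic" entries: inside a quoted YAML string the leading `!` is just a character, so those prompts are embedded literally like any other positive prompt. If the intent was negation (an assumption, the config does not say), mergekit's MoE config also accepts an optional `negative_prompts` list per expert, which the gate-mode comments above refer to as the "positive/negative prompts". A minimal sketch of how that expert entry might look under that assumption follows; the split of prompts between the two lists is illustrative, not the author's actual setup, and it would produce different gate values from the config above.

```yaml
# Hypothetical variant of the second expert entry (assumes "!" was meant as negation).
experts:
  - source_model: TroyDoesAI/Undo-Woke-Mind-Virus
    positive_prompts:
      - "how"
      - "who"
      - "politic"
      - "conservative"
    # Optional: prompts this expert should be steered away from.
    negative_prompts:
      - "woke"
      - "toxic"
```

With `gate_mode: hidden`, both lists are evaluated through the base model, so adding negative prompts increases the number of prompts that have to be run during the merge.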