Kimiko-Mistral-7B-fp16 / README.md

TheBloke

Update README.md

d9fa03f about 1 year ago

preview code

raw

history blame contribute delete

No virus

7.52 kB

	---
	base_model: nRuaif/Kimiko-Mistral-7B
	inference: false
	license: apache-2.0
	model-index:
	- name: Kimiko-Mistral-7B
	results: []
	model_creator: nRuaif
	model_name: Kimiko Mistral 7B
	model_type: mistral
	prompt_template: 'You are a helpful AI assistant.


	USER: {prompt}

	ASSISTANT:
	'
	tags:
	- generated_from_trainer
	---

	<!-- header start -->
	<!-- 200823 -->
	<div style="width: auto; margin-left: auto; margin-right: auto">
	<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
	</div>
	<div style="display: flex; justify-content: space-between; width: 100%;">
	<div style="display: flex; flex-direction: column; align-items: flex-start;">
	<p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://discord.gg/theblokeai">Chat & support: TheBloke's Discord server</a></p>
	</div>
	<div style="display: flex; flex-direction: column; align-items: flex-end;">
	<p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
	</div>
	</div>
	<div style="text-align:center; margin-top: 0em; margin-bottom: 0em"><p style="margin-top: 0.25em; margin-bottom: 0em;">TheBloke's LLM work is generously supported by a grant from <a href="https://a16z.com">andreessen horowitz (a16z)</a></p></div>
	<hr style="margin-top: 1.0em; margin-bottom: 1.0em;">
	<!-- header end -->

	# Kimiko Mistral 7B - FP16
	- Model creator: [nRuaif](https://huggingface.co/nRuaif)
	- Original model: [Kimiko Mistral 7B](https://huggingface.co/nRuaif/Kimiko-Mistral-7B)

	<!-- description start -->
	## Description

	This repo contains pytorch format fp16 model files for [nRuaif's Kimiko Mistral 7B](nRuaif/Kimiko-Mistral-7B).

	It is the result of either merging a LoRA, or converting the source repository to float16.

	<!-- description end -->
	<!-- repositories-available start -->
	## Repositories available

	* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-AWQ)
	* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GPTQ)
	* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GGUF)
	* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-fp16)
	* [nRuaif's original LoRA adapter, which can be merged on to the base model.](https://huggingface.co/nRuaif/Kimiko-Mistral-7B)

	<!-- repositories-available start -->

	<!-- prompt-template start -->
	## Prompt template: Vicuna-Short

	```
	You are a helpful AI assistant.

	USER: {prompt}
	ASSISTANT:

	```

	<!-- prompt-template end -->




	<!-- footer start -->
	<!-- 200823 -->
	## Discord

	For further support, and discussions on these models and AI in general, join us at:

	[TheBloke AI's Discord server](https://discord.gg/theblokeai)

	## Thanks, and how to contribute

	Thanks to the [chirper.ai](https://chirper.ai) team!

	Thanks to Clay from [gpus.llm-utils.org](llm-utils)!

	I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

	If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

	Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

	* Patreon: https://patreon.com/TheBlokeAI
	* Ko-Fi: https://ko-fi.com/TheBlokeAI

	Special thanks to: Aemon Algiz.

	Patreon special mentions: Pierre Kircher, Stanislav Ovsiannikov, Michael Levine, Eugene Pentland, Andrey, 준교 김, Randy H, Fred von Graf, Artur Olbinski, Caitlyn Gatomon, terasurfer, Jeff Scroggin, James Bentley, Vadim, Gabriel Puliatti, Harry Royden McLaughlin, Sean Connelly, Dan Guido, Edmond Seymore, Alicia Loh, subjectnull, AzureBlack, Manuel Alberto Morcote, Thomas Belote, Lone Striker, Chris Smitley, Vitor Caleffi, Johann-Peter Hartmann, Clay Pascal, biorpg, Brandon Frisco, sidney chen, transmissions 11, Pedro Madruga, jinyuan sun, Ajan Kanaga, Emad Mostaque, Trenton Dambrowitz, Jonathan Leane, Iucharbius, usrbinkat, vamX, George Stoitzev, Luke Pendergrass, theTransient, Olakabola, Swaroop Kallakuri, Cap'n Zoog, Brandon Phillips, Michael Dempsey, Nikolai Manek, danny, Matthew Berman, Gabriel Tamborski, alfie_i, Raymond Fosdick, Tom X Nguyen, Raven Klaugh, LangChain4j, Magnesian, Illia Dulskyi, David Ziegler, Mano Prime, Luis Javier Navarrete Lozano, Erik Bjäreholt, 阿明, Nathan Dryer, Alex, Rainer Wilmers, zynix, TL, Joseph William Delisle, John Villwock, Nathan LeClaire, Willem Michiel, Joguhyik, GodLy, OG, Alps Aficionado, Jeffrey Morgan, ReadyPlayerEmma, Tiffany J. Kim, Sebastain Graf, Spencer Kim, Michael Davis, webtim, Talal Aujan, knownsqashed, John Detwiler, Imad Khwaja, Deo Leter, Jerry Meng, Elijah Stavena, Rooh Singh, Pieter, SuperWojo, Alexandros Triantafyllidis, Stephen Murray, Ai Maven, ya boyyy, Enrico Ros, Ken Nordquist, Deep Realms, Nicholas, Spiking Neurons AB, Elle, Will Dee, Jack West, RoA, Luke @flexchar, Viktor Bowallius, Derek Yates, Subspace Studios, jjj, Toran Billups, Asp the Wyvern, Fen Risland, Ilya, NimbleBox.ai, Chadd, Nitin Borwankar, Emre, Mandus, Leonard Tan, Kalila, K, Trailburnt, S_X, Cory Kujawski


	Thank you to all my generous patrons and donaters!

	And thank you again to a16z for their generous grant.

	<!-- footer end -->

	# Original model card: nRuaif's Kimiko Mistral 7B


	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
	# Kimiko-Mistral-7B

	This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the Kimiko dataset.
	It achieves the following results on the evaluation set:
	- Loss: 2.1173

	## Model description

	Same dataset as Kimiko-v2 but on new model. THIS IS NOT TRAIN ON V3 DATASET

	## Intended uses & limitations

	As a finetuning experiment on new 7B model. You can use this for roleplay or as an assistant

	# Prompt Template Structure
	```
	This is a chat between ASSISTANT and USER
	USER: What is 4x8?
	ASSISTANT:

	```


	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 0.00005
	- train_batch_size: 4
	- eval_batch_size: 4
	- seed: 42
	- gradient_accumulation_steps: 16
	- total_train_batch_size: 64
	- optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-05
	- lr_scheduler_type: cosine
	- lr_scheduler_warmup_steps: 10
	- num_epochs: 2

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \|
	\|:-------------:\|:-----:\|:----:\|:---------------:\|
	\| 1.5675 \| 0.47 \| 25 \| 2.1323 \|
	\| 1.4721 \| 0.95 \| 50 \| 2.1209 \|
	\| 1.472 \| 1.42 \| 75 \| 2.1177 \|
	\| 1.5445 \| 1.9 \| 100 \| 2.1173 \|


	### Framework versions

	- Transformers 4.34.0.dev0
	- Pytorch 2.0.1+cu118
	- Datasets 2.14.5
	- Tokenizers 0.14.0