jeff31415
/

TinyLlama-1.1B-1.5T-OpenOrca-Alpha

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

TinyLlama-1.1B-1.5T-OpenOrca-Alpha / README.md

jeff31415's picture

Update README.md

c337449 11 months ago

|

history blame contribute delete

No virus

1.24 kB

	---
	license: apache-2.0
	datasets:
	- Open-Orca/OpenOrca
	- bigcode/starcoderdata
	- cerebras/SlimPajama-627B
	language:
	- en
	---
	[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)

	#### Base model:
	https://huggingface.co/TinyLlama/tinyLlama-intermediate-checkpoints/tree/step-720k-token-1510B
	This fine tune was done on the "early" version of tinyllama-1.5T which suffers from a bug in dataset processing. See https://github.com/jzhang38/TinyLlama/issues/67.
	Through it suffers from the glitch, its performance seems not being damaged and still showing improvement(metrics needed)

	#### Dataset:
	Fine tuned on OpenOrca GPT4 subset for 1 epoch,Using CHATML format

	#### Model License:
	Apache 2.0, following the TinyLlama base model.

	#### Quantisation:
	GGUF format:https://huggingface.co/s3nh/jeff31415-TinyLlama-1.1B-1.5T-OpenOrca-Alpha-GGUF

	#### Hardware and training details:
	Hardware: 1*RTX A5000, ~16 hours to complete 1 epoch. GPU from autodl.com, cost around $3 for this finetuning.
	https://wandb.ai/jeff200402/TinyLlama-1.5T-alpha-Orca?workspace= for more details.