Download a Model

cortex models pull tinyllama

Start the Engine

cortex engines

Start the Model

cortex models start tinyllama

Chat

cortex chat

Model Structure

1.1b-chat-v1.0-gguf-Q2_K
1.1b-chat-v1.0-gguf-Q4_K_M
1.1b-chat-v1.0-gguf-Q6_K
1.1b-chat-v1.0-gguf-Q8_0
1.1b-chat-v1.0-tensorrtllm-windows-ada-int4
1.1b-chat-v1.0-tensorrtllm-windows-turing-int4
1.1b-chat-v1.0-tensorrtllm-windows-ampere-int4
1.1b-chat-v1.0-tensorrtllm-windows-ada-fp16
1.1b-chat-v1.0-tensorrtllm-windows-turing-fp16
1.1b-chat-v1.0-tensorrtllm-windows-ampere-fp16
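
The variant names above appear to follow a regular pattern, so a small helper can split one into labeled parts when choosing a build. This is a minimal sketch: the field meanings (size, tuning, version, format, then either a quantization level or an OS, GPU architecture, and precision) are an assumption read off the names themselves, not a documented scheme, and `describe_variant` is a hypothetical helper that is not part of the cortex CLI.

```shell
#!/bin/sh
# Hypothetical helper: split a variant name from the list above on "-".
# The field labels are assumptions inferred from the names, not an official spec.
describe_variant() {
  name=$1
  old_ifs=$IFS
  IFS=-
  set -f            # disable globbing while word-splitting the name
  set -- $name      # positional parameters now hold the dash-separated parts
  set +f
  IFS=$old_ifs

  case "${4:-}" in
    gguf)
      # e.g. 1.1b-chat-v1.0-gguf-Q4_K_M
      printf 'size=%s tuning=%s version=%s format=%s quant=%s\n' \
        "$1" "$2" "$3" "$4" "${5:-}"
      ;;
    tensorrtllm)
      # e.g. 1.1b-chat-v1.0-tensorrtllm-windows-ada-int4
      printf 'size=%s tuning=%s version=%s format=%s os=%s gpu=%s precision=%s\n' \
        "$1" "$2" "$3" "$4" "${5:-}" "${6:-}" "${7:-}"
      ;;
    *)
      printf 'unrecognized variant: %s\n' "$name" >&2
      return 1
      ;;
  esac
}

describe_variant "1.1b-chat-v1.0-gguf-Q4_K_M"
# prints: size=1.1b tuning=chat version=v1.0 format=gguf quant=Q4_K_M
describe_variant "1.1b-chat-v1.0-tensorrtllm-windows-ada-int4"
# prints: size=1.1b tuning=chat version=v1.0 format=tensorrtllm os=windows gpu=ada precision=int4
```

Read this way, the GGUF variants differ only in quantization level, while the TensorRT-LLM variants are all Windows builds that differ by GPU architecture and precision.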