TheBloke/koala-13B-GGML


TheBloke committed on Apr 9, 2023

Commit c7b8f3c • 1 Parent(s): 55d7e18

Update README.md

Files changed (1)

README.md +8 -4


@@ -6,13 +6,17 @@ This repo contains the weights of the Koala 13B model produced at Berkeley. It i
 This version has then been quantized to 4-bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa) and then converted to GGML for use with [llama.cpp](https://github.com/ggerganov/llama.cpp).
-## Other Koala repos
-I have also made these other Koala models available:
 * [Unquantized 13B model in HF format](https://huggingface.co/TheBloke/koala-13B-HF)
-* [GPTQ quantized 4bit 7B model in `pt` and `safetensors` formats](https://huggingface.co/TheBloke/koala-7B-4bit-128g)
 * [Unquantized 7B model in HF format](https://huggingface.co/TheBloke/koala-7B-HF)
 * [Unquantized 7B model in GGML format for llama.cpp](https://huggingface.co/TheBloke/koala-7b-ggml-unquantized)
 ## How to run in `llama.cpp`

 This version has then been quantized to 4-bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa) and then converted to GGML for use with [llama.cpp](https://github.com/ggerganov/llama.cpp).
+## My Koala repos
+I have the following Koala model repositories available:
+**13B models:**
 * [Unquantized 13B model in HF format](https://huggingface.co/TheBloke/koala-13B-HF)
+* [GPTQ quantized 4bit 13B model in `pt` and `safetensors` formats](https://huggingface.co/TheBloke/koala-13B-GPTQ-4bit-128g)
+* [GPTQ quantized 4bit 13B model in GGML format for `llama.cpp`](https://huggingface.co/TheBloke/koala-13B-GPTQ-4bit-128g-GGML)
+**7B models:**
 * [Unquantized 7B model in HF format](https://huggingface.co/TheBloke/koala-7B-HF)
 * [Unquantized 7B model in GGML format for llama.cpp](https://huggingface.co/TheBloke/koala-7b-ggml-unquantized)
+* [GPTQ quantized 4bit 7B model in `pt` and `safetensors` formats](https://huggingface.co/TheBloke/koala-7B-GPTQ-4bit-128g)
+* [GPTQ quantized 4bit 7B model in GGML format for `llama.cpp`](https://huggingface.co/TheBloke/koala-7B-GPTQ-4bit-128g-GGML)
 ## How to run in `llama.cpp`