请教两个问题

#1
by snuffcn - opened

一个是量化的GGUF模型中近期常见一种标名为i1的模型,是什么技术,有什么特点?
二是像Qwen/Qwen2.5-72B-Instruct-GGUF这样的模型由多个模型文件组成,如何使用ollama来创建。
Ollama的modelfile文件如何from这种多文件?我使用copy /B 把文件先连接起来变成单个大文件是可以创建成功的,但好像是不能用。
返回为EOF
谢谢

Hi, I only really do english, but I will try to answer some of your questions. As for quants made of multiple files, you indeed need to concatenate them - make sure to only concatenate the files belonging to the same quant (e.g. Q4_K_M). As for the error, I have no clue - the last time I used DOS commands was when it was still called MS-DOS :)

As for ollama, I don't know what needs to be done, you might have to refer to the ollama documentation. For most llama.cpp frontends, you only need to specify the model file itself somehow.

mradermacher changed discussion status to closed

Sign up or log in to comment