Can't find a way to make it work with llama.cpp

#102

by ZeroWw - opened Jun 16

Jun 16

I'm trying to use gemma-7b with llama.cpp
I converted the model to gguf.
As I start the server and try to chat, the model answers correctly the first time (but very shortly) then starts talking to itself :(
Any idea?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment