I'm trying to use gemma-7b with llama.cpp. I converted the model to GGUF. When I start the server and try to chat, the model answers correctly the first time (though very briefly), then starts talking to itself :( Any idea?