🚩 No files available

by alvarobartt HF staff - opened Feb 22

Discussion

alvarobartt

Feb 22

Hi here @mlabonne !

Nice and fast GGUF quantized weights of Google's Gemma models, just flagging that there are no files available here.

mlabonne

Owner Feb 22

Ah maybe don't though, I'm just waiting for llama.cpp to fix the inference. :( I can upload the files but then people will flag it because it doesn't work haha

alvarobartt

Feb 22

Oh fair! Indeed I'm using it now and seems to be working fine so far with llama-cpp-python, I'm filling a PR to add the gemma formatting

alvarobartt changed discussion status to closed Feb 22

alvarobartt

Feb 22

Pasting it here for reference in case that's useful to you @mlabonne https://github.com/abetlen/llama-cpp-python/pull/1210

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment