Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models.
This model was converted to MLX format from google/gemma-7b.
Refer to the original model card for more details on the model.
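Install the mlx-lm package from the shell:
pip install mlx-lm
Then load the model and generate text in Python:
from mlx_lm import load, generate
# Load the converted weights and tokenizer from the Hugging Face Hub
model, tokenizer = load("mlx-community/quantized-gemma-7b")
# Generate a completion; verbose=True prints the output as it is produced
response = generate(model, tokenizer, prompt="hello", verbose=True)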