--- license: other library_name: transformers tags: - mlx widget: - text: 'user How does the brain work? model ' inference: parameters: max_new_tokens: 200 extra_gated_heading: Access Gemma on Hugging Face extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review and agree to Google’s usage license. To do this, please ensure you’re logged-in to Hugging Face and click below. Requests are processed immediately. extra_gated_button_content: Acknowledge license license_name: gemma-terms-of-use license_link: https://ai.google.dev/gemma/terms --- # mlx-community/quantized-gemma-7b-it This model was converted to MLX format from [`google/gemma-7b-it`](). Refer to the [original model card](https://huggingface.co./google/gemma-7b-it) for more details on the model. ## Use with mlx ```bash pip install mlx-lm ``` ```python from mlx_lm import load, generate model, tokenizer = load("mlx-community/quantized-gemma-7b-it") response = generate(model, tokenizer, prompt="hello", verbose=True) ```