Gemma Llama.cpp

Google released Gemma 3, a family of multimodal models that offers advanced capabilities like large context and multilingual support. This interactive chat interface allows you to experiment with the gemma-3-1b-it text model using various prompts and generation parameters. Users can select different model variants (GGUF format), system prompts, and observe generated responses in real-time. Key generation parameters, such as ⁣temperature, max_tokens, top_k and others are exposed below for tuning model behavior. For a detailed technical walkthrough, please refer to the accompanying blog post.

Model

Select the AI model to use for chat

512 2048
0.1 2
0.1 1
1 100
1 2