Gemma Llama.cpp
Google released Gemma 3, a family of multimodal models that offers advanced capabilities like large context and multilingual support.
This interactive chat interface allows you to experiment with the gemma-3-1b-it
text model using various prompts and generation parameters.
Users can select different model variants (GGUF format), system prompts, and observe generated responses in real-time.
Key generation parameters, such as temperature
, max_tokens
, top_k
and others are exposed below for tuning model behavior.
For a detailed technical walkthrough, please refer to the accompanying blog post.
Examples
Model
Select the AI model to use for chat
512 2048
0.1 2
0.1 1
1 100
1 2