
The model runs only on the CPU rather than the GPU because device_map="auto" was not passed to Gemma3ForConditionalGeneration.from_pretrained. Adding this argument enables automatic device placement, so the weights are loaded onto the GPU when one is available.
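A minimal sketch of the change, assuming the transformers and accelerate packages are installed (device_map="auto" depends on accelerate). The function names load_gemma3_on_available_devices and count_parameters_by_device are hypothetical helpers introduced here for illustration, and the checkpoint ID is left as a parameter because the original comment does not name one.

```python
from collections import Counter


def load_gemma3_on_available_devices(model_id: str):
    """Load a Gemma 3 checkpoint and let Accelerate decide weight placement."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import Gemma3ForConditionalGeneration

    return Gemma3ForConditionalGeneration.from_pretrained(
        model_id,
        device_map="auto",  # needs the `accelerate` package; without it, weights stay on CPU
    )


def count_parameters_by_device(model) -> dict:
    """Hypothetical helper: count parameter tensors per device to verify placement."""
    return dict(Counter(str(p.device) for p in model.parameters()))
```

After loading, calling count_parameters_by_device(model) should show whether any parameters ended up on a CUDA device. If every entry still reports cpu, it may be that no GPU is visible to PyTorch, which is a separate problem from the missing argument.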
