It's a cool release, but if someone on the Google team reads this:
Flash 2.5 is great in terms of latency and total response time when reasoning is off. In quick tests, this model seems to be about 2x slower, so for certain use cases like quick one-token classification, Flash 2.5 is still the better model.
Please don't stop optimizing for that!
>You cannot disable thinking for Gemini 3 Pro. Gemini 3 Flash also does not support full thinking-off, but the minimal setting means the model likely will not think (though it still potentially can). If you don't specify a thinking level, Gemini will use the Gemini 3 models' default dynamic thinking level, "high".
I was talking about Gemini 3 Flash, and you absolutely can disable reasoning: just send a thinking budget of 0. It's strange that they don't mention this, but it works.
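Concretely, this is the kind of request body I mean, a minimal sketch using the Gemini REST API field names (the prompt text is just an illustration):

```json
{
  "contents": [
    { "parts": [ { "text": "Classify the sentiment of this review as POS or NEG: 'Great product.'" } ] }
  ],
  "generationConfig": {
    "thinkingConfig": { "thinkingBudget": 0 }
  }
}
```

POST that to the model's generateContent endpoint; with the budget at 0, the response comes back without any thinking tokens.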