I've been on the 1M context window with Claude since 4.0, and it gets pretty expensive when you run 1M of context on a long-running project (I'm mostly using it in Cline for coding). I think they've realized that more context length means more $ for most agentic coding workflows on the API.
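A rough back-of-the-envelope on why, since an agent loop resends the whole context as input tokens on every turn. The $3 per million input tokens below is a made-up illustrative rate, not any provider's actual pricing:

  # Cost of resending a large context on every agent turn.
  # The per-token price is hypothetical, purely for illustration.
  PRICE_PER_INPUT_TOKEN = 3.00 / 1_000_000  # USD per token, assumed

  def session_cost(context_tokens: int, turns: int) -> float:
      # Each turn resends the full context as input tokens.
      return context_tokens * turns * PRICE_PER_INPUT_TOKEN

  print(session_cost(1_000_000, 50))  # 150.0 -> ~$150 for a 50-turn session
  print(session_cost(100_000, 50))    #  15.0 -> ~$15 at a 100k budget

This ignores output tokens and prompt caching, which change the numbers but not the shape: cost scales linearly with both context size and turn count.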




You should be doing everything you can to keep context under 200k tokens, ideally even under 100k. All the models degrade badly as the context grows.
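For what it's worth, here's a minimal sketch of enforcing that kind of budget in an agent loop, assuming a messages list in the usual {role, content} shape; count_tokens is a crude 4-chars-per-token stand-in for whatever tokenizer your provider actually exposes:

  TOKEN_BUDGET = 100_000

  def count_tokens(text: str) -> int:
      return len(text) // 4  # rough heuristic; swap in a real tokenizer

  def trim_history(messages: list[dict], budget: int = TOKEN_BUDGET) -> list[dict]:
      # Always keep the system prompt, then keep the newest
      # messages that still fit under the budget.
      system, rest = messages[0], messages[1:]
      kept, used = [], count_tokens(system["content"])
      for msg in reversed(rest):
          cost = count_tokens(msg["content"])
          if used + cost > budget:
              break
          kept.append(msg)
          used += cost
      return [system] + list(reversed(kept))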

I don't have that experience with Gemini. Up to about 90% full, it's just fine.

If a model is designed around long context, rather than resorting to compression to reach higher input token lengths, it doesn't 'fall off' as it nears the context window limit. When working with large codebases, exhausting or compacting the context actually causes more issues, since the agent forgets what was in the other libraries and files. Google realized this internally and was among the first to reach a 2M-token context length (internally at first, then released publicly).
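To make the compaction point concrete, here's a sketch of what a typical compaction step does; summarize() is a stub standing in for a real model call, and the point is that whatever detail it drops is gone for good:

  def summarize(turns: list[str]) -> str:
      # Stand-in for an LLM summarization call; real versions
      # lose file- and symbol-level detail the same way.
      return f"[summary of {len(turns)} earlier turns; detail discarded]"

  def compact(history: list[str], keep_recent: int = 10) -> list[str]:
      # Replace everything but the most recent turns with one summary.
      old, recent = history[:-keep_recent], history[-keep_recent:]
      if not old:
          return history
      return [summarize(old)] + recent

Once the old turns are collapsed, the agent can no longer quote the code it summarized away, which is exactly the forgetting described above.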


