It's a good thing people were enamored of how inexpensive GPT-5 is, given that t...

Tadpole9181 · 2025-08-09T18:14:53 1754763293

54,000 bytes at one byte per character, and roughly 4 characters per token (more or less), works out to around 13,500 tokens.

These are NOT included in the model context size for pricing.
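For anyone who wants to redo the back-of-the-envelope estimate, here is a minimal sketch. The 54,000-byte size and the 4-characters-per-token ratio are the figures from this comment; the function name `estimate_tokens` is made up for illustration, and a real tokenizer would give a somewhat different count.

```python
# Rough token estimate from byte length. The 4 chars/token ratio is only a
# rule of thumb for English text, not a tokenizer-accurate number.
SYSTEM_PROMPT_BYTES = 54_000
CHARS_PER_TOKEN = 4


def estimate_tokens(n_bytes, bytes_per_char=1, chars_per_token=CHARS_PER_TOKEN):
    """Hypothetical helper: bytes -> characters -> approximate tokens."""
    n_chars = n_bytes / bytes_per_char
    return round(n_chars / chars_per_token)


estimated = estimate_tokens(SYSTEM_PROMPT_BYTES)
print(estimated)  # 13500
```

The result lands in the same ballpark as the rough figure quoted above. Non-ASCII text would break the one-byte-per-character assumption and push the estimate around.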

btdmaster · 2025-08-09T17:24:51 1754760291

I might be wrong, but can't you checkpoint the post-system prompt model and restore from there, trading memory for compute? Or is that too much extra state?

mdaniel · 2025-08-09T17:34:02 1754760842

My mental model is that the system prompt isn't one thing, and that seems even more apparent with line 6 telling the model what today's date is. I have no insider information, but system prompts could undergo A/B testing just like any other change, to find the optimal one for some population of users.

Which is to say you wouldn't want to bake such a thing too deeply into a multi-terabyte bunch of floating-point numbers, because it makes operating things harder.

reitzensteinm · 2025-08-10T23:10:29 1754867429

OpenAI automatically caches prompt prefixes on the API. Caching an infrequently changing internally controlled system prompt is trivial by comparison.
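To make the idea concrete, here is a toy sketch of prefix caching, not how OpenAI actually implements it. Everything here is hypothetical: `PrefixCache`, `fake_prefill`, and the example prompt are invented for illustration, and the "state" is a stand-in for whatever a real server would keep after processing the prompt. The point is only that the expensive pass over a shared prefix runs once per distinct prefix, and that a changed prompt (a new date, an A/B variant) simply gets its own cache entry.

```python
import hashlib


class PrefixCache:
    """Toy cache: maps a prompt prefix to its precomputed state."""

    def __init__(self, compute_state):
        self._compute_state = compute_state  # the expensive step
        self._store = {}
        self.misses = 0

    def state_for(self, prefix):
        key = hashlib.sha256(prefix.encode("utf-8")).hexdigest()
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._compute_state(prefix)
        return self._store[key]


def fake_prefill(text):
    # Stand-in for the expensive forward pass over the prompt (assumption).
    return tuple(len(word) for word in text.split())


cache = PrefixCache(fake_prefill)
system_prompt = "You are a helpful assistant. Today's date is 2025-08-09."

for user_message in ["hi", "what's new?", "bye"]:
    # Only the first iteration computes; later ones reuse the stored state.
    state = cache.state_for(system_prompt)

# A different prompt variant (next day's date) gets its own entry.
next_day_prompt = system_prompt.replace("2025-08-09", "2025-08-10")
cache.state_for(next_day_prompt)

print(cache.misses)  # 2
```

Memory cost scales with the number of distinct prefixes kept around, which is the memory-for-compute trade mentioned upthread.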