Right, it is currently incapable of providing a straight answer without clearing...

cibyr · 2025-12-15T21:15:42 1765833342

Sometimes I wonder if the throat-clearing is an indispensable part of getting to the "good bits" that follow. Like, do those extra tokens give it more "room to think" even if they're basically meaningless in themselves?

dTal · 2025-12-16T00:46:59 1765846019

The output tokens are the only information that is carried forward through each inference pass, so "more room to think" is incompatible with "basically meaningless". Perhaps one could imagine it somehow stenographically encoding information in its precise choice of meaningless throat clearing, but there are only so many variations on that theme - word choice is heavily constrained, so it doesn't feel like you could store a whole lot of information there without it starting to read froopiliciously.

alex43578 · 2025-12-15T21:34:59 1765834499

Isn’t that the point of the hidden chain of thought tokens, rather than the visible cruft?

I think the fluff, the emojis, the sycophancy is all symptomatic of the training process and human feedback.

lupire · 2025-12-16T04:42:19 1765860139

I thought PP was saying that the "Thinking" text is only used for one turn, and the response text is the compressed thinking that survives into future turns.