My biggest problem with LLMs at this point is that they produce inconsistent results and behave differently given the same prompt. Better grounding would be amazing. I want to give an LLM the same prompt on different days and be able to trust that it will do the same thing as yesterday. Currently they misbehave multiple times a week and I have to manually steer them a bit, which destroys certain automated workflows completely.
It sounds like you have dug into this problem in some depth, so I would love to hear more. When you've tried to automate things, I'm guessing you've got a template plus some data, and then the same or similar input gives totally different results? What details can you share about how different the results are? Are you asking for e.g. JSON output and getting something that totally isn't, or is it a more subtle difference?
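To make the question concrete, this is roughly the failure mode I have in mind, as a minimal sketch (the function and example strings are made up for illustration, not from your setup):

```python
import json

def parse_strict_json(raw: str) -> dict:
    """Parse model output that was supposed to be JSON; raise if it isn't."""
    # Strip the ```json fences models sometimes wrap around the payload.
    cleaned = raw.strip().removeprefix("```json").removesuffix("```").strip()
    return json.loads(cleaned)  # raises json.JSONDecodeError on non-JSON

# The same prompt on two different days:
good_day = '{"status": "ok", "items": 3}'
bad_day = 'Sure! Here is the JSON you asked for:\n{"status": "ok", "items": 3}'

parse_strict_json(good_day)   # parses fine
# parse_strict_json(bad_day)  # raises: the chatty preamble breaks the pipeline
```

Is it that kind of hard failure, or does the JSON parse but the values drift?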
> I want to give an LLM the same prompt on different days and I want to be able to trust that it will do the same thing as yesterday
Bad news, it's winter now in the Northern hemisphere, so expect all of our AIs to get slightly less performant as they emulate humans under-performing until Spring.
It doesn’t really solve it, as a slight shift in the prompt can have totally unpredictable results anyway. And if your prompt is always exactly the same, you’d just cache the response and bypass the LLM entirely.
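For exact repeats the cache really can be that dumb; a minimal sketch, assuming you already have some `generate(prompt)` call (the name is made up):

```python
import hashlib

# Identical prompts hit the cache and never reach the model; anything
# that differs by even one character falls through to generate(), where
# the output can still be wildly different.
_cache: dict[str, str] = {}

def cached_generate(prompt: str, generate) -> str:
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in _cache:
        _cache[key] = generate(prompt)  # only called on a cache miss
    return _cache[key]
```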
What would really be useful is that a very similar prompt should always give a very similar result.
This doesn't work with the current architecture, because we have to introduce some element of stochastic noise into the generation (the sampling temperature), or else the models aren't "creatively" generative.
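Roughly where that noise enters, as a toy sketch (the `logits` array and function are made up for illustration, not any particular model's API):

```python
import numpy as np

rng = np.random.default_rng()

def sample_next_token(logits: np.ndarray, temperature: float) -> int:
    """Pick the next token from the model's scores over the vocabulary."""
    if temperature <= 0:
        # Greedy decoding: deterministic, but every run gives the same,
        # less varied output.
        return int(np.argmax(logits))
    # Temperature sampling: repeated runs can pick different tokens,
    # which is exactly the noise that makes outputs drift between days.
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())
    probs /= probs.sum()
    return int(rng.choice(len(logits), p=probs))
```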
Your brain doesn't have this problem because the noise is already present. You, as an actual thinking being, are able to override the noise and say "no, this is false." An LLM doesn't have that capability.