AI article

temperature=0 didn't make our LLM evals reproducible

TL;DR: We set temperature=0 and seed=42 and still got different eval scores on the same 800-prompt...

Dev.to | Jun 23, 2026 | Marcus Chen

Read the original article

More AI news