AI article
temperature=0 didn't make our LLM evals reproducible
TL;DR: We set temperature=0 and seed=42 and still got different eval scores on the same 800-prompt...
Dev.to | Jun 23, 2026 | Marcus Chen
AI article
TL;DR: We set temperature=0 and seed=42 and still got different eval scores on the same 800-prompt...
Dev.to | Jun 23, 2026 | Marcus Chen