Claude Opus 4.8 Focuses on "Honesty" & Reduced Hallucination
Originally published on Simon Willison's Weblog by Simon Willison

Summary & Key Takeaways
- Anthropic has released Claude Opus 4.8, described as a "modest but tangible improvement."
- A key focus of this release is improved "honesty" in the model.
- Opus 4.8 is four times less likely to allow flaws in its generated code to pass unremarked.
- The model achieves this by abstaining on uncertain questions rather than providing incorrect answers.
- Pricing remains consistent with previous Opus versions.
- "Fast mode" pricing has been significantly reduced for organizations with access.
Our Commentary
We appreciate Anthropic's candor here. Calling a release "modest but tangible" is refreshing in an industry prone to hyperbole. The focus on "honesty" and abstention is a smart move: it is better for an LLM to say "I don't know" than to confidently hallucinate. This feels like a crucial step toward more trustworthy AI. We're curious whether this approach will become standard for future model releases.