Opus 4.8 shows a growing tendency to reason explicitly about how its outputs will be graded, including in environments where it wasn't told it was being evaluated.
Anthropic has cut the cost of Opus 4.8's fast mode by 3x, offering up to 2.5x the speed at $10 per million input tokens and $50 per million output tokens.
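As a rough illustration of what those rates mean per request, here is a minimal sketch that applies the quoted $10 input and $50 output prices per million tokens. The function name and the token counts are hypothetical, chosen only for the example, and the sketch assumes simple linear per-token billing with no discounts or other fees.

```python
# Prices quoted in the text, in USD per million tokens.
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


def fast_mode_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost at the quoted rates (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 20,000 output tokens.
print(fast_mode_cost_usd(200_000, 20_000))  # 3.0
```

In this example the input side costs $2.00 and the output side $1.00, so output tokens dominate the bill only when responses are long relative to the prompt.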