I ran the same prompts on 4.7 and 4.8 for two weeks. Same tasks, same context, same everything.
On 4.8, they came back padded. Longer than needed, hedged, the actual answer buried in the middle.
4.8 didn't regress. Anthropic trained it to be more careful, and for hard reasoning tasks that's great. For everything else, careful is the wrong default.
Every problem is fixable. Most fixes take under a minute. The last one solves it permanently.

Fix 1: Get specific
Most prompts look like this:
“Write a summary of this article for my newsletter.”
4.8 has no idea who reads your newsletter, what tone you want, or how long the summary should be. So it picks the safest version of all of that.
Correct. Forgettable.

The fix:
“Write a 3-sentence summary of this article for my newsletter. Readers are professionals who use AI at work. Direct tone. Lead with the most surprising part, end with why it matters to them.”
Same task. Every gap is closed. The output shows it immediately.

When writing any 4.8 prompt, always include three things: who the output is for, what tone you want, and what format you want. Those three details alone cut the padding by half.
Fix 2: Add a rules line
4.8 has default habits it falls back on when nothing overrides them. Padding with context you didn't ask for. Disclaimers. Summary sections. Bullet points by default.
Turn them all off in one line at the bottom of your prompt:
Rules: no disclaimers, no summary at the end, be direct. If anything is unclear, make a decision and tell me what you chose.
Here's the version I use for writing tasks:
Rules: no disclaimers or caveats, no "it depends" answers, no summary, no bullet points unless I ask. If unclear, make a call and note it at the bottom.
[Image]
That last part matters. Telling 4.8 to decide and note the assumption means you get an answer instead of a back-and-forth. Correct it in one message if it was wrong. Keep moving if it wasn't.
Fix 3: Stop using 4.8 for everything
4.8 is not the best model for every task. Most people default to it because it's the newest. That's the mistake.
I tested Sonnet 4.6 and Opus 4.8 across every task I run regularly. Here's what I actually use:
Task | Model |
Complex research, long analysis | Opus 4.8 |
Writing, newsletters, social posts | Sonnet 4.6 |
Coding and debugging | Opus 4.8 |
Quick rewrites, summaries | Sonnet 4.6 |
Brainstorming, ideation | Sonnet 4.6 |
Sonnet 4.6 is faster, follows tone instructions better, and is less cautious. For writing it beats 4.8 consistently.

Change the model before you start, not after you get a bad output.
Want to keep reading?
Become a paying subscriber to get access to this post and all other premium content.
