What happened
Anthropic released Claude 3.5 Sonnet, a mid-tier model priced at $3 per million input tokens and $15 per million output. It outperforms the previous flagship, Claude 3 Opus, on graduate-level reasoning (GPQA), undergraduate-level knowledge (MMLU), and HumanEval code generation. Anthropic also shipped 'Artifacts' — a side-panel canvas for rendered output.
The pricing is the most consequential part. Sonnet is 1/5 the cost of Opus for superior benchmark scores. Anthropic is signalling that the cost curve of frontier-quality retrieval is falling fast.
Why it matters to Shopify merchants
When the best model gets cheaper, AI-assistant surfaces become more ambitious. Claude is now cheap enough that Anthropic's own shopping-assistant experiments, and downstream products built on the Claude API, can afford deeper retrieval per query. Deeper retrieval means more citations per answer — and Shopify merchants cited in those answers sit in the funnel.
Better reasoning also raises the bar for what content makes the citation cut. Claude 3.5 Sonnet is better at discarding weak sources. Catalogs with thin descriptions, missing schema, or hallucination-prone content get filtered out more aggressively. The asymmetric payoff is to the stores whose pages are parseable, factual, and clean.
For merchants, the tactical move is to make sure every product page answers the specific questions buyers ask before buying — materials, fit, comparisons to named alternatives, return policy in one sentence. Claude 3.5 Sonnet rewards that density; thin product pages get skipped.
- Mar 2024
Claude 3 Opus ships
Flagship reasoning model at $15/$75 per 1M input/output tokens.
- Jun 2024
Claude 3.5 Sonnet ships
Beats Opus on GPQA + HumanEval at $3/$15 per 1M tokens — 1/5 the cost.
- Jun 2024
Artifacts canvas goes GA
Side-panel canvas for rendered output, shifting shopping UX toward inline product cards.
- Q3 2024
Sonnet API adoption surge
Third-party agentic shoppers migrate from Opus to Sonnet en masse.