Anthropic: Claude Sonnet 4Model25/100 via “prompt caching for reduced latency and cost on repeated contexts”
Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...
Unique: Automatic content-hash based caching that requires zero developer configuration — the API detects cacheable content and applies caching transparently, with 90% token cost reduction and 50-70% latency improvement on cache hits without explicit cache management APIs
vs others: More transparent than manual caching approaches and more efficient than GPT-4's prompt caching (which requires explicit cache control headers), with automatic detection eliminating the need for developers to manually identify cacheable content