Streaming architecture and speculative decoding: How companies are unlocking cheaper AI
By Tasmin Lockwood
Apr 28, 2026