Today’s theme: agents are moving from demos to deployable systems. New products emphasize sandboxing and team-wide workflows, model releases push more capability onto fewer GPUs, and research is drilling into the bottlenecks: parallelizing model streams, privacy-policy trade-offs, and contamination-resistant evaluation. The practical question is no longer ‘can an agent do this?’ but ‘can we run it safely, predictably, and cost-effectively at scale?’