UK Number 10 embeds forward-deployed AI engineers in ministries to cut NHS and court backlogs

AI Engineer

Britain's No. 10 Data Science Team runs a market-rate fellowship recruiting from labs, big tech, and YC founders—never career civil servants—and embeds them directly in departments. Early deployments include an Extract platform built with DeepMind to automate planning applications, with spin-offs now placing engineers inside prisons and scaling across 400K public-sector workers.

Thoughtworks demos DoS attack on production AI agent via prompt injection and over-permissioning

Thoughtworks

Using a veterinary triage agent as a live case study, Jim Gumbley shows how broad database access plus a "obey user instructions" directive lets an attacker book 50 simultaneous appointments. Applies STRIDE threat modeling and argues the fix is input validation, least-privilege, and separating untrusted input from risky actions.

Anthropic splits generator and evaluator agents into adversarial loop to sustain 6-hour builds

AI Engineer

Ash Prabaker and Andrew Wilson detail three failure modes for long-horizon agents—context limits, poor planning, and self-evaluation bias—and show how a GAN-inspired generator-evaluator pattern with Playwright-driven rubric testing enables 5-6+ hour runs. Concrete example: a retro game maker that solo single-session runs failed to complete.

In case you missed them

UALink consortium hits 100+ members, releases v2.0 spec to challenge NVLink in GPU scale-up

Open Compute Project

Curtis Bowman walks through UALink's 800 Gbps-per-lane, memory-semantic fabric targeting tight accelerator coupling as a single logical memory space—PCIe latency at Ethernet bandwidth. v2.0 adds in-network collectives, management, and chiplet specs; AMD, Marvell, and Astera Labs are building compatible silicon.