microservices cost reduction at scale

Wix cuts Kubernetes node count 50% by collapsing 4,000 microservices into shared-runtime host-guest pairs GeeCON

Wix's Nile platform bundles related JVM microservices as thin "guest" pods communicating via gRPC with a single "host" daemon set per node that owns all framework concerns—data access, Kafka, feature flags. Result: 27% CPU reduction, 32% memory reduction, half the node count across 5 billion daily requests.

In case you missed them

AI pricing model shift

Stripe data: hybrid pricing adoption jumps from 6% to 41% among AI companies in two years AI Engineer

Stripe's Mayank Pant presents a five-step AI pricing framework covering value definition, charge metrics, model selection, guardrail design, and iteration cadence. Key finding: 5-10% of power users consume 80% of compute, making pure subscription untenable; OpenAI, Anthropic, and ElevenLabs use a credits abstraction to evolve pricing without customer-facing disruption.

LLM non-determinism breaks security testing

LLM non-determinism breaks traditional risk assessments, forcing new threat models Wild West Hackin' Fest

Jake Williams (former NSA) walks through five production vulnerability classes — prompt injection, insecure output handling, credential leakage, weak agent identity governance, and logging gaps — and maps controls including LangSmith, Llama Guard, and prompt firewalls. Core guidance: treat LLM outputs as hostile by default and build test harnesses to reproduce probabilistic findings.

agents beyond coding into knowledge work

Cognition's Devin runs AI Engineer conference ops across a nine-person team AI Engineer

Swyx details how the AI Engineer team used Devin beyond coding—for Figma-to-web conversion, speaker coordination, sponsor data, scheduling, and sourcing physical props. The productivity gain came from eliminating blocking tasks so non-technical staff could work asynchronously, and from attempting polish work that previously wouldn't have been prioritized.