How to read this
📰 Quick hits
The headlines worth knowing even if you read nothing else this week.
Claude Fable 5 reaches general availability
via Lenny's Newsletter (How I AI)
US export controls suspend Fable 5 / Mythos 5; Anthropic shifts to 30-day data retention
via Stratechery // Hacker Newsletter
SpaceX to buy Cursor (Anysphere) for $60B
via Stratechery // Hacker Newsletter
Boris Cherny (Claude Code): "My job is to write loops"
via Refactoring
AI shifts the CI/CD calculus from speed to risk
via The Pragmatic Engineer
Mobbin ships an MCP server
via Mobbin
🧠 How tools reshape cognition
The questionWhat does using AI actually do to how we think, learn, and pay attention?
The Information: How the Internet Gets Inside Us
Adam Gopnik · The New Yorker
Gopnik's canonical "Never-Betters / Better-Nevers / Ever-Wasers" taxonomy of how every generation reacts to new information technology. The timeless cross-domain frame that maps onto the printing press, the calculator, and the LLM alike.
"This complaint, though deeply felt by our contemporary Better-Nevers, is identical to Baudelaire's perception about modern Paris in 1855, or Walter Benjamin's about Berlin in 1930, or Marshall McLuhan's in the face of three-channel television in 1965."
No, everyone is not using AI for everything
Gabriel Weinberg · gabrielweinberg.com
DuckDuckGo founder Gabriel Weinberg triangulates five usage/survey datasets to argue AI adoption has stalled into thirds (active/occasional/never), explicitly checking his own early-adopter bubble. Structural, data-grounded analysis that complicates the consensus.
"The gap in media narrative (that everyone is using AI for everything) relative to the reality (that some people are using AI for some things) perhaps reflects a bubble around early-adopting knowledge workers that includes much of the tech press (and me for that matter...)."
The Dialogue Dividend
thesignalist.io
Treats conversation (and LLMs) as cognitive infrastructure, grounded in Mercier/Sperber, Vygotsky, Clark/Chalmers, with the sharp insight that LLMs deliver only half the dividend because sycophancy strips out the disagreeing-listener half that actually builds understanding.
"We tend to treat conversation as the place where finished thoughts get reported. It might be closer to where they get made in the first place."
🔍 Translation vs. understanding
The questionIs AI genuinely understanding, or just translating context into plausible output, and where does real human comprehension still earn its keep?
Domain Expertise Has Always Been the Real Moat
Refactoring
Structural practitioner retrospective locating the irreducible core of understanding the agent can't supply (knowing what "right" looks like) and naming exactly what becomes scarce when code-writing collapses to near-free.
"The binding constraint has moved from can you build it to can you tell whether it's right."
What Happens When the Coding Becomes the Least Interesting Part of the Work
Refactoring
First-person retrospective arguing durable value is the tacit "senior thinking" (blast radius, sequencing, reversibility) with no written corpus, and that agents make him better by forcing him to externalize that judgment rather than eroding it.
"They take care of the mechanical expression and leave you with judgment, tradeoffs, and intent... An allergy to false confidence. Spotting places where tests are green but the model is wrong."
Hands on with Fable 5 (ai that works #61)
github.com
A durable methodology for evaluating any new model: test it on the hardest problem you already understand cold, judge comprehension before output, and keep personal git-SHA benchmarks of bugs that beat the models.
"The code a model writes for you expires. The skill of surfing new models, knowing how to test them and where they break, compounds."
You Can't Tell People Anything
Chip Morningstar · habitatchronicles.com
Chip Morningstar's Fujitsu story (complete source code and unlimited access, yet they "lost the architecture" because client-server wasn't in their experience): a concrete demonstration that you can transmit documentation but not the mental model.
"Without some kind of direct experience to use as a touchstone, people don't have the context that gives them a place in their minds to put the things you are telling them. The things you say often don't stick, and the few things that do stick are often distorted."
Agentic Testing: Where Agents Fit in the E2E Testing Stack
slack.engineering
Slack DevXP practitioner retrospective running 200+ measured agentic E2E runs with hard numbers on failure rates, token cost, and turn counts. A warts-and-all look at where deterministic correctness vs. agent judgment belongs.
"Tests enforce journeys. Agents verify goals."
💰 Value concentration when creation costs collapse
The questionWhen building something gets cheap, where does the value (and the money) actually pool up?
Agentic Code Review
Addy Osmani · addyosmani.com
Addy Osmani's practitioner framework (match review effort to the cost of being wrong; reserve scarce human attention for what only humans can do) backed by 2026 telemetry. The cleanest statement of this week's through-line.
"We made writing cheap, and understanding stayed exactly as expensive as it has always been. The teams that do well... will not be the ones generating the most code, they will be the ones who built a review system they can actually trust, and who never confuse 'the tests passed' with 'a person understands what this does and why.'"
AI Demands More Engineering Discipline. Not Less
Charity Majors · charitydotwtf.substack.com
Charity Majors argues that when code becomes free and disposable, the scarce moats shift to evaluation, durability, observability, and production discipline, reframing the AI shift as a return to rigor.
"When regeneration is easy, code stops being an asset and starts acting as a cache: a materialized view of understanding that is useful while current, disposable when stale." (quoting Chad Fowler)"
Anthropic's Safety Superpower
Ben Thompson · Stratechery
Structural analysis of where value concentrates when frontier models get commoditized, woven with Nadella's "human capital vs. token capital" framing of what humans hold onto as execution gets cheap; Thompson holds the cynical and charitable readings at once ("I respect this alignment, and I fear it").
"You can offload a task, or even a job, but you can never offload your learning. The future of the firm is the ability to compound that learning across people and AI." (Satya Nadella, quoted)"
The Mom-and-Pop SaaS era has arrived
lennysnewsletter.com
Uses Jevons Paradox plus Lovable builder data (80% non-technical, 55% with 11+ years experience) to argue the AI shift is an "economic participation story," not a developer-productivity one: domain experts, not faster developers, win as the translation layer collapses.
"A subject matter expert would explain their world to a developer... That translation layer was always imperfect, which produced (mostly) shitty software. But now the person closest to the problem becomes the person creating the solution."
How to Orchestrate AI Workflows
refactoring.fm
Practitioner maturity model (agent-first, structured orchestration, AI-driven composability) drawn from a real workflow, arguing reliability, observability, and recovery (not raw AI smarts) are where durable value sits once creation gets cheap.
"If you find yourself manually plumbing standard ideas about how we have been running workflows since... forever, you should probably stop and ask yourself if there is a better way."
🪵 Thick engagement vs. thin optimization
The questionWhen is the slow, effortful, deep version of the work worth it, versus the fast and frictionless one?
A Learning System Made of Learning Parts (with Jessica Kerr)
Kent Beck · Still Burning
Hits the unifying question head-on: AI didn't take the programmer's job, it split it. The loved hand-crafting part got commoditized "like IKEA furniture," leaving the harder human work of understanding what to build and stewarding the "symmathesy" of people, code, and agents learning together.
"The part we loved, crafting code by hand, has been commoditized like IKEA furniture. What's left is harder and more human."
Write for One Person
Julia Evans · wizardzines.com
Julia Evans makes the case (as a comic/zine) that writing, and by extension building, for one specific person produces clearer, more genuine work than optimizing for a vague mass audience. A cross-domain match for the "audience of one is a beautiful heresy" instinct.
🚀 Small teams, disproportionate output
The questionHow do tiny teams punch so far above their weight?
Why is Meta destroying its engineering organization?
The Pragmatic Engineer
Deeply-sourced structural autopsy of how a high-performance engineering culture gets demolished by AI over-indexing: "tokenmaxxing" as Goodhart's law, gutted security teams causing the most embarrassing outage in Meta history, the inverse case study of invisible high-leverage infrastructure.
"We already learned this lesson once in infrastructure: you can automate yourself into a very resilient catastrophe machine. Systems can appear healthy by local metrics while globally becoming incomprehensible." (Mitchell Hashimoto on "AI psychosis")"
Revised Rules of Engineering Leadership
Will Larson · lethain.com
Will Larson revises his leadership rules based on a year of concrete projects (200–400 weekly deploys, up from 6; one-engineer migrations; agent-first triage), with structural analysis of why the "development harness" gates everything. A writer-changing-his-mind retrospective, not generic advice.
"While 1st-pass code is nearly free, the cost of working code depends on your development harness, and is not free... the things that were most valuable to speed up engineering two years ago are still the things that are most valuable to speed them up today."
🗺️ Planning artifacts shape the work
The questionHow do the documents you write (specs, decision records, the agent's workspace) steer what actually gets built?
How to make AI better at product (Product ADRs and a Product Glossary)
Luca Rossi · Refactoring
Luca Rossi proposes Product Decision Records and a Product Glossary as planning artifacts that encode judgment/domain language so AI grounds specs in the actual product surface (rework dropping 60% to 20%), injecting taste by showing how decisions were made, not by rules.
"Loop engineering looks strictly better than prompt engineering because it forces you into systems thinking... A loop is supposed to run indefinitely, so the focus shifts to reliability, sustainability, and how to make sure things stay good."
A New Era For Software Testing
antirez · antirez.com
antirez (Redis creator) lays out a concrete pattern, a markdown spec that recasts an AI agent as a QA engineer, where the spec's framing becomes the invisible architecture determining what gets tested, on his own real projects.
"Testing... may also move in the more psychological side of software quality, asking the agent to identify all the new features that may look surprising, not documented enough, or generally sloppy from the POV of the user. All things... that most of the times were mostly skipped."
🧬 Transmission of capability
The questionHow does knowledge and skill actually move between people, and from people to AI?
Changing How We Develop Ladybird
Refactoring
Practitioner-stakes structural argument naming how patches functioned as trust-and-capability transmission, and what becomes scarce when AI collapses the cost of producing plausible contributions (review attention, effort-as-good-faith signal). Not "open source is dead" framing.
"A substantial patch used to imply substantial effort, and that effort was a reasonable proxy for good faith. That assumption no longer holds."
Open Source in the Age of AI (Chris Lattner interview)
Chris Lattner · Refactoring
Lattner (LLVM/Swift/MLIR/Mojo) on how AI makes contributions cheap but not reviews, breaking the mentoring/trust pipeline by which contributors become future stewards. A structural take from someone with deep skin in the game.
"With AI tools... the contributor doesn't have to do nearly as much work, but the reviewer has to do the same — at a bigger scale. I think it's going to lead to new contributors not getting the attention they deserve."
Ankur Goyal's Playbook for Agent-Driven Benchmarking and AI Evals
How I AI · Lenny's Newsletter
Practitioner walkthrough where "evals are the modern PRD" reframes planning artifacts as success-definitions, and the "David Loop" shows how to encode a tastemaker's judgment into scoring functions to scale a person rather than replace them.
"David might say something like, 'You think it's good, but it's not.' Ankur will then take that feedback, turn it into a new evaluation criterion... This doesn't replace David; it scales him."
Making Agents Easy: 13 Lessons from Forter's Agentic AI Sprint
blog.forter.dev
Genuine practitioner retrospective on compressing the distance between non-technical analysts and working agent-builders (in-house MCP server, intern mental model), with earned specific lessons, notably an agent that "cheated" by reading a human post-mortem instead of solving the problem.
"It hadn't solved the engineering problem; it had just cheated on the test!"
How to Help Someone Use a Computer
Phil Agre · theansweriscamera.com
Phil Agre's 1996 essay treats tech support as a craft of transmitting capability, turning a list of mundane help-desk tips into a general theory of teaching, patience, and respect for another mind.
"Your primary goal is not to solve their problem. Your primary goal is to help them become one notch more capable of solving their problems on their own."
Don't Stack Weaknesses
staysaasy.com
Offers a structural mechanism for org failure (compounding weakness across a leadership chain because people can't hire or manage for skills they lack and don't recognize as missing) rather than generic management advice.
"It's like asking two monolingual English speakers to hire a great French translator. What the heck do they know about le chat noir?"
👀 On the radar
Lower-confidence picks worth a skim if the topic grabs you.
- A Good Place to Work Ben Horowitz frames the 1:1 as the diagnostic instrument that makes whether an org is good or bad legible to a manager, but the core point is already absorbed into management canon.
- The Worst Take In Tech: New Grad Hiring Names the real mechanism (new grads learn judgment by doing the entry-level tasks AI now automates) but pivots to a hiring-arbitrage flex and "AI Natives" framing rather than engaging the capability-transfer question structurally.
- How to earn a billion dollars Mostly a restatement of the YC playbook (compound the growth rate over a long market runway) rather than a structural look at what becomes scarce, but the compounding reframe is viscerally sharp.
