Categories
Haiku Living Reading

The Presence We Keep Deferring

I have so many unread articles saved to Instapaper that I’ve stopped checking the count. Each one felt, in the moment of saving it, like something I needed. A long piece on urban planning, a profile of someone interesting, a reported essay I fully intended to sit with.

The app is beautifully designed for exactly this โ€” the frictionless capture, the clean reading interface waiting patiently on the other side.

What it can’t do is manufacture the attention I didn’t have when I saved it and still don’t have now. The articles aren’t the problem. The premise is: that presence is something you can bank.

There’s a haiku I keep returning to, from Natalie Goldberg’s Three Simple Lines. It’s by a poet named Fumiko Harada:

Morning chill
I savor this moment โ€”
one meeting one lifetime

Eleven words. No verb in the third line, which makes it feel less like a thought and more like a verdict.

The Japanese concept underneath it is ichi-go ichi-e โ€” loosely, “one time, one meeting.” It’s a Zen idea with origins in the tea ceremony, the understanding that each gathering is singular and therefore irreversible. You cannot archive it. You cannot search for it later. When it ends, it doesn’t go anywhere you can retrieve.

This is what the Instapaper queue is, at scale: an archive of moments I decided to experience later. The article about urban planning was written by someone who spent months reporting it, on a day when some editor thought it was ready, and landed in my feed on a morning when something about the headline caught me. That constellation doesn’t reassemble. Later is a different article.

The tools I use every day are getting astonishing. There are systems that can summarize, translate, recall, explain, anticipate. I use them. I find them genuinely useful.

But there’s a habit of mind they reward โ€” a kind of perpetual deferral of full attention โ€” that I haven’t fully reckoned with. The promise, always, is that you can engage more completely later, once the summary is ready, once the transcript exists, once the notes have been taken. Presence becomes a productivity tax you pay while waiting for a deliverable.

Harada’s haiku doesn’t moralize. The speaker isn’t lecturing herself into awareness. She’s just cold, and awake, and choosing to notice. I savor this moment. The word “savor” does a lot of work. It implies effort. You savor things that could be missed.

The pivot in the third line is what stays with me. One meeting one lifetime. Not “this meeting will last a lifetime” โ€” that would be sentiment. It’s more like a mathematical statement: the cardinality of this encounter is one. There is exactly one of them. This morning, this particular chill, whatever conversation or solitude is happening inside it โ€” that set has one element. By tomorrow it has zero. No amount of documentation changes that arithmetic.

I’m working on believing that.

Categories
Architecture Infrastructure

The Architecture of the Indestructible

We are conditioned to look for the center of things. When we try to understand an organization, we ask for an organizational chart. When we look at a nation, we look to its capital. Traditional architectureโ€”whether of a building, a company, or an armyโ€”relies on a classic playbook: a strong hub, radiating outward. You find the center, you secure it, and the system holds.

But what happens when you try to decapitate an enemy, or a technology, that has no head?

In 1964, a brilliant engineer named Paul Baran sat at his desk at the RAND Corporation, trying to solve a Cold War nightmare: How do you maintain a communications network after a catastrophic nuclear strike? Baran realized that traditional networks were centralizedโ€”like a wheel with spokes. If you destroy the hub in the center, every single spoke becomes useless.

His solution was the distributed network, the foundational blueprint for what would eventually become the Internet.

“Under the proposed system, each station would need to be connected to only a few of its nearest neighborsโ€ฆ The system would be highly reliable, even if a large fraction of the stations were destroyed.”

Baran mathematically proved that if you remove the center, the edges don’t die. They simply reroute. A few decades later, telecom engineers used a remarkably similar logic to build cellular telephone networks. Instead of one massive, high-power radio tower serving an entire city, they broke the terrain into a grid of small, low-power cells. If one tower goes offline, the network degrades gracefully rather than collapsing. It bends, but it refuses to break.

There is a profound, poetic irony buried here. The United States government originally funded Baranโ€™s research to create a distributed network so that its centralized monolith could survive. Decades later, asymmetric adversaries across the globe adopted that exact architectural philosophy for their physical defense doctrinesโ€”creating “Mosaic Defense” systems designed specifically so that when you destroy the center, the edges keep fighting.

They copied our homework to survive our strength.

I find myself thinking about this tension far beyond the realms of military strategy or software engineering. It is a metaphor for how we construct our lives. We often build centralized livesโ€”anchored entirely to a single identity, a single career, or a single institution. We project a monolith of strength to the world. But monoliths are brittle. When the center is struck, the whole architecture crumbles.

The lesson of our modern architecture is becoming increasingly clear, whether you are managing a network, building an organization, or navigating the quiet complexities of a human life. The fragile monolith is an illusion of safety.

The future belongs to the web that knows how to reroute.

Categories
Etiquette Founders

How to Be in a Room

I happened to notice this in a post on X: Slow Ventures ran a 4-hour etiquette class for founders. This is actually a brilliant idea. A non-trivial amount of founders (and VCs) could use some help in this department.

So I asked AI: what would a course syllabus for something like this look like? Below is the response (which I quite like!):


This is a genuinely good idea โ€” etiquette as a business skill is perennially under-taught, especially for technical founders who may have gone deep on product/engineering but shallow on navigating rooms. Here’s how I’d structure a day-long or modular course built around the Slow Ventures premise.

Founder Etiquette โ€” A Field Guide

4-hour intensive ยท 5 modules ยท lecture + role-play format

Morning Block โ€” First Impressions

Module 1 ยท 45 min โ€” The room before the meeting

Arrivals, introductions, and presence under observation

  • Timing: the cost of early vs. late
  • Handshakes, eye contact, the graceful exit from a name you missed
  • Business card and phone etiquette
  • How to work a room without appearing to work a room
Module 2 ยท 45 min โ€” Table stakes: dining fluency

The business meal as performance and ritual

  • Place settings, utensils, and the BMW rule
  • Who orders first, who orders what
  • Wine: fluency without performance
  • Dietary restrictions โ€” yours and theirs
  • The check: the ritual, the timing, the dodge
  • When and how to talk business at the table

Mid-Morning Block โ€” Communication

Module 3 ยท 45 min โ€” Communication that doesn’t leak desperation

Written and verbal signals investors actually read

  • Email: subject lines, length, tone, and the follow-up cadence
  • Reading the room on Slack/text vs. formal channels
  • The thank-you note โ€” when, how, why it still matters
  • Saying no, disagreeing, and delivering bad news
  • How to receive feedback without deflecting or over-agreeing
  • The exit from a conversation: graceful, not abrupt

Afternoon Block โ€” High-Stakes Scenarios

Module 4 ยท 45 min โ€” Investor relations: the long game

Board rooms, pitch meetings, and the unwritten VC code

  • Board meeting protocols โ€” who speaks, in what order
  • Managing up to your board without managing them
  • The warm intro โ€” how to ask, how to use, how to close the loop
  • Confidence vs. bravado: what the room hears
  • What to do when the meeting is going badly
  • Conference behavior: panels, hallways, and the after-party
Module 5 ยท 60 min โ€” The edges: cultural fluency and hard situations

Where etiquette meets judgment

  • Cross-cultural norms: Japan, Middle East, Europe, India โ€” the high-risk differentials
  • Power dynamics: how to behave when you have more (or less)
  • Navigating someone else’s social gaffe gracefully
  • Gift-giving: what to give, what never to give
  • The fine line between casual and sloppy in casual settings
  • Live role-play: difficult scenarios, coached debrief

A few things I’d stress if actually running this:

Module 3 is where the real behavioral work happens for founders specifically. The desperation signal leaks through email more than anywhere else โ€” the too-fast reply, the over-explanation, the follow-up that arrives four hours after the first message. That’s the stuff that’s hard to see in yourself.

One thing Slow Ventures probably got right: four hours is about the right ceiling. Any longer and you start to feel like you’re at finishing school, which kills the energy.


After reviewing this, I thought it might be useful to add two role playing scenarios to the course. Here’s what the AI recommended:

First, add this role play in after Module 2:

Role-play 1 ยท 20 min โ€” The wrong bread plate

Recovering from a gaffe without losing the room

A founder is taken to lunch by a partner at a firm they’ve been trying to get in front of for six months. It’s going well โ€” good rapport, the partner seems genuinely interested. Then the founder accidentally takes the partner’s bread roll, realizes it mid-bite, and has to navigate the next ten minutes without letting the gaffe collapse the energy in the room. Secondary layer: the check arrives and it’s ambiguous who’s hosting.

This tests everything from Modules 1 and 2 simultaneously โ€” table fluency, recovery under embarrassment, reading whether to acknowledge the mistake or absorb it silently, and the check ritual. Low-stakes enough to be funny, high-stakes enough to feel real.


Next, add this role play after Modules 3, 4, and 5:

Role-play 2 ยท 30 min โ€” The meeting that’s going sideways

Holding the room when the numbers missed

A founder is in a board meeting presenting quarterly numbers that missed. Midway through, one board member interrupts with a pointed question the founder doesn’t have a clean answer to. A second board member piles on. The founder has to hold the room โ€” not deflect, not over-apologize, not go defensive โ€” while also reading which of the two is genuinely concerned versus performing concern for the room. Cultural wrinkle: the more aggressive questioner comes from a context where directness reads as respect, not attack.

This pulls from Modules 3, 4, and 5 simultaneously. It’s the hardest scenario in the course and should feel that way. The coached debrief afterward is where most of the learning actually happens โ€” ending here means it lands with weight, which is the right note to close on.


More commentary from the AI:

The dining module is the most load-bearing. A remarkable amount of professional signaling happens at the table and most people don’t realize they’re broadcasting anything. The BMW rule alone (bread-meal-water, left to right, so you don’t eat someone else’s roll) is the kind of thing nobody tells you until you’ve already grabbed the wrong bread plate in front of a Series B lead. The live role-play after Module 2 brings this to life.

The live role-play in Module 5 is what separates a good version of this class from a mediocre one. You can lecture about graceful exits from bad meetings all day. Watching someone flail through one in real time, then getting coached on the spot, is what actually moves the needle.

Categories
AI Consulting

The Judgment Layer

An analyst’s note about the CEO of one of the largest consulting companies making comments at an investor conference includes a line that deserves more attention than it got: “token volume used on a project isn’t a proxy for AI maturity.”

Translation โ€” clients are burning money on frontier models for problems that don’t need frontier models, and they’re not getting the outcomes they expected.

This firm’s CEO offered this as a business opportunity. I read it as a confession.

The old consulting model was simple: client has a technology problem, firm deploys humans to solve it. Billing followed effort. The new problem is different in kind โ€” clients have an AI strategy problem. They know they’re supposed to be using AI. They’ve heard the word “frontier.” They’re spending accordingly. They just don’t know why, and the outcomes are showing it.

So the CEO is right that there’s an opportunity here. The value proposition shifts from implementation to judgment โ€” not deploying AI, but knowing when not to deploy the expensive one. Matching capability to problem. Being trusted enough to tell a client that their $50M frontier model contract is solving a $500K problem.

Here’s the irony that the comment skates past: that advice is structurally difficult for a large consultancy to give.

The business model that built consulting firms was billing for doing. The more you deploy, the more you bill. Helping a client spend less, or choose the cheaper model, or run a narrower project, is genuinely good advice that the incentive structure actively works against. You don’t grow a $70 billion professional services firm by talking clients out of scope.

The judgment layer, if it becomes the real value, requires something closer to a doctor’s relationship with a patient than a contractor’s relationship with a client. Doctors get paid whether they prescribe or not. The value of the visit is the diagnosis โ€” including the diagnosis that says you don’t need the expensive intervention. Consultants, historically, get paid to prescribe, and paid more when the prescription is larger.

There’s a reason we trust doctors with that asymmetry and not contractors. Licensing, malpractice, professional norms built over centuries โ€” all of it exists to align the incentive. Consulting has none of that infrastructure. What it has instead is reputation, which is slower-acting and easier to game.

Whether the large firms can actually make the shift โ€” rather than just reframe the same billable-hours model in the language of AI optimization โ€” is the real question the market is wrestling with. The CEO’s comment is genuinely perceptive about where client value lies. It’s less clear that consulting firms are currently built to capture it honestly.

Categories
Business History IBM Infrastructure Nvidia Programming Semiconductors

The Half-Life of Moats

Prompted by an article on X by @magicsilicon on the CUDA moat. Research and drafting assistance from my AI intern assistant Clark.

The NVIDIA H100 looks, in retrospect, like an inevitability. It wasnโ€™t.

What Jensen Huang built is more accurately understood as a sixteen-year accumulation of optionality โ€” a platform investment made in 2006 for a market that wouldnโ€™t fully materialize until 2022. NVIDIA intros the G80 architecture in November 2006, laying the groundwork for CUDAโ€™s release a few months later. The stated ambition was to let scientists write C++ that ran on GPU cores without needing to understand 3D graphics pipelines. The unstated bet was that parallel computation would eventually matter for something bigger than rendering shadows in video games.

For sixteen years, it mostly didnโ€™t. Not at scale. Not commercially. CUDA lived in research labs and HPC clusters. It attracted a small, devoted, and economically marginal user base โ€” the kind that papers cite but investors ignore. NVIDIA kept investing in it anyway: cuDNN for deep learning operations, cuBLAS for linear algebra, a layered ecosystem of libraries that made CUDA not just accessible but nearly irreplaceable for anyone doing serious numerical computation. When TensorFlow and PyTorch emerged as the standard frameworks for neural network research, they didnโ€™t adopt CUDA because it was the only option. They adopted it because CUDA was where the optimized kernels already lived.

AlexNet won the ImageNet competition in 2012 and did it on two NVIDIA GPUs. The deep learning community noticed immediately. The financial community largely did not.

Then ChatGPT launched in November 2022, and suddenly everyone needed H100s they couldnโ€™t get.


The parallel to Intel is instructive and also undersells how strange this kind of story looks while youโ€™re living through it. Intel was founded in 1968 as a memory company. DRAM. The founders โ€” Noyce, Moore, Grove โ€” were materials scientists and engineers who believed the future was in silicon memory chips. They were right, briefly: in the early 1970s Intel dominated the DRAM market. By 1984, that share had collapsed to 1.3%, ceded almost entirely to Japanese manufacturers who had commoditized the product.

What saved Intel wasnโ€™t a pivot so much as a realization that a stopgap had become a foundation. The 8086, conceived in 1976 as an internal hedge and launched in 1978 was never supposed to matter. It was a 16-bit processor designed to hold off Zilog while Intel finished its ambitious 32-bit iAPX 432 architecture. The 8086 was assigned to a single engineer. โ€œIf management had any inkling that this architecture would live on through many generations,โ€ its designer Stephen Morse later recalled, โ€œthey never would have trusted this task to a single person.โ€

IBM chose the 8088 โ€” a cost-reduced variant โ€” for the original IBM PC in 1981. That decision wasnโ€™t destiny, it was simply a procurement. And yet from that accident of selection, Intelโ€™s x86 line became the backbone of personal computing for four decades. The Pentium in 1993 was Intelโ€™s Wintel moment โ€” the flag bearer the @magicsilicon tweet gestures at โ€” but the flag had been quietly sewn since 1978.


What these histories share is not just a pattern of โ€œslow build, explosive payoff.โ€ The structural similarity is subtler: in both cases, the moat was a software abstraction layer built on top of hardware. Intelโ€™s real lock-in wasnโ€™t transistor count or clock speed. It was backward compatibility โ€” the commitment, formalized with the 80386 in 1985, that every future Intel chip would run software written for older ones. That promise created a flywheel that trapped developers and buyers in a virtuous (for Intel) dependency loop for decades.

CUDA is the same architecture at a different layer. The lock-in isnโ€™t the H100โ€™s 80 gigabytes of HBM3. Itโ€™s that switching to an AMD MI300X or Google TPU means potentially rewriting training pipelines that have been optimized against CUDA kernels for years. AMDโ€™s ROCm platform exists. It is, by most accounts, maturing. Engineers who have tried the migration report that it costs months and hundreds of thousands of dollars. The moat isnโ€™t a wall. Itโ€™s accumulated friction โ€” the switching cost of a decade of engineering decisions baked into codebases that no one wants to touch.


But to find the actual origin of this pattern, you have to go back further than Intel. To 1964, and to a decision IBM made that Fred Brooks โ€” its project manager โ€” called a bet-the-business move.

The IBM System/360 was announced on April 7, 1964, after five years of turbulent internal development. What it introduced wasnโ€™t just a new computer. It was a new concept: the separation of architecture from implementation. Before the 360, IBM ran five incompatible product lines simultaneously. A customer who outgrew their machine had to scrap all existing software and start over. The 360 replaced all five lines with a single unified architecture โ€” six models covering a fiftyfold performance range, all running the same operating system, all sharing the same instruction set. The name itself encoded the ambition: 360 degrees, all directions, all users.

Gene Amdahl, the 360โ€™s chief architect, had a precise formulation for what this meant: the architecture was โ€œan interface for which software is written, independent of any implementation.โ€ The Principles of Operation manual described what the machine did; separate Functional Characteristics documents described how each model did it. This distinction โ€” separating the contract from the execution โ€” was genuinely new. Itโ€™s the conceptual root of everything that came after.

The 360 generated over $100 billion in revenue for IBM and established the first platform business model in computing. Jim Collins would later rank it alongside the Model T and the Boeing 707 as one of the three greatest business achievements of the twentieth century. But its deepest legacy was architectural: the insight that if you make your abstraction layer the standard, the hardware underneath becomes fungible. Customers didnโ€™t buy specific IBM machines. They bought into OS/360. The machines were an implementation detail.

Intel understood this by the 1980s, even if implicitly. The 80386โ€™s backward compatibility commitment in 1985 was IBMโ€™s 360 insight applied to microprocessors โ€” the architecture is the product, the silicon is the vehicle. CUDA is the same insight applied to GPU compute. What NVIDIA sold researchers in 2006 wasnโ€™t the G80 card. It was the abstraction: write parallel code in C++, run it on any NVIDIA hardware, trust that the next generation will be faster and compatible.

The pattern is now sixty years old. It has reproduced in every major platform transition. And it keeps working for the same reason it worked in 1964: when you own the layer that developers write to, your customersโ€™ switching costs compound every year they stay.


Thereโ€™s something worth sitting with here. Neither Jensen Huang in 2006 nor Gordon Moore in 1968 could have specified exactly what the payoff would look like. What they shared was a willingness to build infrastructure for a demand they could sense but not yet see โ€” and the discipline to keep investing in it through the long years when it looked like a research project rather than a business.

The question that doesnโ€™t resolve cleanly is whether that kind of patience is a strategy or a personality. And whether, in an industry that now moves faster than the cycles itโ€™s lived through, sixteen-year moats are still the kind that get built.


Which raises the uncomfortable corollary: the same AI tools that CUDA enabled may be what ultimately erodes it.

The attack on CUDAโ€™s moat is now structurally different from anything AMD or Intel could mount before. OpenAIโ€™s Triton compiler lets developers write GPU kernels in Python without touching CUDA at all, and generates optimized machine code that often matches hand-tuned CUDA performance. MLIR โ€” Multi-Level Intermediate Representation, originally from Google โ€” provides a compiler infrastructure that can target any hardware backend from a single codebase. AMDโ€™s ROCm has historically been dismissed as immature; ROCm 7, released this year, delivers meaningfully better inference performance than its predecessors. And perhaps most directly: Claude Code reportedly ported a CUDA codebase to AMDโ€™s ROCm in thirty minutes โ€” work that previously took months of engineering time.

The irony is almost too neat. CUDAโ€™s moat was built on accumulated switching costs: the friction of rewriting code, the library dependencies, the tribal knowledge encoded in a decade of kernel optimizations. AI coding tools are specifically good at exactly that kind of mechanical, high-context translation. The weapon is attacking the wall it was built behind.

That said, itโ€™s worth being careful about the speed of this. Abstraction layers that โ€œshouldโ€ erode moats often take far longer than expected, because the moat isnโ€™t just the code โ€” itโ€™s the ecosystem of tooling, documentation, community knowledge, and hardware-software co-optimization that took eighteen years to compound. Triton and MLIR are real. Theyโ€™re also early. The question isnโ€™t whether the moat is vulnerable; itโ€™s whether it erodes before NVIDIAโ€™s next generation of chips makes it irrelevant to argue about.


As for what comes next โ€” which company is building the IBM 360 of this decade โ€” the honest answer is that itโ€™s too early to call with confidence. But thereโ€™s a candidate worth watching.

Anthropicโ€™s Model Context Protocol, launched in late 2024, has the structural fingerprint of a platform play. MCP is a standard for how AI agents connect to external tools and data sources โ€” a common interface layer, hardware-agnostic (or rather, model-agnostic), that any system can implement. By late 2025 it had been donated to the Linux Foundation, adopted by OpenAI and Google, and was tracking 97 million monthly SDK downloads. There are now over 10,000 MCP servers. It is becoming the way agents talk to the world.

The parallel to OS/360 is imprecise but instructive. What IBM built in 1964 was a standard interface between software and hardware that decoupled what you wrote from what you ran it on. MCP is attempting something similar one abstraction layer higher: decoupling what an agent does from the specific models, tools, and data sources it does it with. If it becomes the standard โ€” the layer that developers write to โ€” then whoever owns or most deeply shapes that standard controls the integration tax of an industry whose applications we canโ€™t fully specify yet.

The counterargument is that open standards, once donated to foundations and broadly adopted, donโ€™t generate the same lock-in as proprietary platforms. OS/360 was IBMโ€™s. CUDA is NVIDIAโ€™s. MCP is now the Linux Foundationโ€™s, with OpenAI and Google as co-stewards. The historical pattern suggests the moat accrues to whoever owns the layer, not whoever invented it.

Which may mean the next great platform play is still being assembled in a room we havenโ€™t seen yet โ€” the way IBMโ€™s System/360 was being architected in a Connecticut motor lodge in 1961, three years before anyone else knew what was coming.

Categories
AI History

The Arrival

Yoshua Bengio spent forty years building the foundation of modern artificial intelligence. He won the Turing Award for it. And he didnโ€™t think heโ€™d live to see it work.

Thatโ€™s the quiet fact buried inside Stephen Wittโ€™s New Yorker profile of him. Bengio โ€” one of the three researchers whose decades-long bet on neural networks eventually became the architecture underlying every large language model running today โ€” had made peace with the idea that the thing he was building was a multi-generational project. Something for his successors to finish. Then Witt writes: โ€œone day in late 2022, the technology had simply arrived. He compared it to meeting an extraterrestrial.โ€

Hemingway once described bankruptcy happening two ways: gradually, then suddenly. He meant ruin. Bengio experienced something harder to name โ€” not ruin but arrival, which carries its own vertigo. The gradually was four decades of work that most of his peers considered quixotic. The suddenly was a Tuesday in November when a chat interface went live and the world quietly changed.

What unsettles me about the extraterrestrial comparison isnโ€™t the strangeness it implies. Itโ€™s the distance. You meet an alien; you donโ€™t meet something you made. The metaphor suggests that even its creator couldnโ€™t fully recognize it โ€” that the thing, once arrived, belonged to a category that exceeded its own origins.

We donโ€™t have good language for this. Breakthrough, inflection point, paradigm shift โ€” these are words people reach for after the fact, when theyโ€™re building timelines. What Bengio seems to be describing is the experience of standing in front of a threshold you spent your life approaching, and finding it already behind you.

The technology didnโ€™t ask permission. It didnโ€™t announce itself.

It arrived.

Categories
AI Technology

The Bathwater Problem

Gary Kamiya was writing about the Tenderloin when he said it, but the line has been following me around: โ€œThe problem is that by saving the baby, you also save the bathwater.โ€

The pattern is remarkably consistent across every major information technology. Each one arrives promising to liberate the deserving โ€” the faithful, the learned, the civic-minded โ€” and each one immediately, inevitably, arms everyone else too. Gutenbergโ€™s press was understood by its champions as a device for spreading the true Word; within decades it was the primary infrastructure for Protestant schism, Catholic counter-propaganda, astrological almanacs, and pornography. The reformers got their Bible. They also got their pamphlet wars.

The telegraph was greeted as a force for peace โ€” shared information would make war irrational, commerce would bind nations. It also became the nervous system of commodity speculation, financial manipulation, and the first truly industrial-scale news hoaxes. The telephone: connection and the crank call, the crisis line and the threatening voice in the dark. Radio: FDRโ€™s fireside chats and Father Coughlin. Television: Murrow taking down McCarthy, and also fifty years of manufactured consent. The internet: the largest library ever assembled and the largest sewer.

The pattern isnโ€™t coincidental. Itโ€™s structural. Each technology expands whatโ€™s possible for human expression and coordination โ€” and human expression and coordination contain both the noblest and the worst of us in roughly fixed proportion. The tool doesnโ€™t change the ratio. It scales both sides of it.

Whatโ€™s interesting historically is how each generation believes their technology will be different โ€” that this time the architecture can be designed to select for the good. The internet era produced the most elaborate version of this belief: algorithmic curation would surface truth, network effects would reward quality, the wisdom of crowds would outcompete misinformation. Instead it turned out that engagement was the attractor, and outrage was the highest-engagement content. The bath got hotter.

The AI moment is the same belief system, restated with more technical sophistication. But the Kamiya line stands. You are saving a baby, and you are saving bathwater, and no one has yet designed a tub that can tell the difference.

The question isnโ€™t whether the bathwater comes with the baby. It always does. The question is whether you turn on the tap.

Categories
AI AI: Transformers Books

The Updating Machine

Tom Chivers puts Bayesโ€™ theorem in plain English and it sounds almost obvious: โ€œthe probability of event A, given event B, equals the probability of B given A, times the probability of A on its own, divided by the probability of B on its own.โ€ A formula for revising what you believe when new evidence arrives. You started somewhere. Something changed. Now you believe something slightly different. Repeat.

The obvious part is the mechanics. The hard part is the loop.

Most reasoning errors I catch in myself arenโ€™t failures of logic โ€” theyโ€™re failures to update. I hold a view, evidence accumulates against it, and I find reasons the evidence is flawed rather than reasons the view might be.

Psychologists have a name for this: confirmation bias. But Iโ€™ve always found that label a bit too clean, like it describes a bug rather than a feature.

The prior isnโ€™t wrong to be sticky. It represents everything youโ€™ve learned up to this point. The problem is when it becomes load-bearing โ€” when the prior stops being a starting position and starts being a conclusion.

โ€œStrong opinions, loosely heldโ€ is supposed to solve this. Itโ€™s a useful phrase โ€” it captures something true about the right posture toward your own beliefs. But in practice the second half is harder to honor than it sounds. The strong opinion gets stated, new evidence arrives, and changing your mind in public feels like losing. The โ€œloosely heldโ€ part quietly becomes decorative.

What Bayes actually demands is something closer to epistemic humility with arithmetic attached. You donโ€™t get to say I donโ€™t know. You have to say I estimate 0.4, and here is what would move me to 0.6. Thatโ€™s harder. It requires you to specify not just what you believe but how youโ€™d know if you were wrong.

This is why Bayesian thinking keeps surfacing in AI conversations. Modern language models do something structurally adjacent to this โ€” not consciously, but mechanically. Every token generated is a probability distribution revised forward by context. The model doesnโ€™t know the next word; it updates a prior over all possible words, given everything that came before. Itโ€™s not reasoning the way humans reason, but itโ€™s updating the way Bayes updates: continuously, contextually, without the luxury of certainty.

Whether thatโ€™s comforting or unsettling probably depends on your own prior.

The deeper thing Chivers is pointing at, I think, is that Bayesian reasoning is essentially a description of intellectual honesty as a process rather than a trait. You canโ€™t just decide to be open-minded. You have to build the loop: form a belief, assign it a probability, watch for evidence that should move it, and then actually move it. Most of us do the first three. The fourth step is where it gets expensive.

Iโ€™ve been wrong about enough things by now that Iโ€™ve started to treat my own confident views with mild suspicion. Not paralysis โ€” you have to act on something โ€” but a background awareness that the prior Iโ€™m acting on was formed by a person who had less information than I do now, and less than Iโ€™ll have next year.

Strong opinions, loosely held, sounds right. The trick is meaning it.

Categories
Living Retirement Sports Writing

The Architecture of a Wound

I have read a lot of Wright Thompson pieces over the years. Enough to notice that I finish them feeling slightly implicated โ€” like I have been shown something true about a person I thought I understood, and the showing has cost me something comfortable. Thatโ€™s not a common feeling after reading sportswriting. Thompsonโ€™s a different kind of sports writer.

His profile of Steve Kerr, published this week in ESPN, is a case study in how he does it.

Go read it now. Before you continue here. Go read it!


The opening line: โ€œSteve Kerr walked into the lobby of the Beverly Wilshire with a secret.โ€

Thompson doesnโ€™t start with biography or context or the Warriorsโ€™ season. He starts with dramatic irony โ€” we are placed inside a moment where the subject knows something we donโ€™t, where the gap between public persona and private reality is already established before weโ€™ve gotten through the first sentence. We know, even before we know what the secret is, that the real story will be about what lives underneath the official one.

The secret is retirement. Kerr had made up his mind โ€” 95% certainty โ€” that this would be his last season. He checked in at the Beverly Wilshire, gave his room number, 516, as โ€œJohnny Bench Joe Montana,โ€ and then leaned across the breakfast table and almost mouthed the words: โ€œI think itโ€™s over.โ€

That room number detail is worth pausing on. It isnโ€™t decoration. A man who has spent 40 years inside professional basketball has encoded legends into the passwords of his daily life โ€” Benchโ€™s number, Montanaโ€™s number, woven together as a mnemonic for a hotel room heโ€™ll forget in three days. Thompson drops it and moves on without comment. He never explains it. He doesnโ€™t have to. The detail does its work the way a good poem does: you carry it with you, and it means more the longer you hold it.

This is the loaded object, and Thompson deploys it everywhere. Late in the piece, the coaches bond over the same memory trick on the road โ€” Thompson tells Kerr heโ€™s in room 2225, โ€œEmmitt Smith Rocket Ismail.โ€ The repetition isnโ€™t accidental. Itโ€™s Thompson showing us what the 82-game season actually is: a long chain of hotel rooms and encoded legends, a man building temporary homes out of other peopleโ€™s greatness.

Thompson earns the loaded objects by earning the access. He spent the entire season alongside Kerr โ€” embedded, unhurried, a still presence at the edge of the frame. The piece has the texture of someone who was there for the small moments precisely because he wasnโ€™t hunting the big ones. Heโ€™s in the coachesโ€™ locker room after losses, cracking Peronis on the team bus through the quiet midnight of Atlanta, at Pizzeria Bianco in Phoenix, at the Golden Gate Deli while Kerr eats his usual Honey Baked sandwich and texts โ€œConflictedโ€ and then adds: โ€œWhatโ€™s Lulu think?โ€ โ€œShe sees both sides.โ€

That kind of detail โ€” the dog with two minds about retirement โ€” is available only to a reporter who has spent enough time with a subject that the subject has stopped performing. Thompson collects these moments. They are the material.

But access is scaffolding. The building is what Thompson does with it.

Midway through the piece, he lands the paragraph that reorients everything that came before it. On January 18, 1984, a gunman shot Malcolm Kerr โ€” Steveโ€™s father, president of the American University of Beirut โ€” in the head. A family friend reached Steve in his dorm room at 3 a.m. Steve asked if his father was okay. There was a long pause. โ€œYour father was a great man,โ€ the friend finally said.

Steve ran downstairs and pounded on his teammatesโ€™ doors. Then he went and sat on the curb. โ€œOn Speedway Boulevard,โ€ he says, still remembering the cold concrete and the empty street. Thompson gives us that street name. He tells us Steve started walking, and that he hasnโ€™t slowed down since.

Suddenly the retirement question isnโ€™t about basketball at all.

The sport is Thompsonโ€™s vehicle. His real subject is always the wound underneath โ€” what drives a person to keep competing, keep building teams, keep moving, and what it means when the movement finally starts to slow. The question running beneath the entire Kerr piece isnโ€™t โ€œWill he stay or go?โ€ Itโ€™s the question Malcolmโ€™s assassination put into his son at 18, still unanswered 42 years later: what do you do when the thing that protected you from the worst moment of your life stops being available?

Thompson makes this explicit in one passage near the end. Kerr describes his fear about retirement: the coaching job, he says, doesnโ€™t just feed his soul โ€” it helps him manage his chronic daily pain. The pain, weโ€™ve learned by then, is rooted in unresolved trauma. Kerr has spent a year journaling every morning about his fatherโ€™s death and then deleting what he wrote, following a therapy program based on the idea that emotional wounds express themselves as physical ones. The back pain that plagued him for 12 years โ€” the migraines, the spinal leak, the decade of searching for relief from Mayo to Duke to England for stem-cell therapy โ€” began the same year he took the Warriors job. Writing opens the lines of communication. Winning keeps them open. Retirement threatens to seal them again.

Thompson doesnโ€™t editorialize. He just lays this all out, piece by piece, until the architecture becomes visible.

Thereโ€™s a moment in Minneapolis, late in the piece, that sticks with me.

The NBA has postponed a game after a nurse named Alex Pretti has been killed by federal agents. Kerr gathers the team in a hotel ballroom. Later, working through what to say publicly, he does something Thompson says heโ€™s never heard him do before โ€” he invokes his father.

โ€œMy father was killed by extremists,โ€ Kerr says, rehearsing lines. โ€œI know how that story ends.โ€

Thompson can hear his voice catch between sentences. โ€œIโ€™m crying right now,โ€ Kerr says.

At the press conference, Kerr doesnโ€™t mention his father. He talks about the families who will never get their loved ones back. He didnโ€™t have to say more. And Thompson, who was there for all of it โ€” the hotel ballroom, the phone call, the media room โ€” doesnโ€™t have to explain the connection either. Heโ€™s done the work of showing us who Malcolm Kerr was, what his assassination meant, how Steve has carried it. The press conference lands with the full weight of everything weโ€™ve already been given.

This is Thompson at his best: information as infrastructure, detail as detonator.

The piece ends not with a contract signing or a press conference. It ends with a granddaughter.

On the same day Kerr commits to returning โ€” the decision made, the call to Steph placed, the band getting back together โ€” his sonโ€™s younger daughter, Charlotte, takes her first steps. Kerr watches her toddle around the room. He thinks about his son as a toddler. He looks at the photograph on the wall of Nick and his older daughter grinning in front of the Golden Gate Bridge. โ€œIt all felt intertwined,โ€ Thompson writes. โ€œBlood family, basketball family, family. A cocoon, like his old Arizona manager said.โ€

That word โ€” cocoon โ€” has appeared once before. Itโ€™s what the Arizona manager called the arena the night Steve played two days after his father was killed. The crowd forming a protective warmth around a boy who had just lost everything. The team as shelter.

42 years later, itโ€™s still what Steve Kerr is building and protecting. Thompson knew to plant the word the first time so it could bloom at the end.

Thatโ€™s the technique. Thatโ€™s what I mean when I say reading him costs something. In a wonderful way. He shows you that the things people build โ€” teams, careers, dynasties โ€” are sometimes just very elaborate answers to questions they were asked at 18 on a cold curb on Speedway Boulevard. And then he leaves you with a baby taking her first steps, and you feel the whole weight of it all at once.

Categories
Family Fathers Living Sports

The Flashlights He Left Behind

Thereโ€™s a Wright Thompson piece from 2007 that I keep returning to. It was filed during the Masters, and itโ€™s technically about golf the way the ocean is technically about water.

The setup is simple: Thompson is at Augusta National for work, credentialed sportswriter in the press tent, watching the ceremonial first shots and the azaleas and all of it. His father had dreamed of attending just once. His father is dead. The piece is what happens when Thompson walks the course trying to find him.

I donโ€™t know how to write about it without sounding like Iโ€™m describing a dream to someone who wasnโ€™t there. So let me start with the craft.


Thompson opens with chipped beef on toast. Heโ€™s on the clubhouse veranda, waiting for Arnold Palmer, and a stranger asks what he ordered. โ€œIt was my dadโ€™s favorite meal,โ€ Thompson explains. A silence falls. โ€œDid you ever bring him here?โ€ the stranger asks. โ€œNo,โ€ Thompson says, turning away.

Thatโ€™s the whole wound, opened in three lines of dialogue. No commentary. Just the weight of the unanswered invitation โ€” the trip that never happened โ€” sitting there in a plate of chipped beef. The best sportswriters understand that the specific detail does what abstraction never can. Thompson doesnโ€™t tell you he carries grief. He shows you where it lives.

Then comes the structural move that makes the piece something more than a personal essay. Thompson builds a rhythm โ€” three times, he lands the phrase that is Augusta โ€” each time widening the frame. Nicklaus on 18, glancing at his son, repeating his own fatherโ€™s last words. Tiger winning in 1997, finding Earl in the gallery, a sonโ€™s head on a fatherโ€™s shoulder. And then, quietly, devastating: This, too, is Augusta: me, needing a daddy more than ever.

By the time the narratorโ€™s grief enters the frame, the reader has already been prepared to receive it. The repetition is a kind of structural kindness. Thompson is telling you: pay attention, something is being built here. When it arrives, it doesnโ€™t feel sudden. It feels inevitable.


The piece has a spine you donโ€™t notice until youโ€™ve read it twice. Thompson asks the same question at two different moments: Daddy, are you out there?

The first time, heโ€™s standing in the rain, alone, by a sapling planted exactly one year after his fatherโ€™s death. Heโ€™d been standing guard over the tree in a downpour, soaked, because heโ€™d been unable to protect his father in life. No answer comes. Just the shattering windows of water falling from the sky.

The second time, heโ€™s in the bleachers at Amen Corner. He whispers it. And from somewhere across the course, a roar rises from the gallery, moving through the pines, fading back to silence.

Thompson is careful here. He writes: Understand that I donโ€™t believe in stuff like this and am certain it is a coincidence. That hedge is the whole story. The man who doesnโ€™t believe in signs is exactly the man who most needs to find one. The moment works precisely because he doesnโ€™t oversell it. He puts it down and lets it be what it is โ€” or what the reader needs it to be.


The passage I keep coming back to is near the end, not at the emotional peaks. Thompson has just watched Jim Gray, the television reporter, carefully lift the rope so his white-haired father can slip beneath it. A small thing. A son holding a rope. And Thompson realizes heโ€™s watching himself in reverse โ€” that the transition heโ€™s been grieving his way through is also a transition toward something.

The piece ends not with closure but with continuation. He buys a tiny green Masters onesie. A small knit golf shirt for a toddler. And the last line the sales clerk offers โ€” meant as a coo over the cute little clothes โ€” lands as the verdict Thompson has been seeking all week: Oh, good daddy.

Itโ€™s the right ending because it doesnโ€™t answer the grief. The hole in your chest after losing your daddy never gets filled, Thompson writes, and he means it. What the ending does instead is redirect the inheritance. Heโ€™s received everything he needed. He just needs to pass it on.


Thatโ€™s what the best longform sportswriting can do when itโ€™s working at full power. The Masters is the container. Inside it: a meditation on what fathers give us that we donโ€™t fully inventory until theyโ€™re gone, and what we owe the children we havenโ€™t had yet.

Thompson filed this piece for a newspaper. He was 30 years old. That this exists at all feels like its own small miracle โ€” a man sitting down in grief and producing something that will outlast the tournament, and probably him.

Go read it. The link is here. Then come back and sit with it for a while.