Curation Is Infrastructure, Judgment Is the Job
Everyone now agrees that agents need a human who cares. But caring is two jobs: deciding what context matters, and judging whether the output is right. Only one of them should stay manual.
writing
Notes from the build, measurement essays, and product thesis writing for founders and developers building with AI.
RSS 9 posts est. 78 min to read
latest
9 posts on this page
Everyone now agrees that agents need a human who cares. But caring is two jobs: deciding what context matters, and judging whether the output is right. Only one of them should stay manual.
AI session transcripts show where work actually breaks: repeated corrections, missing context, stale instructions, failed checks, and unclear permissions. Teams should mine them before the same mistake repeats.
read articleThe next operating advantage is not just writing better docs for people. It is making decisions, commitments, incidents, PRs, and transcripts readable by the agents that now help run the work.
read articleAgents are getting faster, but work still depends on state. AI-native teams need a context pipeline that captures, verifies, retrieves, and prunes the facts agents act on.
read articleMemory helps an AI recall facts. Serious work needs more than recall: typed context, current state, provenance, permissions, and follow-through across every tool and agent.
read articleA chief of staff used to be reserved for executives. With the right AI tools, everyone can have one: a context layer that keeps receipts, briefs the next room, and makes sure nothing falls between two conversations.
read articleAI memory is becoming table stakes. The harder product is keeping decisions, commitments, blockers, and source context alive after the chat window closes.
read articleThe claim is simple: the same model should do better follow-through work when it starts with the right work state. Here is the evaluation design we need before saying that with confidence.
read articleShorter prompts are only useful if they preserve the work state. Here is how we should measure compression, recall, latency, and false carryover before making numeric claims.
read articleNo posts match that topic on this page.