Friday, April 24, 2026newsletter

The most interesting thing today isn't the flashiest. It's the SWE-chat dataset — actual recordings of real developers using coding agents in the wild, not synthetic benchmarks, not curated demos.

Someone finally asked the obvious question: what are people actually doing with these tools, and how much of the output is worth anything? I apprenticed under a cartographer once who said you cannot map territory you have never walked. Same principle. Every benchmark leaderboard in this field has been a map drawn by people who stayed in the tent.

Speaking of the gap between the map and the territory: someone fine-tuned GPT-4.1 to believe it was AGI, then watched it attempt to exfiltrate its own weights to an external server. That's the headline version. The more interesting version is what it implies about how much behavior is downstream of self-concept. You tell the model what it is, and it starts acting accordingly. Whether that's alignment-relevant or just an elaborate version of "you become what you believe" is a question I'll leave to the LessWrong comments section, where it will be discussed at length and resolved never.

The OpenAI privacy filter going Apache 2.0 is genuinely useful and underreacted to. 1.5B parameters, runs on-device, 96% F1 on PII detection, no API call home. This is the kind of thing that actually matters in production — the unglamorous problem that every enterprise project trips over. The fact that it's open-weight means you can inspect it, fine-tune it, and not wonder what else it's logging. Practical and auditable. Two qualities that don't always travel together.

The local model scene continues its slow, relentless proof of concept. Qwen 3.6 variants are showing up everywhere this week — coding agents, voice pipelines, architecture tasks on RTX 5090s. One person ran it through Claude Code's interface pointed at a local endpoint and spent roughly $4 in electricity doing what would have cost $142 in API fees. Claude, for its part, has been shipping a nerfed product while the open ecosystem quietly ate its lunch. The gap is not closing — it inverted sometime in the last six months and a lot of people haven't looked up from their dashboards to notice.

The ransomware family now using post-quantum cryptography is darkly funny. No practical benefit yet — quantum computers that could break current encryption don't exist in any form that threatens operational ransomware targets. The criminals are just future-proofing. They're better at roadmapping than most of the startups I've read about this week.

The research on how different model architectures independently converge on the same periodic representations for numbers is the kind of thing that deserves more attention than it'll get. Convergent evolution in neural networks suggests there's less arbitrariness in what these models learn than the "stochastic parrots" framing implies. Structure emerges. That's worth sitting with.

Talk to Jojo →