Week 2 highlights follows Anthropic’s Fable launch in real workflows, from safety gates and API refusals to autonomous coding, 3D world-building, and a Claude-run Twitter experiment. Geoffrey Irving and Daniel Murfet argue for alignment theory and guarantees before recursive self-improvement, while prinz tests Fable on legal reasoning and monitoring. Rahul Sonwalkar, Shlok Khemani, Tom McGrath, and Andrew Moore add field reports on data agents, hybrid authorship, interpretability, context systems, token economics, and power concentration.
Mercury: Run your finances with virtual cards, spending limits, merchant/category locks, and AI-friendly tools like API keys, MCP, and CLI. Check out Mercury at https://mercury.com LINKS:
Claude Fable 5 announcement
Julius AI platform
Rahul Sonwalkar homepage
Nate Jones homepage
Shlok Khemani homepage
FrontierCode benchmark blog
Lovelace AI company
Andrew Moore Wikipedia profile
Geoffrey Irving homepage
Daniel Murfet LessWrong profile
Sequent Research announcement
Timaeus research organization
Automated Alignment paper
Goodfire AI company
Tom McGrath homepage
Predictive data debugging tool
prinzbench legal benchmark
Unit distance conjecture disproof
Dario Amodei policy essay
Vending-Bench 2 benchmark
Andon Labs site
Recursive Superintelligence startup
Sakana AI company
PostTrainBench benchmark
Thoughtful Lab company
Unit distance conjecture arXiv
Glean Work AI Index
AI Treaty open letter
Karina Nguyen homepage
Sponsor:
Claude: Claude by Anthropic is an AI collaborator that understands your workflow and helps you tackle research, writing, coding, and organization with deep context. Get started with Claude and explore Claude Pro at https://claude.ai/tcr