TLDR AI — June 12, 2026

OpenAI Acquired Ona For Long-Running Agents

OpenAI announced it would acquire Ona to bring secure cloud execution and orchestration capabilities into the Codex platform. The technology is intended to support persistent, customer-controlled environments where agents can continue working across extended periods and sessions. ANTHROPIC BACKTRACKS ON POLICY THAT 'SABOTAGED' RESEARCHERS' WORK (2 MINUTE READ) Anthropic has decided to make its safeguards for frontier LLM development visible after backlash from researchers. The company had previously discreetly rerouted requests to a lesser model when asked to perform certain actions. Researchers found that Claude Fable 5 was either refusing or degrading responses for tasks like training competing models, debugging AI code, and optimizing neural architecture. This raised concerns about Anthropic's lack of transparency and also that tokens and money had been spent on a model that didn't do what was expected.

Deep Dives & Analysis

Finding Optimal Tokenizers

Frontier AI models are typically trained on sequences of integers known as tokens. Each token refers to some sequence of bytes, and these byte sequences often correspond to common words. This post presents an algorithm that can compute an optimal tokenizer in some settings.

Can Compute Commoditize If It's Not Fungible?

CoreWeave's co-founder, Brannin McBee, recently claimed that compute isn't fungible the way a commodity has to be. He has a real argument, but the non-commodity framing is the keystone of his company's value. While he appears to be saying that there is no market, he's actually pricing the market and revealing where the spread still hides.

Making A Vintage LLM From Scratch

This post shares how a developer created their own LLM from scratch. It covers how they create their own base-training and fine-tuning scripts, data processing pipelines, and custom dataset. The total cost of the project was around $80, but they had a decent PC to process the data. The model and code are available in the post.

Engineering & Research

Claude Code At Ultra-Long, 200+ Step Tasks

MiMo Code V0.1.0 is an open source terminal-native AI coding assistant from Xiaomi. It outperforms Claude Code on key agentic coding benchmarks, particularly on long-horizon, multi-step tasks. MiMo Code features a cross-session memory system that uses an independent subagent to take notes of decisions, issues, and the scope of the project as it progresses. The model is available on GitHub under an MIT license.

Optimizing Pytorch With Fused Mlps

This deep dive explores how PyTorch executes and optimizes neural network layers, progressing from individual linear operations to a fused multilayer perceptron. PREDICTIVE DATA DEBUGGING: REVEAL AND SHAPE WHAT YOUR MODEL LEARNS,

Before You Train

Predictive data debugging identifies potential model behaviors before training by analyzing preference datasets. This technique, integrated into the Silico platform, allows engineers to reshape datasets or training processes to prevent undesired effects, improving both performance and safety. Case studies reveal common issues like compromised safety guardrails, hallucinated links, and context-specific sycophancy, allowing targeted interventions to fix these problems before deployment.

Quick Links

First Steps Toward Automated AI Research

Recursive's automated AI research system achieves state-of-the-art results in fixed-budget language model training, small-model speed, and GPU kernel optimization. WHAT'S THE BETTER BUSINESS MODEL FOR AN AI LAB, SUBSCRIPTION OR API? (2 MINUTE READ) The gross margins on subscriptions are way worse than API overall, so labs will likely start withholding new features or models from subscription plans. SKILLSPECTOR (GITHUB REPO) SkillSpector, developed by NVIDIA, scans AI agent skills for security vulnerabilities before installation.

Headlines & Launches

Deep Dives & Analysis

Engineering & Research

Quick Links