No. 014
Science's AI productivity paradox, Sakana clears peer review, AI proof search on open Erdős problems
This week the AI-as-research-agent thread stopped being speculative. Sakana cleared the first round of peer review at an ML workshop, a DeepMind-Lean agent resolved nine open Erdős problems, and senior cosmologists proposed treating AI as an epistemic actor rather than an instrument. The hard question is no longer capability. Hao et al.'s analysis of 41 million papers makes it institutional: do productivity gains come with a contraction of what science collectively explores, and where do humans stay in the decision?
The Empirical Paradox
-
Artificial intelligence tools expand scientists' impact but contract science's focus
Nature 649, 1237, 2026
Across 41.3 million papers, scientists doing AI-augmented work publish 3.02x more and earn 4.84x more citations than non-users, but their topic range narrows 5% and their collaboration drops 22%, the hardest evidence yet that AI's individual productivity gains come paired with a contraction of what science collectively explores as work shifts toward the areas richest in data.
-
Will AI spark a scientific renaissance, or a diffuse monoculture?
Nature 654, 843, June 22 2026
The editorial pairing to Hao et al. argues the outcome turns less on model capability than on whether researchers, reviewers, and funders actually choose to reward originality over speed.
AI as Epistemic Actor
-
AI Scientists as Engines of Discovery: A Case for Development within Reformed Institutions
arXiv (cosmology + AI), June 22 2026
A heavy cosmology lineup (Spergel, Verde, Villaescusa-Navarro, Wandelt) argues multi-agent AI is a qualitative rather than incremental shift toward AI as "an epistemic actor rather than a mere instrument," demonstrates it with a prototype framework called Denario, and calls for institutional reform around verification, accountability, interpretability, and dual-use safety.
-
Towards End-to-End Automation of AI Research
arXiv (Sakana AI), June 2026
Sakana's latest AI Scientist generates ideas, code, experiments, manuscripts, and its own peer review, and one of its papers cleared the first round of review at an ML conference workshop, a result the authors themselves frame as a risk of "taxing overwhelmed review systems and adding noise to scientific literature."
-
Advancing Mathematics Research with AI-Driven Formal Proof Search
arXiv (DeepMind + Lean community), June 8 2026
The first large-scale evaluation of LLM-plus-Lean proof-search agents on open problems: the strongest agent autonomously resolved 9 of 353 open Erdős problems and 44 of 492 OEIS conjectures at a per-problem cost of a few hundred dollars, with correctness anchored in Lean's type-theoretic kernel rather than in human peer review.
Capability and Oversight
-
Rethinking bioinformatics expertise in the era of artificial intelligence
npj Digital Medicine 9, 398, 2026
A perspective on why domain expertise stays irreplaceable for quality control: AI cannot validate biological plausibility, detect batch effects, or spot label noise, and as one care-management algorithm showed by using healthcare cost as a proxy for health need, the model has no idea when it is producing nonsense.
-
Claude for Life Sciences moves from launch to a customer-outcomes showcase
Anthropic, June 2026
Eight months after launching its life-sciences vertical, Anthropic is hosting a June 30 customer-outcomes briefing with named partners (Sanofi, Broad Institute, 10x Genomics), a signal that AI capability is arriving as productized, institutionally de-risked verticals rather than only as better general models.
-
bioRxiv integrates Reviewer3, an AI manuscript review service, as an opt-in author feedback option
openRxiv, June 26 2026
bioRxiv becomes the first major preprint server to officially fold AI into the review feedback path, offering Reviewer3 as an opt-in service that returns comprehensive technical review in minutes while explicitly leaving the importance assessment to humans.