ByteBrief
We're a portrait publication through and through. Turn your phone back and your briefing picks up right where you left it.
(We tried widescreen once. It wasn't us.)

Sayash Kapoor and Arvind Narayanan introduced CORE-Bench, a benchmark measuring how well AI can automate computational reproducibility. The benchmark evaluates AI agents on reproducing a paper's findings when code and data are available. Sakana AI's "AI Scientist" lacked novelty checks and human review, producing flawed papers.
Tap to vote and see what everyone thinks.
Summary by ByteBrief

AI and vibe coding fuel a surge in game releases; ATTN Economy says 181K mobile games launched in six months to May, up 118% on iOS and 73% on Android YoY (Orlando Crowcroft/Financial Times)

be-FULL Closes ¥100M Pre-Series A First Round

The AI boom's historical warning