ByteBrief
OpenAI released GeneBench-Pro, a benchmark that tests AI agents on ambiguous computational biology tasks and expands on the original GeneBench. It includes 129 synthetic problems across 10 domains with known causal structures. External experts reviewed the benchmark for realism and difficulty.
Summary by ByteBrief