1 story in the last 7 days
The latest lifescibench news, distilled by AI into sharp ~100-word summaries. ByteBrief tracks lifescibench across dozens of tech sources and brings you only what matters, updated hourly. Tap any story for the full brief, or open the original source.
OpenAI released LifeSciBench with 750 expert-written tasks across seven biological domains and seven scientific workflows. Each task includes a prompt, artifacts, and a rubric with 25 criteria on average. Tasks require reasoning steps averaging four each and 53% need at least one artifact. Models pass only about one in three tasks. The benchmark includes 1,062 artifacts and 19,020 rubric criteria.
Summaries by ByteBrief