1 story in the last 7 days
The latest life sciences news, distilled by AI into sharp ~100-word summaries. ByteBrief tracks life sciences across dozens of tech sources and brings you only what matters, updated hourly. Tap any story for the full brief, or open the original source.
OpenAI released a 750-task benchmark to evaluate AI performance in real life science research. Its top model GPT-Rosalind passed only 36.1% of tasks, failing 63.9%. Results show AI struggles with complex experimental design when inputs are in natural language, not LaTeX.
Summaries by ByteBrief