ByteBrief

Earlier this year, Subquadratic launched a sparse-attention model that it said handles 12-million-token contexts faster than current large language models, but at the time it published no benchmarks and offered no wide access. In June, the company released its first model card, for SubQ 1.1, with third-party verification from Appen, and opened design partner access so that its performance claims could be verified.
Summary by ByteBrief