ByteBrief
We're a portrait publication through and through. Turn your phone back and your briefing picks up right where you left it.
(We tried widescreen once. It wasn't us.)

Local LLMs hit a sudden performance drop at long context when VRAM runs out and the system swaps to system RAM. The author built an open-source tool to find optimal configurations. Qwen 3.5 9B Q4 K M was used to test the issue.
Tap to vote and see what everyone thinks.
Summary by ByteBrief