ByteBrief
Skimming the internet so you don't have to
The paradox of LLM self-distillation: Faster reasoning, weaker generalization | ByteBrief