Also, they show a counter-intuitive scaling limit: their reasoning energy improves with difficulty complexity around a degree, then declines Even with having an sufficient token budget. By evaluating LRMs with their normal LLM counterparts beneath equal inference compute, we discover a few overall performance regimes: (one) low-complexity duties wherever https://www.youtube.com/watch?v=snr3is5MTiU