view article Article Where should test-time compute go? Surprisal-guided selection in verifiable environments 23 days ago • 1