Search for a command to run...
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance