Search for a command to run...
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning