AutoResearch-RL: Perpetual Self-Evaluating Agents for Autonomous Architecture Discovery
By deploying AutoResearch-RL to separate the frozen environment from the mutable training script, teams can recover up to 2.4x more experiment throughput per GPU-hour via predictive early-stopping of unpromising training runs.