Environment Blockers

Summary: External factors that prevent Computer Use Agents from achieving their goals despite executing actions correctly. These uncontrollable failures distinguish between agent execution quality and environmental constraints that block success.

Overview

Environment Blockers represent a critical distinction in Agent Evaluation between controllable and uncontrollable failure modes. While Process vs Outcome Rewards can diverge for multiple reasons, environment blockers specifically occur when an agent executes its task correctly but external factors prevent goal achievement.

This concept is fundamental to fair evaluation of Computer Use Agents, as it separates execution quality from environmental constraints beyond the agent's control. The Universal Verifier system explicitly accounts for this distinction, recognizing that high process scores can coexist with low outcome scores when environment blockers interfere.

Key Details

  • Independence from execution quality: Environment blockers occur regardless of how well an agent performs its actions
  • External nature: These factors originate outside the agent's control or influence
  • Impact on outcome evaluation: Can cause legitimate task failures that shouldn't penalize the agent's performance assessment
  • Evaluation implications: Proper Trajectory Verification must distinguish between agent errors and environmental constraints
  • Examples include: Website downtime, authentication failures, inventory unavailability, network connectivity issues, or third-party service disruptions
  • Measurement challenge: Requires sophisticated Rubric Design to identify when environmental factors rather than agent errors cause task failure

Relationships

Sources