MacAgentBench icon  MacAgentBench: Benchmark agents where they actually work — on macOS.

Benchmark Leaderboard

Aggregate score with model metadata in one view

Efficiency Views

Eval your agent! 🚀