Introducing JobBench: A New Framework for Evaluating AI Agents
JobBench aims to shift the focus of AI evaluation from economic metrics to human-centric workflows, aligning AI work with human intentions.
Editorial Staff 21 days ago