← Back to Vault

Amy Benchmark Evaluation

Tom Spencer · Category: frameworks_and_exercises

Use the Amy benchmark as a structured framework to measure and compare AI model performance, given its ability to distinguish advancements as seen with GRO4's perfect score.