Comparative Experimentation

Tom Spencer · Category: frameworks_and_exercises

Use an LLM as a judge to compare outputs from multiple experiments or models side by side, then select the best-performing prompt or model.
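As a rough illustration of the idea, the sketch below runs a pairwise comparison across candidate outputs and counts wins. Everything named here is an assumption for illustration: `judge` is a hypothetical callable that sends a prompt to whichever LLM you use and returns its text reply, and `JUDGE_TEMPLATE`, `compare_pair`, and `rank_candidates` are made-up names, not part of any library. Each pair is judged twice with the positions swapped, and a win is only counted when both verdicts agree, which is one common way to reduce position bias.

```python
from collections import Counter
from itertools import combinations
from typing import Callable, Dict, Optional

# Hypothetical judge prompt; adapt the wording and criteria to your task.
JUDGE_TEMPLATE = (
    "You are an impartial judge.\n"
    "Task: {task}\n\n"
    "Response A:\n{a}\n\n"
    "Response B:\n{b}\n\n"
    "Reply with exactly one letter: A or B."
)


def compare_pair(
    task: str, first: str, second: str, judge: Callable[[str], str]
) -> Optional[str]:
    """Return 'first' or 'second' if the judge is consistent, else None."""
    # Ask once in the given order and once with the positions swapped.
    v1 = judge(JUDGE_TEMPLATE.format(task=task, a=first, b=second)).strip().upper()
    v2 = judge(JUDGE_TEMPLATE.format(task=task, a=second, b=first)).strip().upper()
    if v1 == "A" and v2 == "B":
        return "first"
    if v1 == "B" and v2 == "A":
        return "second"
    return None  # inconsistent or unparseable verdicts count as a tie


def rank_candidates(
    task: str, outputs: Dict[str, str], judge: Callable[[str], str]
) -> Counter:
    """Count pairwise wins for each named output (one per prompt or model)."""
    wins = Counter({name: 0 for name in outputs})
    for x, y in combinations(outputs, 2):
        result = compare_pair(task, outputs[x], outputs[y], judge)
        if result == "first":
            wins[x] += 1
        elif result == "second":
            wins[y] += 1
    return wins
```

To use it, pass a dictionary mapping each prompt or model name to its output, plus your own `judge` function; `rank_candidates(...).most_common(1)` then gives the candidate with the most pairwise wins. Win counts over a single input are noisy, so in practice you would likely aggregate over a set of test inputs before picking a winner.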