End-to-end Experiment Pipeline

Cameron Rohn · Category: frameworks_and_exercises

Cameron built a basic pipeline in which an evaluator overrides rows in an existing dataset across multiple experiments, and the pipeline tracks correctness rates, run counts, and latency.
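
The note does not show the pipeline itself, so the following is only a minimal sketch of what such a loop could look like, not Cameron's implementation. It assumes the dataset is a list of dicts with `input` and `expected` keys, that "overrides rows" means each experiment overwrites the result fields of the existing rows in place, and that correctness is an exact-match check. The names `ExperimentStats`, `exact_match`, `run_experiment`, `baseline`, and `lookup` are all hypothetical.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class ExperimentStats:
    """Aggregate metrics for one experiment (hypothetical structure)."""
    name: str
    run_count: int = 0
    correct: int = 0
    total_latency_s: float = 0.0

    @property
    def correctness_rate(self) -> float:
        return self.correct / self.run_count if self.run_count else 0.0

    @property
    def mean_latency_s(self) -> float:
        return self.total_latency_s / self.run_count if self.run_count else 0.0


def exact_match(output: str, expected: str) -> bool:
    """Assumed evaluator: case-insensitive exact match."""
    return output.strip().lower() == expected.strip().lower()


def run_experiment(
    name: str,
    dataset: list[dict],
    target: Callable[[str], str],
    evaluator: Callable[[str, str], bool] = exact_match,
) -> ExperimentStats:
    """Run `target` on every row, score it, and overwrite the row's result fields."""
    stats = ExperimentStats(name)
    for row in dataset:
        start = time.perf_counter()
        output = target(row["input"])
        elapsed = time.perf_counter() - start

        is_correct = evaluator(output, row["expected"])

        # Override the result fields of the existing row in place,
        # so the dataset always reflects the most recent experiment.
        row["experiment"] = name
        row["output"] = output
        row["correct"] = is_correct
        row["latency_s"] = elapsed

        stats.run_count += 1
        stats.correct += int(is_correct)
        stats.total_latency_s += elapsed
    return stats


# Toy dataset and two stand-in "systems under test" (illustrative only).
dataset = [
    {"input": "2+2", "expected": "4"},
    {"input": "capital of France", "expected": "Paris"},
]


def baseline(question: str) -> str:
    return "4"


def lookup(question: str) -> str:
    return {"2+2": "4", "capital of France": "paris"}.get(question, "")


results = [
    run_experiment("baseline", dataset, baseline),
    run_experiment("lookup", dataset, lookup),
]

for stats in results:
    print(stats.name, stats.run_count, stats.correctness_rate, stats.mean_latency_s)
```

Keeping the per-experiment aggregates in a separate object means earlier results survive even though each run overwrites the rows. If per-row history matters, appending results to a list per row would be an alternative to overriding.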