← Back to Vault

Activation Scaling Model

Cameron Rohn · Category: frameworks_and_exercises

Use back-of-the-envelope estimates of parameter activations (e.g., 3.2B in a 20B parameter model) to optimize model selection for efficiency and resource allocation.