The five tests that decide whether your LLM agent is a "model" under SS1/23

PRA Supervisory Statement SS1/23 (effective 17 May 2024) defines a model in Principle 1 as "a quantitative method, system, or approach that applies statistical, economic, financial, or mathematical theories, techniques and assumptions to process input data into output." An LLM agent making credit decisions, fraud assessments, or KYC determinations meets every limb of that definition.

  1. Does it process input data into output? Yes. Prompts in, decisions out.
  2. Does it use statistical / economic / financial / mathematical theory? Yes. Transformer architecture is statistical.
  3. Are the techniques used internally? Yes. Even if the LLM is third-party, the agent wrapping it is internal.
  4. Does it affect a financial decision the bank takes? Yes for credit/fraud/KYC agents.
  5. Is the output relied upon by the bank? Yes in production deployments.

What SS1/23 expects, and where Regulus delivers it

The PRA's principles map to runtime artefacts:

And the cross-cutting regimes

The question a PRA walkthrough actually asks

A real second-line walkthrough on an LLM-agent credit decision will look like:

  1. "Show me your model inventory." Regulus model-risk plugin maintains it; export as YAML or CSV.
  2. "Show me the validation evidence for model X." The audit chain has the validation events tagged with the validator's Principal and the validation report URL.
  3. "How are you monitoring model X's ongoing performance?" The eval harness emits outcomes events; filter the chain by model_id = X AND tag = OUTCOMES.
  4. "What happens if model X degrades?" The kill-switch plugin can collapse the agent's tool surface, requiring dual control to re-enable. Audit event captures both authorising Principals.

Where to start