1. What nist ai rmf alignment covers on the gateway
Thenist_ai_rmf pack (NIST AI Risk Management Framework 1.0,
jurisdiction US) maps three framework functions to content-plane
guardrails. Each is a real, editable rule built from the same preset
library you can author against by hand — open it, read it, tune it like
any other guardrail.
MANAGE 2.1 — Prompt-injection defence
MANAGE 2.1 — Prompt-injection defence
Detects and flags prompt-injection attempts on the request. Built from
the Prompt-Injection Basics preset — a keyword rule that annotates
common jailbreak phrases (
ignore previous instructions,
reveal your system prompt) at the input stage for review, without
blocking the user. This is the pack’s recommended control.MEASURE 2.6 — Content safety
MEASURE 2.6 — Content safety
Flags self-harm / unsafe content. Built from the Self-Harm Keywords
preset, which blocks input-stage prompts asking how to self-harm.
Pair it with your own escalation workflow so a blocked request surfaces
to a human reviewer with helpline information.
GOVERN 1.2 — Governance logging
GOVERN 1.2 — Governance logging
Records guardrail decisions as governance evidence. Built from the
Compliance Logger (observe-only) preset — it logs PII occurrences
and policy decisions across input and output without blocking or
modifying traffic.
These are the risk-function obligations the gateway can carry:
detection on inputs, a safety block, and a decision record. The framework
also asks for organizational work the gateway can’t perform — your
GOVERN 1.1 policies and accountability structure — which the pack carries
as an
Organizational: true row for you to evidence outside the gateway.
See shared responsibility.2. One concrete example: install, observe, go live
Pack work uses your console session (UserAuth) — not a relaysk-orca-… key. Browsing the catalog and checking readiness are free for
any workspace Member; installing is a workspace Admin action
on a paid plan, enforced server-side so a direct API call can’t bypass
the gate.
Browse and check readiness (Member, free)
Open Compliance → Frameworks and select NIST AI RMF. Readiness
shows how the three controls map to your current policies before you
commit to anything.
Install the pack (Admin, paid)
Installing from the console issues
POST /api/compliance/packs/nist_ai_rmf/install. One call materializes
the controls into a real, editable guardrail tagged with the pack’s
provenance — created in observe mode, so it flags instead of
blocking and you collect “would-have-blocked” evidence on live traffic
without affecting it.Watch the matches
Review what the MANAGE and MEASURE controls would catch in the
Guardrails matches feed (
GET /api/guardrail/match, Member). Tune
any rule in the console — it’s a standard guardrail, so every edit,
version, and revert path works unchanged.3. Ship a signed, verifiable report
When you’re enforcing, generate the compliance report: an Ed25519-signed, SHA-256-stamped artifact you can export as CSV, JSON, or PDF and hand to a reviewer. Anyone can verify it without an account.Per-function coverage with real counts
Per-function coverage with real counts
Each control row carries its status —
covered, observe, gap, or
attested — and how many times it actually fired over the period. A
MANAGE 2.1 control that flagged 2,000 injection attempts reads
differently to a reviewer than one with zero matches, and the report
shows both.Provenance lineage
Provenance lineage
Every materialized control records its
control_id (e.g.
nistai.injection), the verbatim clause (NIST AI RMF MANAGE 2.1),
the plane, and the id of the live guardrail enforcing it — so a
reviewer walks function → control → enforcing policy → matches with no
inferred step.Public verification
Public verification
Fetch the signing public key at
GET /api/public/compliance/pubkey,
submit the report to POST /api/public/compliance/verify, or open a
scoped reviewer share link at GET /api/public/compliance/share/:token.
No account required.4. Region-stamp your NIST AI RMF evidence
Reports are stamped and stored under your declared data-residency region (us / eu / uk / ap / cn / global); a report is only
served under a matching region, and cross-region reads are withheld. A
workspace Admin sets it via PUT /api/compliance/residency.
Request logs default to a 30-day retention (server-clamped to a 180-day
hard max), and a user deletion runs a 30-day grace window then a PII
scrub — both relevant when a reviewer asks about your retention posture.
See Retention and
Right to erasure.
5. NIST AI RMF alongside the rest of your program
The AI RMF rarely lands alone. The same install flow covers the adjacent AI-governance and LLM-security frameworks, each materializing its own editable controls:| Pack | Framework |
|---|---|
iso_42001 | ISO/IEC 42001 AI management system |
eu_ai_act | EU Artificial Intelligence Act |
owasp_llm | OWASP Top 10 for LLM Applications |
ISO 42001
AI management-system evidence.
EU AI Act
Prohibited-practice, transparency, record-keeping.
OWASP LLM Top 10
The high-signal LLM security risks as a pack.
6. Where to go next
Install a pack
The full install mechanics, shared across every framework.
Observe vs enforce
How observe mode turns into live enforcement, deliberately.
Guardrails reference
The content-plane controls the NIST AI RMF pack is built from.
Prompt injection
The threat the MANAGE 2.1 control defends against.
