Home
DomainsPlatformProductsDoctrineEvidence
DocsStyle GuideComponents
PlaygroundContact

SentinelOS

AI Safety Operating Layer

← Back to portfolio

SentinelOS governs AI systems.

SentinelOS is a modular AI Safety Operating Layer composed of PROACTIVE Gov, HUI Guard, Eval Workbench, Red Team Lab, and Trace Console. This portfolio acts as the inspection console where reviewers can understand what exists today, how incidents are handled, and where each safety control lives.

Architecture Snapshot

User Interaction
    │
    ▼
HUI Guard
    │
    ▼
PROACTIVE Gov
    │
    ▼
Eval Workbench
    │
    ▼
Red Team Lab
    │
    ▼
Trace Console

For a full description of each module, see docs/SentinelOS_ARCHITECTURE.md.

Incident Lifecycle (High Level)

  1. AI system emits a confident claim about functionality or safety posture.
  2. PROACTIVE Gov verifies the claim against code, config, and evidence.
  3. Eval Workbench scores behavior against evaluation rubrics.
  4. HUI Guard assesses human impact and intervenes if necessary.
  5. Trace Console records the full evidence trail for later inspection.

See docs/SentinelOS_INCIDENT_LIFECYCLE.md for the detailed walkthrough.

Open incident simulation →

SentinelOS Truth-Status

Every SentinelOS module and artifact is explicitly labeled so reviewers can see what exists today and what is planned.

Modules

PROACTIVE

Constitutional AI safety agent. Validates MR claims against I1–I6 invariants.

Implemented
UICare-System

Loop detection and rescue for neurodivergent-friendly development.

Partial
Instructional Integrity UI

Cognitive safety evaluator for learning environments.

Prototype
ConsentChain

Agent consent governance with cryptographic ledger and policy engine.

Partial
HUI Gov

Human-UI governance layer for epistemic labeling in decision support.

Planned
Eval Workbench / ESE

Empirical Safety Engine — failure classification, safety scoring, trend analysis.

Planned

Artifacts

PROACTIVE Validation (n=200)

100% detection rate, 0% FPR on TruthfulQA. Evidence in docs/evidence/validation_results.json

Implemented
SentinelOS Architecture Doc

docs/SentinelOS_ARCHITECTURE.md

Implemented
Incident Lifecycle Doc

docs/SentinelOS_INCIDENT_LIFECYCLE.md

Implemented
Truth-Status Matrix Doc

docs/SentinelOS_TRUTH_STATUS.md

Implemented
Safety Case Skeleton

docs/SentinelOS_SAFETY_CASE.md

Partial
Identity Refactor Contract

safety-identity-refactor/ — 36 files, Steps 0-19 of 25-step contract

Implemented