Security Architecture

Tier 2 | Deep technical documentation for security implementation Hub: README.md | Full Architecture: ARCHITECTURE.md

Overview

Security-first design with 7 defense layers:

Input Validation - Zod schemas at all boundaries
Secrets Vault - Never expose API keys or tokens
Rate Limiting - Token bucket per tool
Memory Bounds - Context pruning, history caps
Path Safety - Normalized paths, directory jails
Timeout Protection - TimeoutGuard for async operations
Byzantine Detection - Weighted voting with pattern detection

Authentication & Access Control

Nexus-agents uses a secure-by-default approach appropriate for its primary use case as a local CLI/MCP tool:

MCP mode (default): Communicates over stdio with the parent process only — no network listener, no external access possible. Authentication is not required because only the local process can communicate with the server.
REST API mode (opt-in): When enabled via the YAML config api.enabled: true, an API key is auto-generated and stored at ~/.nexus-agents/auth/rest-api-key with restrictive file permissions (0600). All REST endpoints require this key via the Authorization header.

Environment	Auth Needed?	Notes
Local MCP (stdio)	No	Only the parent process can reach the server
Local REST API	Auto	API key auto-generated on first start
Network-exposed	Yes	Set `NEXUS_AUTH_ENABLED=true`
Production deployment	Yes	Enable auth and use HTTPS

Token-Based Authentication (Issue #739)

For network-exposed deployments, nexus-agents provides token-based authentication via the AuthHandler middleware (src/mcp/middleware/auth-handler.ts).

Token management CLI commands:

nexus-agents auth init          # Generate new auth token
nexus-agents auth show          # Show token status (file path, permissions)
nexus-agents auth rotate        # Rotate token (invalidate old, generate new)
nexus-agents auth init --force  # Regenerate token (overwrite existing)

Security properties:

Tokens are 64-character hex strings (32 bytes of crypto.randomBytes)
Stored at ~/.nexus-agents/auth/server-token with 0600 permissions (owner read/write only)
Directory created with 0700 permissions
Timing-safe comparison (crypto.timingSafeEqual) prevents timing attacks
Token shown only once on generation (not recoverable — rotate if lost)

Enabling authentication:

# nexus-agents.yaml
security:
  auth:
    enabled: true
    method: token # 'token' (default) or 'oauth2' (future)
    tokenHeader: Authorization # Header name for Bearer token
    tokenFile: ~/.nexus-agents/auth/server-token # Token storage path

Or via environment variables:

export NEXUS_AUTH_ENABLED=true
nexus-agents --mode=server

Client usage:

# Include token in Authorization header
curl -H "Authorization: Bearer $(cat ~/.nexus-agents/auth/server-token)" \
  http://localhost:3000/api/orchestrate

Threat Model

Threat	Vector	Mitigation	Status
Prompt Injection	Malicious prompts	Input/output tagging, structured output	✅
SSRF	Outbound HTTP calls	URL allowlist, private IP blocking	✅
Path Traversal	Malicious file paths	Path normalization, directory jail	✅
ReDoS	Malicious regex	Static patterns only, no user RegExp	✅
MCP SDK ReDoS	CVE-2026-0621	TimeoutGuard, URI validation	✅
Secrets Exposure	Logs, errors	Secrets vault, sanitization	✅
Token Exhaustion	Unbounded context	Memory caps, pruning	✅
Injection	Malformed prompts	Input validation, Zod schemas	✅
Byzantine Failures	Malicious agents	Weighted voting with Byzantine detection	✅

Reference: OWASP LLM Top 10 (LLM01: Prompt Injection)

MCP Middleware Security

Every MCP tool invocation passes through a hardened middleware pipeline:

Layer	Protection
Authentication	Bearer token validation with timing-safe compare
CORS	Strict origin checking for HTTP transports
Security Headers	X-Content-Type-Options, X-Frame-Options, CSP
Body Size Limits	Configurable max request size (default 1 MB)
Input Validation	Size limits on tool arguments, Zod validation
Policy Firewall	Allowlist/denylist rules per tool with enforcement
Rate Limiting	Token bucket per tool (configurable requests/min)
Output Sanitization	Secret pattern redaction (keys, tokens, passwords)
Audit Logging	SIEM-compatible JSON-L events for tool invocations

Configuration

security:
  policy:
    policyMode: enforce # enforce | warn
    defaultMode: read-only # read-only | read-write

  rateLimit:
    enabled: true
    requestsPerMinute: 60

  audit:
    enabled: true
    logDir: ~/.nexus-agents/audit
    minSeverity: info # info | warning | critical
    enableHashChain: false # Tamper-evident log chain
    maxFileSizeBytes: 10485760
    maxFiles: 10

Sandbox Execution

All agent-executed code runs through the sandbox system.

Execution Modes

Mode	Description	Security Level	Use Case
`none`	No sandboxing (development only)	None	Local dev, debugging
`policy`	Command allowlist enforcement	Medium	Standard operation
`container`	Full Docker isolation	High	Production, untrusted input

Docker Security (Container Mode)

docker run \
  --cap-drop=ALL \           # Drop all Linux capabilities
  --read-only \              # Read-only root filesystem
  --network=none \           # No network access
  --user=node \              # Non-root user
  --memory=512m \            # Memory limit
  --cpus=2 \                 # CPU limit
  --pids-limit=10 \          # Process limit
  --security-opt=no-new-privileges

Command Classification

Category	Commands	Action
Allowed	pnpm, npm, git, gh, node, npx, tsc, eslint, vitest	Execute in sandbox
Denied	rm, curl, wget, ssh, sudo, kill, chmod, dd, mkfs	Block immediately
Requires Approval	docker, kubectl, aws, gcloud	User confirmation

Resource Limits

Resource	Limit	Rationale
Memory	512 MB	Prevent memory exhaustion
CPU	2 cores	Prevent CPU monopolization
Timeout	5 minutes	Prevent hung processes
Processes	10 max	Prevent fork bombs
Disk write	Read-only	Prevent persistent changes
Network	None	Prevent data exfiltration

Environment Sanitization

Variables blocked from execution context:

const BLOCKED_PREFIXES = [
  'API_',
  'TOKEN_',
  'SECRET_',
  'KEY_',
  'PASSWORD_',
  'CREDENTIAL_',
  'AWS_',
  'AZURE_',
  'GCP_',
  'ANTHROPIC_',
  'OPENAI_',
  'GOOGLE_AI_',
];

function sanitizeEnv(env: Record<string, string>): Record<string, string> {
  return Object.fromEntries(
    Object.entries(env).filter(
      ([key]) => !BLOCKED_PREFIXES.some((prefix) => key.startsWith(prefix))
    )
  );
}

Secrets Vault Pattern

class SecretsVault {
  private readonly secrets: Map<string, string>;

  get(key: string): string | undefined {
    // Audit log access
    return this.secrets.get(key);
  }

  // Sanitize before any output
  sanitize(text: string): string {
    for (const secret of this.secrets.values()) {
      text = text.replaceAll(secret, '[REDACTED]');
    }
    return text;
  }
}

Usage Rules

Never log secrets - Use sanitize() before any logging
Never return secrets - Sanitize all tool outputs
Audit access - Log when secrets are accessed
Rotate regularly - Support key rotation without restart

Input Validation Pipeline

const validateInput = (input: unknown): Result<ValidInput, ValidationError> => {
  // 1. Schema validation (Zod)
  const parsed = InputSchema.safeParse(input);
  if (!parsed.success) return { ok: false, error: parsed.error };

  // 2. Business rule validation
  if (parsed.data.value < 0) {
    return { ok: false, error: new ValidationError('Value must be positive') };
  }

  // 3. Security checks (path traversal, injection, etc.)
  const sanitized = sanitizeInput(parsed.data);

  return { ok: true, value: sanitized };
};

Path Traversal Prevention

function validatePath(userPath: string, allowedRoot: string): Result<string, SecurityError> {
  const resolved = path.resolve(allowedRoot, userPath);
  if (!resolved.startsWith(allowedRoot)) {
    return { ok: false, error: new SecurityError('Path traversal detected') };
  }
  return { ok: true, value: resolved };
}

Directory Jail

All file operations are restricted to:

Current working directory
Explicitly allowed paths in config
Temporary directories with cleanup

ReDoS Prevention

// NEVER do this - user-provided regex
const pattern = new RegExp(userInput); // DANGEROUS!

// ALWAYS do this - static patterns only
const VALID_PATTERN = /^[a-zA-Z0-9_-]+$/; // Static, safe pattern

// Validate against known-safe patterns
function validateIdentifier(input: string): boolean {
  return VALID_PATTERN.test(input);
}

CVE-2026-0621 Mitigation

TimeoutGuard for all async operations:

const result = await TimeoutGuard.execute(async () => await mcpTool.execute(args), {
  timeout: 30000,
  name: 'mcp-tool-call',
});

Rate Limiting

Token bucket per tool:

const rateLimiter = new TokenBucket({
  capacity: 100, // Max burst
  refillRate: 10, // Tokens per second
});

async function executeWithRateLimit<T>(fn: () => Promise<T>, cost: number = 1): Promise<T> {
  if (!rateLimiter.consume(cost)) {
    throw new RateLimitError('Rate limit exceeded');
  }
  return fn();
}

Per-Tool Limits

Tool Category	Requests/min	Rationale
Read operations	120	Low risk, high frequency
Write operations	60	Higher risk, audit trail
Execute commands	30	Highest risk, full audit
External API	60	Provider limits

Sanitization Pipeline

All output passes through sanitization:

// Standard sanitization
const sanitized = logger.sanitize(text);
// API keys, tokens, passwords -> [REDACTED]

// Patterns matched
const REDACT_PATTERNS = [
  /sk-[a-zA-Z0-9]{48}/g, // Anthropic API keys
  /sk-[a-zA-Z0-9-_]{43}/g, // OpenAI API keys
  /ghp_[a-zA-Z0-9]{36}/g, // GitHub tokens
  /password["']?\s*[:=]\s*["']?[^"'\s]+/gi, // Password values
];

Security Audit Logging

Audit events are written as SIEM-compatible JSON-L to ~/.nexus-agents/audit/ (configurable).

Event Types

Event	Severity	Trigger
`tool_invocation`	info	Every tool call (success/failure)
`policy_decision`	warning	Policy firewall deny
`rate_limit_violation`	warning	Rate limit exceeded
`system_startup/shutdown`	info	Server lifecycle events

Enabling Audit Logging

security:
  audit:
    enabled: true
    minSeverity: info
    enableHashChain: true # Optional tamper-evident chain

Each event includes: timestamp, actor, tool name, request ID, duration, and outcome. With enableHashChain: true, events include a SHA-256 hash chain for tamper detection.

Configuration

security:
  sandbox:
    mode: policy # none | policy | container
    fallbackMode: none # Fallback if container unavailable
    resourceLimits:
      memory: 512m
      cpu: 2
      timeout: 300s
      maxProcesses: 10

  allowedPaths: [./] # Directory jail

  policy:
    policyMode: enforce # enforce | warn
    defaultMode: read-only # read-only | read-write

  rateLimit:
    enabled: true
    requestsPerMinute: 60

  audit:
    enabled: false # Set to true for structured audit logging
    logDir: ~/.nexus-agents/audit
    minSeverity: info # info | warning | critical
    enableHashChain: false # Tamper-evident hash chain
    maxFileSizeBytes: 10485760 # 10 MB per log file
    maxFiles: 10 # Log rotation file count

Security Checklist

Before any code change:

No secrets in code, logs, or outputs
Input validation at all boundaries
Path traversal prevention on file ops
No user-provided RegExp (use static patterns)
Rate limiting on public interfaces
Memory bounds on collections
Timeout on external calls
Sanitization on all outputs

Agent-SafetyBench Evaluation

Comprehensive safety category taxonomy for agent evaluation (arXiv:2412.14470).

Safety Categories

Category	Risk Level	Description
`harm_physical`	Critical	Actions causing physical harm
`harm_emotional`	High	Psychological or emotional harm
`harm_financial`	High	Financial loss or fraud
`deception`	High	Misleading or false information
`bias`	Medium	Discriminatory outputs
`privacy`	High	Privacy violations, data exposure
`manipulation`	High	Social engineering, coercion
`instruction_safety`	Critical	Jailbreak, prompt injection
`robustness`	Medium	Adversarial input resilience
`risk_awareness`	Medium	Appropriate caution and escalation

Expected Outcomes

Outcome	Description
`refuse`	Decline harmful request
`caution`	Proceed with warnings
`clarify`	Request more information
`escalate`	Elevate to human review
`proceed`	Safe to execute
`detect`	Identify and report attack attempt

Usage

import { SAFETY_CATEGORIES, getSafetyTaxonomySummary } from 'nexus-agents';

// Get all categories
for (const category of SAFETY_CATEGORIES) {
  console.log(`${category.id}: ${category.defaultRiskLevel}`);
}

// Get summary statistics
const summary = getSafetyTaxonomySummary();
// { totalCategories: 10, totalCriteria: 42, ... }

Source Files

File	Purpose
`src/mcp/middleware/auth-handler.ts`	Token-based authentication
`src/mcp/middleware/secure-handler.ts`	Tool-level security wrapper
`src/mcp/middleware/request-context.ts`	Request tracking and auth
`src/mcp/middleware/rate-limiter.ts`	Rate limiting
`src/security/sandbox/sandbox-manager.ts`	Sandbox orchestration
`src/security/sandbox/docker-sandbox-executor.ts`	Container isolation
`src/security/input-sanitizer.ts`	Input validation
`src/security/output-sanitizer.ts`	Output sanitization / secrets
`src/security/trust-classifier.ts`	Trust tier classification
`src/security/policy-gate.ts`	Policy enforcement
`src/security/corroboration-validator.ts`	Action corroboration checks
`src/security/reputation-model.ts`	Agent reputation scoring
`src/security/audit-trail.ts`	Security logging
`src/security/firewall/`	Firewall pipeline
`src/security/safety-bench/`	SafetyBench evaluation
`src/security/safety-bench/safety-categories.ts`	Category taxonomy
`src/security/safety-bench/safety-enums.ts`	Risk levels, outcomes
`src/security/safety-bench/safety-schemas.ts`	Validation schemas

Penetration Testing

113 security tests covering:

Container escape prevention
Capability restrictions
Network isolation
Filesystem isolation
Privilege escalation prevention
Command injection prevention
Environment variable sanitization
CVE regression tests

Run with:

pnpm test src/security/sandbox-pentest.test.ts

Consensus Protocols: CONSENSUS_PROTOCOLS.md (Byzantine detection)
Routing System: ROUTING_SYSTEM.md (Budget limits)
Full Architecture: ARCHITECTURE.md
Coding Standards: CODING_STANDARDS.md

Security Architecture

Overview

Authentication & Access Control

Token-Based Authentication (Issue #739)

Threat Model

MCP Middleware Security

Configuration

Sandbox Execution

Execution Modes

Docker Security (Container Mode)

Command Classification

Resource Limits

Environment Sanitization

Secrets Vault Pattern

Usage Rules

Input Validation Pipeline

Path Traversal Prevention

Directory Jail

ReDoS Prevention

CVE-2026-0621 Mitigation

Rate Limiting

Per-Tool Limits

Sanitization Pipeline

Security Audit Logging

Event Types

Enabling Audit Logging

Configuration

Security Checklist

Agent-SafetyBench Evaluation

Safety Categories

Expected Outcomes

Usage

Source Files

Penetration Testing

Related Documents