API Integration Guide

This guide provides comprehensive instructions for integrating with the Puppeteer MCP platform through its four protocol interfaces: REST, gRPC, WebSocket, and Model Context Protocol (MCP). Each protocol offers unique advantages for different use cases.

Protocol Comparison

REST API

Best for: Traditional HTTP clients, web applications, simple integrations
Advantages: Widely supported, simple to implement, stateless, HTTP/2 support
Authentication: JWT Bearer tokens or API keys via headers
Response format: JSON
Use cases: CRUD operations, one-off requests, RESTful applications

gRPC

Best for: Microservices, high-performance applications, type-safe clients
Advantages: Binary protocol, streaming support, auto-generated clients, lower latency
Authentication: JWT or API keys via metadata
Response format: Protocol Buffers
Use cases: Real-time data streaming, microservice communication, performance-critical apps

WebSocket

Best for: Real-time applications, event-driven architectures, live updates
Advantages: Bidirectional communication, low latency, persistent connections
Authentication: JWT or API keys during handshake
Response format: JSON message envelopes
Use cases: Live browser events, real-time notifications, collaborative features

Model Context Protocol (MCP)

Best for: AI agents, LLMs, automated workflows
Advantages: AI-native interface, tool discovery, semantic understanding
Authentication: Unified auth bridge supporting JWT/API keys
Response format: MCP protocol messages
Use cases: AI-driven automation, intelligent agents, LLM integration

Authentication Setup

JWT Authentication

REST API

# Login to get tokens
curl -X POST http://localhost:8443/api/v1/auth/login \
  -H "Content-Type: application/json" \
  -d '{"username": "user", "password": "pass"}'

# Use access token
curl -H "Authorization: Bearer <access_token>" \
  http://localhost:8443/api/v1/sessions

gRPC

const metadata = new grpc.Metadata();
metadata.set('authorization', `Bearer ${accessToken}`);
client.listSessions({}, metadata, callback);

WebSocket

const ws = new WebSocket('ws://localhost:8443');
ws.on('open', () => {
  ws.send(
    JSON.stringify({
      type: 'auth',
      token: accessToken,
    }),
  );
});

MCP

// MCP handles auth transparently via the auth bridge
const client = new MCPClient({
  auth: { token: accessToken },
});

API Key Authentication

REST API

curl -H "X-API-Key: your-api-key" \
  http://localhost:8443/api/v1/sessions

gRPC

const metadata = new grpc.Metadata();
metadata.set('x-api-key', apiKey);
client.listSessions({}, metadata, callback);

WebSocket

ws.send(
  JSON.stringify({
    type: 'auth',
    apiKey: apiKey,
  }),
);

MCP

const client = new MCPClient({
  auth: { apiKey: apiKey },
});

Session Management Best Practices

1. Session Creation

Always create a session before performing operations:

// REST
const session = await fetch('/api/v1/sessions', {
  method: 'POST',
  headers: { Authorization: `Bearer ${token}` },
  body: JSON.stringify({ name: 'my-session' }),
}).then((r) => r.json());

// Use session.id for subsequent operations

2. Context Management

Create contexts within sessions for browser automation:

// Create context with browser settings
const context = await createContext(sessionId, {
  name: 'browser-context',
  viewport: { width: 1920, height: 1080 },
  userAgent: 'Mozilla/5.0...',
});

3. Session Lifecycle

Sessions expire after inactivity (configurable)
Always clean up sessions when done
Use refresh tokens for long-running operations
Monitor session health via health endpoints

4. Error Handling

try {
  const result = await executeAction(contextId, action);
} catch (error) {
  if (error.code === 401) {
    // Refresh token and retry
    await refreshAuth();
    return retry();
  }
  // Handle other errors
}

Example Workflows

Web Scraping Workflow

// 1. Create session
const session = await createSession('scraper-session');

// 2. Create browser context
const context = await createContext(session.id, {
  name: 'scraper',
  viewport: { width: 1280, height: 720 },
});

// 3. Navigate to target
await executeAction(context.id, {
  action: 'navigate',
  params: { url: 'https://example.com' },
});

// 4. Extract data
const content = await executeAction(context.id, {
  action: 'evaluate',
  params: {
    script: 'document.querySelector(".content").innerText',
  },
});

// 5. Cleanup
await deleteContext(context.id);
await deleteSession(session.id);

Real-time Monitoring

// WebSocket for live updates
ws.on('authenticated', () => {
  // Subscribe to browser events
  ws.send(
    JSON.stringify({
      type: 'subscribe',
      topic: 'browser.events',
      contextId: contextId,
    }),
  );
});

ws.on('message', (data) => {
  const msg = JSON.parse(data);
  if (msg.type === 'browser.event') {
    console.log('Browser event:', msg.event);
  }
});

AI-Driven Automation

// Using MCP for intelligent automation
const response = await mcp.callTool('execute-in-context', {
  contextId: contextId,
  command: 'screenshot',
  parameters: { fullPage: true },
});

// AI can chain operations based on results
if (needsScroll(response)) {
  await mcp.callTool('execute-in-context', {
    contextId: contextId,
    command: 'scroll',
    parameters: { direction: 'down' },
  });
}

Protocol Selection Guidance

Choose REST when:

Building traditional web applications
Need simple, stateless operations
Working with standard HTTP tooling
Require broad client compatibility

Choose gRPC when:

Building microservices architecture
Need high performance and low latency
Want type-safe client generation
Require streaming capabilities

Choose WebSocket when:

Building real-time applications
Need bidirectional communication
Want live browser event updates
Require persistent connections

Choose MCP when:

Integrating with AI/LLM systems
Building intelligent automation
Need semantic tool discovery
Want unified multi-protocol access

Security Considerations

Always use HTTPS/WSS in production
Rotate API keys regularly
Implement rate limiting per session
Validate all browser action inputs
Use short-lived JWT tokens
Enable audit logging for compliance
Implement proper CORS policies
Sanitize JavaScript execution

Performance Tips

Reuse sessions for multiple operations
Use browser pooling to reduce startup time
Batch operations when possible
Monitor resource usage via health endpoints
Implement proper timeout handling
Use streaming for large responses
Cache authentication tokens appropriately

Next Steps

Review the API Quick Reference for endpoint details
Explore Browser Automation Guide for Puppeteer features
Check MCP Usage Examples for AI integration patterns
See Security Documentation for compliance requirements