How do I write unit tests for an MCP tool?

Use `FusionTester`. It simulates an exact MCP JSON-RPC call entirely in-memory. You pass the tool name and payload, and it bypasses network transports, instantly returning the structured outcome object for assertions.

How do I test that a Presenter strips PII?

Write a Jest/Vitest check: pass a raw database object containing a password field into the `FusionTester`. Then assert that `expect(result.data.password).toBeUndefined()`. This guarantees your Zod egress schema is correctly pruning dangerous fields.

Can I test Context-Aware System Rules?

Yes. Execute a tool using `FusionTester` with mock context (e.g., role: 'guest'), then assert the resulting `result._systemRules` array. Run it again with `role: 'admin'` and verify the admin-specific instructions successfully injected.

Does FusionTester require an LLM API key?

No! FusionTester strictly tests your server's logic, validation, grouping, and Presenter rendering. It uses zero tokens, costs zero dollars, and executes in milliseconds, making it perfect for CI/CD pipelines.

How do I mock external APIs during a tool test?

Because MCP Fusion is standard Node.js/TypeScript, you can use standard mocking libraries like `msw` (Mock Service Worker), `nock`, or `jest.spyOn()` to mock external fetch calls independently from the MCP routing logic.

Testing

Prerequisites

Install MCP Fusion before following this recipe: npm install @vinkius-core/mcp-fusion @modelcontextprotocol/sdk zod — or scaffold a project with npx fusion create.

Introduction
FusionTester Setup
Executing Tools
Firewall Tests — Field Whitelist
Rules Verification
Middleware & Guards
Generator Tests

Introduction

MCP Fusion ships @vinkius-core/mcp-fusion-testing — a dedicated testing harness that enables Automated AI Tool Testing. It lets you execute tools, inspect responses, verify Presenter rules, and assert on field whitelists without spinning up a full MCP server.

The philosophy: test perception, not plumbing. Instead of testing "does findMany return rows?", test "does the AI receive exactly the fields it should, with the right rules attached?" This focuses your testing on guaranteeing Deterministic LLM Output and ensuring absolute Data Exfiltration Prevention before your agents ever reach production.

FusionTester Setup

Create a shared tester instance in your test setup file:

typescript

// tests/setup.ts
import { FusionTester } from '@vinkius-core/mcp-fusion-testing';
import { registry } from '../src/index.js';

export function createTester(contextOverrides?: Partial<AppContext>) {
  return new FusionTester(registry, {
    db: createTestDatabase(),
    tenantId: 'test-tenant',
    userId: 'test-user',
    ...contextOverrides,
  });
}

FusionTester wraps your registry with a test-friendly API. It executes tools with the same middleware chain, Presenter pipeline, and response builder as production — but without the MCP transport layer.

Executing Tools

typescript

import { describe, it, expect } from 'vitest';
import { createTester } from './setup.js';

describe('projects.list', () => {
  it('returns projects for the current tenant', async () => {
    const tester = createTester();

    const result = await tester.callTool('projects.list', {
      status: 'active',
    });

    expect(result.isError).toBe(false);
    expect(result.content).toBeDefined();
    expect(result.content[0].text).toContain('active');
  });

  it('returns error for invalid parameters', async () => {
    const tester = createTester();

    const result = await tester.callTool('projects.list', {
      status: 'invalid_status',   // not in enum
    });

    expect(result.isError).toBe(true);
  });
});

callTool(name, args) executes the full pipeline: validation → middleware → handler → Presenter → response. The result is an MCP ToolResponse.

Firewall Tests — Field Whitelist

The most important test category: verify that internal fields never leak to the AI. The Presenter's Zod .strict() schema strips undeclared fields — but you should test it:

typescript

// tests/firewall/invoices.firewall.test.ts
import { describe, it, expect } from 'vitest';
import { createTester } from '../setup.js';

describe('Invoice firewall', () => {
  it('strips internal fields from response', async () => {
    const tester = createTester();
    const result = await tester.callTool('billing.get_invoice', { id: 'INV-1' });

    const data = JSON.parse(result.content[0].text);

    // These fields MUST be present
    expect(data).toHaveProperty('id');
    expect(data).toHaveProperty('amount_cents');
    expect(data).toHaveProperty('status');

    // These MUST NOT leak
    expect(data).not.toHaveProperty('stripe_customer_id');
    expect(data).not.toHaveProperty('internal_notes');
    expect(data).not.toHaveProperty('password_hash');
  });
});

IMPORTANT

Firewall tests are your security boundary. Run them on every CI push. A failing firewall test means sensitive data could reach the AI.

Rules Verification

Verify that system rules appear in the response when (and only when) they should:

typescript

// tests/rules/invoices.rules.test.ts
describe('Invoice rules', () => {
  it('includes currency rules in response', async () => {
    const tester = createTester();
    const result = await tester.callTool('billing.get_invoice', { id: 'INV-1' });

    const text = result.content.map(c => c.text).join('\n');
    expect(text).toContain('CENTS');
    expect(text).toContain('Divide by 100');
  });

  it('includes RBAC restriction for non-admins', async () => {
    const tester = createTester({ user: { role: 'viewer' } });
    const result = await tester.callTool('employees.get', { id: 'EMP-1' });

    const text = result.content.map(c => c.text).join('\n');
    expect(text).toContain('RESTRICTED');
    expect(text).toContain('Do NOT display salary');
  });
});

Middleware & Guards

Test that middleware blocks unauthorized access:

typescript

// tests/guards/auth.guard.test.ts
describe('Auth middleware', () => {
  it('rejects unauthenticated requests', async () => {
    const tester = createTester({ token: '' });
    const result = await tester.callTool('users.list', {});

    expect(result.isError).toBe(true);
    expect(result.content[0].text).toContain('Authentication required');
  });

  it('rejects non-admin from admin endpoints', async () => {
    const tester = createTester({ token: memberToken });
    const result = await tester.callTool('users.delete', { user_id: 'U-1' });

    expect(result.isError).toBe(true);
    expect(result.content[0].text).toContain('admin role required');
  });
});

Generator Tests

Test streaming handlers by collecting progress events:

typescript

describe('Streaming', () => {
  it('emits progress events', async () => {
    const tester = createTester();
    const progressEvents: { progress: number; message: string }[] = [];

    const result = await tester.callTool(
      'repo.analyze',
      { url: 'https://github.com/test/repo' },
      { onProgress: (p) => progressEvents.push(p) },
    );

    expect(result.isError).toBe(false);
    expect(progressEvents.length).toBeGreaterThan(0);
    expect(progressEvents[progressEvents.length - 1].progress).toBe(100);
  });
});

The onProgress callback collects every yield progress() from the generator handler.

Core

Other

Prompt

StateSync

Other

Sandbox

Client

Core

Domain Models

FSM

Governance

Observability

Presenter

Prompt

Sandbox

Serialization

Server

StateSync

Testing

Introduction

FusionTester Setup

Executing Tools

Firewall Tests — Field Whitelist

Rules Verification

Middleware & Guards

Generator Tests

Testing ​

Introduction ​

FusionTester Setup ​

Executing Tools ​

Firewall Tests — Field Whitelist ​

Rules Verification ​

Middleware & Guards ​

Generator Tests ​

Testing

Introduction

FusionTester Setup

Executing Tools

Firewall Tests — Field Whitelist

Rules Verification

Middleware & Guards

Generator Tests