
# Without MVA vs With MVA

Every MCP server today follows the same pattern: raw JSON output, manual routing, zero guardrails. The table below shows what changes when you adopt MVA.

## The Quick Comparison

| Aspect | Without MVA | With MVA (MCP Fusion) |
| --- | --- | --- |
| Tool count | 50 individual tools registered. LLM sees ALL of them. Token explosion. | **Action consolidation**: 5,000+ operations behind ONE tool via a `module.action` discriminator. 10x fewer tokens. |
| Response format | Raw `JSON.stringify()` — the AI parses and guesses | **Structured perception package** — validated data + rules + UI + affordances |
| Domain context | None. `amount_cents: 45000` — is it dollars? cents? yen? | System rules travel with the data: "CRITICAL: amount_cents is in CENTS. Divide by 100." |
| Next actions | The AI hallucinates tool names | **Agentic HATEOAS**: `.suggestActions()` provides explicit hints based on data state |
| Large datasets | 10,000 rows dump into context — token DDoS | **Cognitive guardrails**: `.agentLimit(50)` truncates and teaches the agent to use filters |
| Security | Internal fields (`password_hash`, `ssn`) leak to LLM | **Schema as boundary** — Zod `.strict()` rejects undeclared fields with actionable errors. Automatic. |
| Reusability | Same entity rendered differently by different tools | Presenter defined once, reused everywhere. Same rules, same UI, same affordances |
| Charts & visuals | Not possible — text only | **UI Blocks**: `.uiBlocks()` renders ECharts, Mermaid diagrams, summaries server-side |
| Routing | `switch`/`case` with hundreds of branches | **Hierarchical groups**: `platform.users.list`, `platform.billing.refund` — infinite nesting |
| Validation | Manual `if (!args.id)` checks | Zod schema at the framework level. Handlers receive only valid, typed data |
| Error recovery | `throw new Error('not found')` — the AI gives up | **Self-healing errors**: `toolError()` with recovery hints and suggested retry args |
| Middleware | Copy-paste auth checks in every handler | **tRPC-style**: `defineMiddleware()` with context derivation, pre-compiled chains |
| Composition | Flat responses, no nesting | **Presenter embedding**: `.embed()` nests child Presenters. Rules and UI merge automatically |
| Cache signals | None — the AI re-fetches stale data forever | **State sync**: `cacheSignal()` and `invalidates()` — RFC 7234-inspired temporal awareness |
| Token efficiency | Full JSON payloads every time | **TOON encoding**: `toonSuccess()` reduces token count by ~40% |
| Type safety | Manual type casting, no client types | **Type-safe client**: `createFusionClient()` with end-to-end inference, catches errors at build time |
| Streaming | No progress feedback during long operations | **Generator-based streaming**: `yield progress(0.5, 'Processing...')` |
| Tool exposure | All or nothing | **Tag filtering** — selective tool exposure per session with `.tags()` and filter |
| Immutability | Mutable state, runtime surprises | **Freeze-after-build**: `Object.freeze()` prevents mutations after build |
| Observability | `console.log()` | **Zero-overhead observer**: `createDebugObserver()` with typed event system |
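The consolidation row is the core trick: instead of registering dozens of tools, every operation is routed through one dispatcher keyed by a `module.action` string. A minimal sketch of that idea in plain TypeScript — a hand-rolled dispatch map, not the mcp-fusion API:

```typescript
// Sketch only: one tool, many operations, keyed by "module.action".
// The handlers and registry entries here are illustrative placeholders.
type Handler = (args: Record<string, unknown>) => unknown;

const registry: Record<string, Handler> = {
    'billing.get_invoice': (args) => ({ id: args.id, amount_cents: 45000 }),
    'users.list': () => [{ id: 'u1', name: 'Ada' }],
};

function dispatch(mod: string, action: string, args: Record<string, unknown> = {}) {
    const handler = registry[`${mod}.${action}`];
    // Unknown operations fail loudly instead of falling through 50 if/else branches.
    if (!handler) throw new Error(`Unknown operation: ${mod}.${action}`);
    return handler(args);
}
```

The LLM only ever sees a single tool whose arguments include the discriminator, so the tool list stays constant no matter how many operations live behind it.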

## Side-by-Side Code

### Returning an invoice

```typescript
// ❌ Raw MCP — the AI is on its own
server.setRequestHandler(CallToolRequestSchema, async (request) => {
    const { name, arguments: args } = request.params;

    if (name === 'get_invoice') {
        const invoice = await db.invoices.findUnique(args.id);
        // Raw JSON. No rules. No hints. No security boundary.
        return {
            content: [{
                type: 'text',
                text: JSON.stringify(invoice)
            }]
        };
    }
    // ...50 more if/else branches
});

// What the AI receives:
// { "id": "inv_123", "amount_cents": 45000, "status": "pending",
//   "internal_margin": 0.12, "customer_ssn": "123-45-6789" }
//
// Problems:
// - AI doesn't know amount_cents is in cents → displays $45,000 instead of $450
// - Internal fields leak (margin, SSN)
// - AI doesn't know it can call "pay" next
// - No visual representation
```
```typescript
// ✅ mcp-fusion — the Presenter handles perception
const InvoicePresenter = createPresenter('Invoice')
    .schema(z.object({
        id: z.string(),
        amount_cents: z.number(),
        status: z.enum(['paid', 'pending', 'overdue']),
        // internal_margin and customer_ssn are NOT in the schema
        // → .strict() rejects them with an actionable error naming each invalid field.
    }).strict())
    .systemRules([
        'CRITICAL: amount_cents is in CENTS. Divide by 100 for display.',
        'Always show currency as USD.',
    ])
    .uiBlocks((inv) => [
        ui.echarts({
            series: [{ type: 'gauge', data: [{ value: inv.amount_cents / 100 }] }]
        }),
    ])
    .suggestActions((inv) =>
        inv.status === 'pending'
            ? [{ tool: 'billing.pay', reason: 'Invoice is pending — process payment' }]
            : [{ tool: 'billing.archive', reason: 'Invoice is settled — archive it' }]
    );

const billing = defineTool<AppContext>('billing', {
    actions: {
        get_invoice: {
            returns: InvoicePresenter, // ← One line. That's it.
            params: { id: 'string' },
            handler: async (ctx, args) => ctx.db.invoices.findUnique(args.id),
        },
    },
});

// What the AI receives:
// ── System Rules ──
// CRITICAL: amount_cents is in CENTS. Divide by 100 for display.
// Always show currency as USD.
//
// ── Data ──
// { "id": "inv_123", "amount_cents": 45000, "status": "pending" }
// (internal_margin and customer_ssn were rejected by .strict())
//
// ── UI ──
// [ECharts gauge: $450.00]
//
// ── Suggested Actions ──
// → billing.pay — "Invoice is pending — process payment"
```
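The schema-as-boundary behavior can be approximated without any framework: keep an explicit field whitelist and fail loudly on anything undeclared. A hand-rolled sketch of the effect — Zod's `.strict()` does this for real; `INVOICE_FIELDS` and `enforceBoundary` are illustrative names, not library API:

```typescript
// Sketch: reject (rather than silently leak) any field not declared in the schema.
const INVOICE_FIELDS = ['id', 'amount_cents', 'status'] as const;

function enforceBoundary(raw: Record<string, unknown>): Record<string, unknown> {
    const unexpected = Object.keys(raw).filter(
        (k) => !(INVOICE_FIELDS as readonly string[]).includes(k),
    );
    if (unexpected.length > 0) {
        // Actionable error: names each invalid field so the bug is visible immediately.
        throw new Error(`Unexpected fields rejected: ${unexpected.join(', ')}`);
    }
    return raw;
}
```

The key design choice is reject-not-strip: a stripped field hides the leak until someone audits the handler, while a rejection surfaces it on the first call.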

### Listing users with guardrails

```typescript
// ❌ Returns ALL 10,000 users into the context window
case 'list_users':
    const users = await db.users.findMany();
    return {
        content: [{
            type: 'text',
            text: JSON.stringify(users) // 10,000 users × 500 tokens each = context DDoS
        }]
    };

// Result: ~5,000,000 tokens per call. Context overflow. Degraded accuracy.
```
```typescript
// ✅ Cognitive guardrails protect the context window
const UserPresenter = createPresenter('User')
    .schema(z.object({ id: z.string(), name: z.string(), role: z.string() }))
    .agentLimit(50, {
        warningMessage: 'Showing {shown} of {total}. Use filters to narrow results.',
    })
    .suggestActions(() => [
        { tool: 'users.search', reason: 'Search by name or role for specific users' },
    ]);

// Result: 50 users shown. Agent guided to use filters.
// Cost: ~25,000 tokens per call (200x reduction). Context protected.
```
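Mechanically, a guardrail like `.agentLimit(50)` amounts to truncate-plus-teach: cap the rows, and attach a message that tells the agent how to narrow the query instead of leaving it to guess. A framework-free sketch — `applyAgentLimit` is an illustrative name, not the library API:

```typescript
// Sketch: truncate a result set and attach a teaching message for the agent.
function applyAgentLimit<T>(rows: T[], limit: number): { rows: T[]; warning: string | null } {
    if (rows.length <= limit) {
        return { rows, warning: null }; // Under the cap: pass through untouched.
    }
    return {
        rows: rows.slice(0, limit),
        // The warning is for the LLM, not the end user: it teaches the next call.
        warning: `Showing ${limit} of ${rows.length}. Use filters to narrow results.`,
    };
}
```

Pairing the warning with a suggested action (`users.search` above) closes the loop: the agent learns both that it was truncated and what to call instead.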

### Error recovery

```typescript
// ❌ The AI receives "Error" and gives up
if (!invoice) {
    return {
        content: [{ type: 'text', text: 'Invoice not found' }],
        isError: true
    };
}
// AI: "I encountered an error. Please try again."
// (It has no idea what to try differently)
```
```typescript
// ✅ Self-healing errors with recovery hints
if (!invoice) {
    return toolError('NOT_FOUND', {
        message: `Invoice ${args.id} not found`,
        recovery: {
            action: 'list',
            suggestion: 'List invoices to find the correct ID',
        },
        suggestedArgs: { status: 'pending' },
    });
}
// AI: "Invoice not found. Let me list pending invoices to find the right one."
// → Automatically calls billing.list with { status: 'pending' }
```
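The payload `toolError()` produces can be approximated by hand: a standard MCP error result whose text body carries a machine-readable code plus recovery hints, so the model has a concrete next step instead of a dead end. A sketch of that shape — `recoverableError` is an illustrative helper, not the library API:

```typescript
// Sketch: an MCP-style error result with structured recovery hints in the text body.
function recoverableError(
    code: string,
    message: string,
    suggestion: string,
    suggestedArgs: Record<string, unknown>,
) {
    return {
        isError: true,
        content: [{
            type: 'text',
            // JSON in the text body keeps the hint machine-parseable by the model.
            text: JSON.stringify({ code, message, recovery: { suggestion, suggestedArgs } }),
        }],
    };
}
```

The error code gives the model something to branch on, and `suggestedArgs` gives it ready-made arguments for the retry — the two pieces that turn "I encountered an error" into a self-correcting next call.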

## The Architecture Difference

```text
Without MVA:                          With MVA:
┌──────────┐                          ┌──────────┐
│  Handler  │→ JSON.stringify() →     │  Handler  │→ raw data →
│           │  raw data to LLM        │           │
└──────────┘                          └──────────┘

                                      ┌──────────────────────┐
                                      │     Presenter        │
                                      │ ┌──────────────────┐ │
                                      │ │ Schema (strict)  │ │
                                      │ │ System Rules     │ │
                                      │ │ UI Blocks        │ │
                                      │ │ Agent Limit      │ │
                                      │ │ Suggest Actions  │ │
                                      │ │ Embeds           │ │
                                      │ └──────────────────┘ │
                                      └──────────────────────┘

                                      Structured Perception
                                      Package → LLM
```

## Summary

|  | Without MVA | With MVA |
| --- | --- | --- |
| Lines of code per tool | 20-50 (routing + validation + formatting) | 3-5 (handler only — framework handles the rest) |
| Security | Hope you didn't forget to strip fields | Schema IS the boundary. `.strict()` rejects. Automatic. |
| Agent accuracy | ~60-70% on complex tasks | ~95%+ with deterministic rules and affordances |
| Token cost per call | High (raw dumps, large payloads) | Low (guardrails, TOON encoding, truncation) |
| Maintenance | Every tool re-implements rendering | Presenter defined once, reused across all tools |