Document vMCP performance and health check configuration #538
base: main
Conversation
Pull request overview
Adds operator-focused documentation for Virtual MCP Server (vMCP) deployment planning/sizing and for configuring and observing backend health checks, to help plan production deployments and monitor backend availability.
Changes:
- Added “Deployment planning” guidance (baseline resources, scaling factors, and operational indicators) to the vMCP introduction.
- Added a new “Configure health checks” section to backend discovery docs, including CRD-based configuration examples and operational monitoring notes.
- Updated the vMCP configuration guide to link to the new health check documentation and related backend discovery info.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| docs/toolhive/guides-vmcp/intro.mdx | Adds deployment planning guidance for sizing/capacity and scaling considerations. |
| docs/toolhive/guides-vmcp/configuration.mdx | Updates “Next steps” and related links to point readers to health checks and backend discovery. |
| docs/toolhive/guides-vmcp/backend-discovery.mdx | Documents health check configuration, circuit breaker settings, timeouts, and health status monitoring. |
Pull request overview
Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.
Pull request overview
Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.
Pull request overview
Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.
**jerm-dro** left a comment:
Just a few blocking questions, but otherwise LGTM
Pull request overview
Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.
> :::caution[Network overhead]
>
> Enabling health checks for remote backends increases network traffic to external
> services. Only enable this if you need real-time health status for remote
> endpoints.
>
> :::
>
> #### Degraded backend detection
>
> Backends are marked as **degraded** when:
>
> - Health checks succeed but response times exceed 5 seconds (slow performance)
> - Backend recently recovered from failures and is stabilizing
>
> Degraded backends remain in the routing table but may indicate performance
> problems.
>
> #### Health check behavior
>
> 1. **Initial check**: All backends checked immediately on startup
> 2. **Periodic checks**: Repeated at `healthCheckInterval` (default: 30s)
> 3. **Status updates**: Reported to Kubernetes at `statusReportingInterval`
>    (default: 30s)
> 4. **Backend unhealthy**: After `unhealthyThreshold` consecutive failures
>    (default: 3), backend marked unhealthy
> 5. **Recovery**: One successful check marks backend as healthy (or degraded if
>    slow)
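As a rough sketch, the three parameters in the numbered steps above could appear together in a VirtualMCPServer resource like this (the API group/version, resource name, and the placement of these fields directly under `spec` are assumptions for illustration; the actual CRD schema is defined in the files under review):

```yaml
apiVersion: toolhive.stacklok.dev/v1alpha1 # assumed API group/version
kind: VirtualMCPServer
metadata:
  name: my-vmcp # hypothetical name
spec:
  # Health check settings referenced above, with their documented defaults
  healthCheckInterval: 30s # how often each backend is probed
  statusReportingInterval: 30s # how often status is reported to Kubernetes
  unhealthyThreshold: 3 # consecutive failures before marking a backend unhealthy
```

Raising `healthCheckInterval` reduces probe traffic at larger backend counts, at the cost of slower detection of failures.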
This seems like too much content for this page. I'd recommend deleting it. This page is really just "how do I configure X" without detail on the implementation or implications.
> ## Next steps
>
> - Review [performance and sizing guidance](./performance.mdx) for deployment
>   planning
> - Follow the [Quickstart: Virtual MCP Server](../tutorials/quickstart-vmcp.mdx)
>   tutorial
This section is redundant with "Related Information." I'd also recommend putting the Quickstart at the top of the list, given this is an intro page.
> ## Backend scale recommendations
>
> vMCP performs well across different scales:
>
> | Backend Count | Use Case                      | Notes                                |
> | ------------- | ----------------------------- | ------------------------------------ |
> | 1-5           | Small teams, focused toolsets | Minimal resource overhead            |
> | 5-15          | Medium teams, diverse tools   | Recommended range for most use cases |
> | 15-30         | Large teams, comprehensive    | Increase health check interval       |
> | 30+           | Enterprise-scale deployments  | Consider multiple vMCP instances     |
Suggestion: delete. This is a long page and doesn't add much information.
> :::info[Why no replicas field?]
>
> VirtualMCPServer intentionally omits a `spec.replicas` field to avoid conflicts
> with HPA/VPA autoscaling. This design allows you to choose between static
> scaling (kubectl) or dynamic autoscaling (HPA/VPA) without operator
> interference.
>
> For static replica counts, scale the Deployment after creating the
> VirtualMCPServer. The operator will preserve your scaling configuration.
Suggestion: delete. It's redundant with the information above.
> ### Monitoring
>
> Track these metrics via [telemetry integration](./telemetry-and-metrics.mdx):
>
> | Metric                    | Healthy State | Action Threshold           |
> | ------------------------- | ------------- | -------------------------- |
> | Backend request latency   | P95 < SLO     | Alert on spikes            |
> | Backend error rate        | < 1%          | Investigate > 5%           |
> | Health check success rate | > 95%         | Early warning              |
> | Workflow execution time   | Varies        | Check for serial execution |
>
> **Setup:** Create dashboards for trend analysis and configure alerts for
> anomalies. Catches degradation before users notice.
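The error-rate row in the table above could be wired into a Prometheus alerting rule along these lines. The metric names here are placeholders, not metrics the telemetry integration is known to export; substitute the names your exporter actually produces:

```yaml
# Sketch of a Prometheus alerting rule for the "investigate > 5%" threshold.
# vmcp_backend_requests_errors_total / vmcp_backend_requests_total are
# hypothetical metric names used only for illustration.
groups:
  - name: vmcp-backend-health
    rules:
      - alert: VMCPBackendErrorRateHigh
        expr: |
          sum(rate(vmcp_backend_requests_errors_total[5m]))
            / sum(rate(vmcp_backend_requests_total[5m])) > 0.05
        for: 10m # require the condition to hold before firing
        labels:
          severity: warning
        annotations:
          summary: vMCP backend error rate above 5%
```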
Suggestion: delete. Anyone operating vMCP at scale would already be doing this.
> :::caution[Backend scaling]
>
> When scaling vMCP horizontally, the backend MCP servers will also see increased
> load. Ensure your backend deployments (MCPServer resources) are also scaled
> appropriately to handle the additional traffic.
>
> :::
>
> **Session affinity is required** when using multiple replicas. Clients must be
> routed to the same vMCP instance for the duration of their session. Configure
> based on your deployment:
>
> - **Kubernetes Service**: Use `sessionAffinity: ClientIP` for basic
>   client-to-pod stickiness
>   - Note: This is IP-based and may not work well behind proxies or with
>     changing client IPs
> - **Ingress Controller**: Configure cookie-based sticky sessions (recommended)
>   - nginx: Use `nginx.ingress.kubernetes.io/affinity: cookie`
>   - Other controllers: Consult your Ingress controller documentation
> - **Gateway API**: Use appropriate session affinity configuration based on your
>   Gateway implementation
>
> :::tip[Session affinity recommendations]
>
> - For **stateless backends**: Cookie-based sticky sessions work well and
>   provide reliable routing through proxies
> - For **stateful backends** (Playwright, databases): Consider vertical scaling
>   or dedicated vMCP instances instead of horizontal scaling with session
>   affinity, as session resumption may not work reliably
>
> :::
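The `sessionAffinity: ClientIP` option in the first bullet is standard Kubernetes Service configuration. A minimal sketch, assuming a Service and pod label named `vmcp` (both hypothetical) and the vMCP listening port is 8080:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: vmcp # hypothetical Service name
spec:
  selector:
    app: vmcp # hypothetical pod label
  sessionAffinity: ClientIP # route a given client IP to the same pod
  sessionAffinityConfig:
    clientIP:
      timeoutSeconds: 10800 # sticky window (Kubernetes default: 3 hours)
  ports:
    - port: 8080
      targetPort: 8080
```

As the note warns, ClientIP affinity breaks down when many clients share a source IP (corporate NAT, proxies), which is why the cookie-based Ingress approach is the recommended option above.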
Suggestion: delete. I'd prefer to reduce the information here. A lot of this is specific to the environment vMCP is specifically scaled in.
> @@ -0,0 +1,287 @@
> ---
> title: Performance and sizing
Blocker: rename this page "Scaling" and keep the content just focused on:
- How can I vertically scale?
- How can I horizontally scale?
- When is horizontally scaling hard?
> ### Baseline resources
>
> **Minimal deployment** (development/testing):
>
> - **CPU**: 100m (0.1 cores)
> - **Memory**: 128Mi
>
> **Production deployment** (recommended):
>
> - **CPU**: 500m (0.5 cores)
> - **Memory**: 512Mi
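The production-tier numbers above translate directly into a standard Kubernetes container `resources` block. The limit values here are illustrative headroom, not figures from the document:

```yaml
# Container resources for the recommended production baseline above
resources:
  requests:
    cpu: 500m # 0.5 cores, per the production baseline
    memory: 512Mi
  limits:
    cpu: "1" # illustrative headroom above the 500m request
    memory: 1Gi # illustrative; size to your observed working set
```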
> ## When to scale
>
> ### Scale up (increase resources)
>
> Increase CPU and memory when you observe:
>
> - High CPU usage (>70% sustained) during normal operations
> - Memory pressure or OOM (out-of-memory) kills
> - Slow response times (>1 second) for simple tool calls
> - Health check timeouts or frequent backend unavailability
>
> ### Scale out (increase replicas)
>
> Add more vMCP instances when:
>
> - CPU usage remains high despite increasing resources
> - You need higher availability and fault tolerance
> - Request volume exceeds capacity of a single instance
> - You want to distribute load across multiple availability zones
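Since VirtualMCPServer deliberately leaves `spec.replicas` unset, the scale-out signals above can be automated with a standard HorizontalPodAutoscaler targeting the Deployment the operator manages. A sketch, assuming that Deployment is named `vmcp` (hypothetical):

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: vmcp-hpa # hypothetical name
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: vmcp # hypothetical Deployment created for the vMCP instance
  minReplicas: 2 # >1 replica requires session affinity (see above)
  maxReplicas: 5
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70 # matches the ">70% sustained" CPU signal
```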
Suggestion: delete. This is redundant with information elsewhere.
> items: [
>   'toolhive/guides-vmcp/intro',
>   'toolhive/guides-vmcp/configuration',
>   'toolhive/guides-vmcp/performance',
Performance should be the bottom-most item in the vMCP guide. It's arguably the most advanced topic, so it should come last.

Description
Add comprehensive documentation for Virtual MCP Server sizing guidance and health check configuration to help operators plan deployments and monitor backend availability.
Type of change
Related issues/PRs
#512
Submitter checklist
Content and formatting
Navigation
- Sidebar (`sidebars.ts`) updated for added, deleted, reordered, or renamed files
- `vercel.json` updated for moved, renamed, or deleted pages (i.e., if the URL slug changed)
Content