An MCP server that autonomously evaluates web applications.

What is Operative Sh web eval agent

๐Ÿš€ operative.sh web-eval-agent MCP Server

Let the coding agent debug itself, you've got better things to do.

Demo

๐Ÿ”ฅ Supercharge Your Debugging

operative.sh's MCP Server launches a browser-use powered agent to autonomously execute and debug web apps directly in your code editor.

โšก Features

  • ๐ŸŒ Navigate your webapp using BrowserUse (2x faster with operative backend)
  • ๐Ÿ“Š Capture network traffic - requests are intelligently filtered and returned into the context window
  • ๐Ÿšจ Collect console errors - captures logs & errors
  • ๐Ÿค– Autonomous debugging - the Cursor agent calls the web QA agent mcp server to test if the code it wrote works as epected end-to-end.

๐Ÿงฐ MCP Tool Reference

Tool Purpose
web_eval_agent ๐Ÿค– Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report.
setup_browser_state ๐Ÿ”’ Opens an interactive (non-headless) browser so you can sign in once; the saved cookies/local-storage are reused by subsequent web_eval_agent runs.

Key arguments

  • web_eval_agent

    • url (required) โ€“ address of the running app (e.g. http://localhost:3000)
    • task (required) โ€“ natural-language description of what to test ("run through the signup flow and note any UX issues")
    • headless_browser (optional, default false) โ€“ set to true to hide the browser window
  • setup_browser_state

    • url (optional) โ€“ page to open first (handy to land directly on a login screen)

You can trigger these tools straight from your IDE chat, for example:

Evaluate my app at http://localhost:3000 โ€“ run web_eval_agent with the task "Try the full signup flow and report UX issues".

๐Ÿ Quick Start (macOS/Linux)

  1. Pre-requisites (typically not needed):
  • brew: /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
  • npm: (brew install npm)
  • jq: brew install jq
  1. Run the installer after getting an api key (free)
curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh
  1. Visit your favorite IDE and restart to apply the changes
  2. Send a prompt in chat mode to call the web eval agent tool! e.g.
Test my app on http://localhost:3000. Use web-eval-agent.

๐Ÿ› ๏ธ Manual Installation

  1. Get your API key at operative.sh
  2. Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh)
  1. Install playwright:
npm install -g chromium playwright && uvx --with playwright playwright install --with-deps
  1. Add below JSON to your relevant code editor with api key
  2. Restart your code editor

๐Ÿ”ƒ Updating

  • uv cache clean
  • refresh MCP server
    "web-eval-agent": {
      "command": "uvx",
      "args": [
        "--refresh-package",
        "webEvalAgent",
        "--from",
        "git+https://github.com/Operative-Sh/web-eval-agent.git",
        "webEvalAgent"
      ],
      "env": {
        "OPERATIVE_API_KEY": "<YOUR_KEY>"
      }
    }

Operative Discord Server

๐Ÿ› ๏ธ Manual Installation (Mac + Cursor/Cline/Windsurf)

  1. Get your API key at operative.sh
  2. Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh)
  1. Install playwright:
npm install -g chromium playwright && uvx --with playwright playwright install --with-deps
  1. Add below JSON to your relevant code editor with api key
  2. Restart your code editor

Manual Installation (Windows + Cursor/Cline/Windsurf)

We're refining this, please open an issue if you have any issues!

  1. Do all this in your code editor terminal
  2. curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh
  3. Get your API key at operative.sh
  4. Install uv (curl -LsSf https://astral.sh/uv/install.sh | sh)
  5. uvx --from git+https://github.com/Operative-Sh/web-eval-agent.git playwright install
  6. Restart code editor

๐Ÿšจ Issues

  • Updates aren't being received in code editors, update or reinstall for latest version: Run uv cache clean for latest
  • Any issues feel free to open an Issue on this repo or in the discord!
  • 5/5 - static apps without changes weren't screencasting, fixed! uv clean + restart to get fix

Changelog

  • 4/29 - Agent overlay update - pause/play/stop agent run in the browser

๐Ÿ“‹ Example MCP Server Output Report

๐Ÿ“Š Web Evaluation Report for http://localhost:5173 complete!
๐Ÿ“ Task: Test the API-key deletion flow by navigating to the API Keys section, deleting a key, and judging the UX.

๐Ÿ” Agent Steps
  ๐Ÿ“ 1. Navigate โ†’ http://localhost:5173
  ๐Ÿ“ 2. Click     "Login"        (button index 2)
  ๐Ÿ“ 3. Click     "API Keys"     (button index 4)
  ๐Ÿ“ 4. Click     "Create Key"   (button index 9)
  ๐Ÿ“ 5. Type      "Test API Key" (input index 2)
  ๐Ÿ“ 6. Click     "Done"         (button index 3)
  ๐Ÿ“ 7. Click     "Delete"       (button index 10)
  ๐Ÿ“ 8. Click     "Delete"       (confirm index 3)
๐Ÿ Flow tested successfully โ€“ UX felt smooth and intuitive.

๐Ÿ–ฅ๏ธ Console Logs (10)
  1. [debug] [vite] connectingโ€ฆ
  2. [debug] [vite] connected.
  3. [info]  Download the React DevTools โ€ฆ
     โ€ฆ

๐ŸŒ Network Requests (10)
  1. GET /src/pages/SleepingMasks.tsx                   304
  2. GET /src/pages/MCPRegistryRegistry.tsx             304
     โ€ฆ

โฑ๏ธ Chronological Timeline
  01:16:23.293 ๐Ÿ–ฅ๏ธ Console [debug] [vite] connectingโ€ฆ
  01:16:23.303 ๐Ÿ–ฅ๏ธ Console [debug] [vite] connected.
  01:16:23.312 โžก๏ธ GET /src/pages/SleepingMasks.tsx
  01:16:23.318 โฌ…๏ธ 304 /src/pages/SleepingMasks.tsx
     โ€ฆ
  01:17:45.038 ๐Ÿค– ๐Ÿ Flow finished โ€“ deletion verified
  01:17:47.038 ๐Ÿค– ๐Ÿ“‹ Conclusion repeated above
๐Ÿ‘๏ธ  See the "Operative Control Center" dashboard for live logs.

Star History

Star History Chart


Built with <3 @ operative.sh

Leave a Comment

Frequently Asked Questions

What is MCP?

MCP (Model Context Protocol) is an open protocol that standardizes how applications provide context to LLMs. Think of MCP like a USB-C port for AI applications, providing a standardized way to connect AI models to different data sources and tools.

What are MCP Servers?

MCP Servers are lightweight programs that expose specific capabilities through the standardized Model Context Protocol. They act as bridges between LLMs like Claude and various data sources or services, allowing secure access to files, databases, APIs, and other resources.

How do MCP Servers work?

MCP Servers follow a client-server architecture where a host application (like Claude Desktop) connects to multiple servers. Each server provides specific functionality through standardized endpoints and protocols, enabling Claude to access data and perform actions through the standardized protocol.

Are MCP Servers secure?

Yes, MCP Servers are designed with security in mind. They run locally with explicit configuration and permissions, require user approval for actions, and include built-in security features to prevent unauthorized access and ensure data privacy.

Related MCP Servers

chrisdoc hevy mcp avatar

chrisdoc hevy mcp

mcp
sylphlab pdf reader mcp avatar

sylphlab pdf reader mcp

An MCP server built with Node.js/TypeScript that allows AI agents to securely read PDF files (local or URL) and extract text, metadata, or page counts. Uses pdf-parse.

pdf-parsetypescriptnodejs
aashari mcp server atlassian bitbucket avatar

aashari mcp server atlassian bitbucket

Node.js/TypeScript MCP server for Atlassian Bitbucket. Enables AI systems (LLMs) to interact with workspaces, repositories, and pull requests via tools (list, get, comment, search). Connects AI directly to version control workflows through the standard MCP interface.

atlassianrepositorymcp
aashari mcp server atlassian confluence avatar

aashari mcp server atlassian confluence

Node.js/TypeScript MCP server for Atlassian Confluence. Provides tools enabling AI systems (LLMs) to list/get spaces & pages (content formatted as Markdown) and search via CQL. Connects AI seamlessly to Confluence knowledge bases using the standard MCP interface.

atlassianmcpconfluence
prisma prisma avatar

prisma prisma

Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB

cockroachdbgomcp
Zzzccs123 mcp sentry avatar

Zzzccs123 mcp sentry

mcp sentry for typescript sdk

mcptypescript
zhuzhoulin dify mcp server avatar

zhuzhoulin dify mcp server

mcp
zhongmingyuan mcp my mac avatar

zhongmingyuan mcp my mac

mcp
zhixiaoqiang desktop image manager mcp avatar

zhixiaoqiang desktop image manager mcp

MCP ๆœๅŠกๅ™จ๏ผŒ็”จไบŽ็ฎก็†ๆกŒ้ขๅ›พ็‰‡ใ€ๆŸฅ็œ‹่ฏฆๆƒ…ใ€ๅŽ‹็ผฉใ€็งปๅŠจ็ญ‰๏ผˆๅฎŒๅ…จ่ฎฉTraeๅฎž็Žฐ๏ผ‰

mcp
zhixiaoqiang antd components mcp avatar

zhixiaoqiang antd components mcp

An MCP service for Ant Design components query | ไธ€ไธชๅ‡ๅฐ‘ Ant Design ็ป„ไปถไปฃ็ ็”Ÿๆˆๅนป่ง‰็š„ MCP ๆœๅŠก๏ผŒๅŒ…ๅซ็ณป็ปŸๆ็คบ่ฏใ€็ป„ไปถๆ–‡ๆกฃใ€API ๆ–‡ๆกฃใ€ไปฃ็ ็คบไพ‹ๅ’Œๆ›ดๆ–ฐๆ—ฅๅฟ—ๆŸฅ่ฏข

designantdapi

Submit Your MCP Server

Share your MCP server with the community

Submit Now