Rod MCP Server

Rod MCP Server avatar

by go-rod

golangmcpBrowser Automation

Model Context Protocol Server of Rod

What is Rod MCP Server

Rod MCP Server

Wow! It's awesome, now you can use the MCP server of Rod!🚀

Rod-MCP provides browser automation capabilities for your applications by using Rod. The server provides many useful mcp tools enable LLMs to interact with the web pages, like click, take screenshot, save page as pdf etc.

Features

  • 🚀 Browser automation powered by Rod
  • 🎯 Rich web interaction capabilities
    • Element clicking
    • Screenshot capture
    • PDF generation
    • And more...
  • 🎨 Headless/GUI mode support
  • ⚡ High performance and stability
  • 🔧 Easy to configure and extend
  • 🤖 Designed for LLMs interaction

Installation

Prerequisites

  • Go 1.23 or higher
  • Chrome/Chromium browser

Steps

  1. Clone the repository:
git clone https://github.com/go-rod/rod-mcp.git
cd rod-mcp
  1. Install dependencies:
go mod tidy
  1. Build the project:
go build

Usage

Basic Usage

  1. Clone Repo and Build Self or Go to Download Release
  2. Configure MCP:
{
    "mcpServers": {
        "rod-mcp": {
            "command": "rod-mcp",
            "args": [
                "-c", "rod-mcp.yaml"
            ]
        }
    }
}

Configuration

The configuration file supports the following options:

  • serverName: Server name, default is "Rod Server"
  • browserBinPath: Browser executable file path, use system default browser if empty
  • headless: Whether to run the browser in headless mode, default is false
  • browserTempDir: Browser temporary file directory, default is "./rod/browser"
  • noSandbox: Whether to disable sandbox mode, default is false
  • proxy: Proxy server settings, supports socks5 proxy

Project Structure

rod-mcp/
├── assets/          # Static resources
├── banner/          # Banner resources
├── cmd.go           # Command line processing
├── main.go          # Program entry
├── resources/       # Resource files
├── server.go        # Server implementation
├── tools/           # Tool implementation
├── types/           # Type definitions
└── utils/           # Utility functions

Contribution Guidelines

Welcome to submit Pull Request or create Issue!

License

This project is licensed under the MIT License - see the LICENSE file

Leave a Comment

Comments section will be available soon. Stay tuned!

Frequently Asked Questions

What is MCP?

MCP (Model Context Protocol) is an open protocol that standardizes how applications provide context to LLMs. Think of MCP like a USB-C port for AI applications, providing a standardized way to connect AI models to different data sources and tools.

What are MCP Servers?

MCP Servers are lightweight programs that expose specific capabilities through the standardized Model Context Protocol. They act as bridges between LLMs like Claude and various data sources or services, allowing secure access to files, databases, APIs, and other resources.

How do MCP Servers work?

MCP Servers follow a client-server architecture where a host application (like Claude Desktop) connects to multiple servers. Each server provides specific functionality through standardized endpoints and protocols, enabling Claude to access data and perform actions through the standardized protocol.

Are MCP Servers secure?

Yes, MCP Servers are designed with security in mind. They run locally with explicit configuration and permissions, require user approval for actions, and include built-in security features to prevent unauthorized access and ensure data privacy.