Safari MCP Server
Provides tools for controlling and automating the Safari browser, enabling visual web access, tab management, navigation, element interaction, and screenshots through Safari's automation interfaces.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Safari MCP ServerNavigate to example.com"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Safari MCP Server
A MCP (Model Context Protocol) server for visual web access through Safari.
Features
Visual Web Access: Viewport screenshots for visual inspection
Full Authentication State: Operates on the user's actual Safari session with cookies preserved
Tab-Aware Targeting: Act operations target a captured working tab, observe operations follow the user's focus
Viewport Scrolling: Pixel amount or viewport-page index
Element Inspection: Tag, visibility, attributes, bounding rect for any CSS selector
Interaction Tools: Click, type, hover, select option
Wait Conditions: Selector to appear, disappear, or page text to render
Link Extraction: Anchor links as
{ text, href }pairsBrowser History: Back and forward navigation
Console Error Capture: Two-phase injection during and after page load
Prerequisites
macOS → System Settings → Desktop & Dock → Windows → "Prefer tabs when opening documents" option must be set to Always
macOS → System Settings → Privacy & Security → Screen & System Audio Recording → Terminal app must be enabled
Safari → Settings → Developer → Automation → "Allow JavaScript from Apple Events" option must be enabled
MCP Server Configuration
Add to mcp.json servers configuration:
{
"mcpServers": {
"safari": {
"command": "npx",
"args": ["-y", "@axivo/mcp-safari"],
"env": {
"SAFARI_WINDOW_HEIGHT": "1600"
}
}
}
}Environment Variables
All variables are optional:
SAFARI_PAGE_TIMEOUT- Page load and selector wait timeout, in milliseconds (default:10000)SAFARI_WINDOW_BOUNDS- Browser window margin offset from top-left corner, in pixels (default:20)SAFARI_WINDOW_HEIGHT- Browser window height, in pixels (default:1024)SAFARI_WINDOW_WIDTH- Browser window width, in pixels (default:1280)
Prompt Examples
"Open Safari and use
statustool for guidelines""Navigate to
example.com""Search for
example query""Take a screenshot of the current page"
"Read the page content to understand what's on the page"
"Click the 'Sign In' button"
"Type my email into the login form and submit"
"Refresh the page to see the latest changes"
"Go back to the previous page"
"Navigate forward two steps in browser history"
"Scroll down 500 pixels"
"Scroll to page 3 of this article"
"Search for 'Claude AI' and click the first result"
"List all open browser tabs"
"Open a new browser tab and go to
example.com""Switch to the first browser tab"
"Close the second browser tab"
"Inspect the submit button before clicking it"
"Hover over the Products menu"
"Choose 'Canada' in the country dropdown"
"Wait for the loading spinner to disappear"
"Read all links on this page"
The "use status tool" instruction helps Claude pause and process the _meta.usage guidelines before interacting with the browser.
MCP Tools
Call status first at session start to get the runtime state and full tool surface:
Act tools target a captured working tab
Observe tools target the front window's current tab
clickClick an element on the working tab
Type:
acttoolOptional inputs:
key(string): Key to press (e.g., Escape, ArrowRight, Enter, Tab)selector(string): CSS selector for the target elementtext(string): Visible text or aria-label to matchwait(string): CSS selector to wait for after clickx(number): X coordinate in pixelsy(number): Y coordinate in pixels
Returns: Result with change detection
closeClose the working tab
Type:
acttool
executeExecute JavaScript in the working tab
Type:
acttoolRequired inputs:
script(string): JavaScript code
hoverDispatch hover events to reveal hover-triggered UI
Type:
acttoolOptional inputs (one is required):
selector(string): CSS selector for the target elementtext(string): Visible text to match
inspectReturn element metadata for a CSS selector
Type:
observetoolRequired inputs:
selector(string): CSS selector for the target element
Optional inputs:
index(number): Tab index in the front window
Returns:
{ found, tag, text, visible, disabled, attributes, rect }
navigateNavigate the working tab to a URL or through history
Type:
acttoolOptional inputs (
urlordirectionrequired):direction(string:backorforward)selector(string): CSS selector to wait for after loadsteps(number, default: 1): Steps for history navigationurl(string): URL to navigate to
openOpen a blank tab as the working target
Type:
acttool
readGet page title, URL, and text or links from a tab
Type:
observetoolOptional inputs:
index(number): Tab index in the front windowmode(string:textorlinks, default:text)selector(string): CSS selector to scope extraction
refreshRefresh the working tab
Type:
acttoolOptional inputs:
hard(boolean, default: false): Bypass cacheselector(string): CSS selector to wait for after reload
screenshotCapture the Safari window, an element, the full page, or the screen
Type:
observetoolOptional inputs:
display(number): Display index forscreenmode, 1-based, defaults to the main displaymode(string:element,page,screen,window, default:window): Capture modeselector(string): CSS selector forelementmodesettle(number): Forpagemode, milliseconds to wait after each scroll for content to settle (default:500). Raise for slow dynamic sites, lower for static sites.share(boolean, default:false): Save to disk and return only the file path instead of the inline image
Returns: Inline base64 image when
shareisfalse, or{ path, width, height, mimeType, ... }whenshareistrue. Browser metadata{ innerHeight, scrollHeight, pages }is included for non-screenmodes.
scrollScroll by direction or to a viewport-page index
Type:
observetoolOptional inputs:
direction(string:upordown)page(number): Viewport-page index to scroll topixels(number): Pixels to scroll, paired withdirection
searchSearch using the browser's default engine
Type:
acttoolRequired inputs:
text(string): Search query
selectChoose an option in a
<select>elementType:
acttoolRequired inputs:
selector(string): CSS selector for the<select>
Optional inputs (one is required):
text(string): Option visible textvalue(string): Option value attribute
statusReturn current Safari tabs and full tool surface
Type:
observetoolReturns:
{ tabs, tools }
typeType text into an input field
Type:
acttoolRequired inputs:
text(string): Text to type
Optional inputs:
append(boolean, default: false): Append instead of replaceselector(string): CSS selector for the inputsubmit(boolean, default: false): Press Enter after typing
waitWait for selector or page text condition
Type:
observetoolOptional inputs (exactly one of the first three required):
selector(string): CSS selector to wait forselectorGone(string): CSS selector to wait absenttext(string): Page text to wait fortimeoutMs(number): Timeout in milliseconds
Returns:
{ matched, elapsedMs }
windowManage browser window tabs
Type:
observetoolRequired inputs:
action(string:close,list,open,switch)
Optional inputs:
index(number): Tab index forcloseandswitchurl(string): URL foropen
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/axivo/mcp-safari'
If you have feedback or need assistance with the MCP directory API, please join our Discord server