navvi_browse
Perform web browsing and interactions through natural language instructions. Automates navigation, clicking, typing, and screenshots, handling cookie banners, login forms, and CAPTCHAs.
Instructions
PRIMARY BROWSING TOOL — use this for ANY web interaction instead of manually calling navvi_open, navvi_find, navvi_click, navvi_fill. Give a natural language instruction and optional URL; it handles navigation, element finding, clicking, typing, and screenshots internally.
Examples:
navvi_browse(instruction="search for 'Python FastMCP'", url="https://duckduckgo.com")
navvi_browse(instruction="click the first link in the results")
navvi_browse(instruction="accept cookie banners and screenshot the clean page", url="https://example.com")
navvi_browse(instruction="read the inbox and list unread emails", url="https://app.tuta.com")
Handles cookie banners, login detection, CAPTCHAs (escalates to VNC), and multi-step flows automatically. Returns screenshots and a step-by-step log.
If a stored flow recipe exists for the target domain, it will be used to guide or fast-track execution depending on confidence level. After completion, you'll be prompted to save new flows for reuse.
Only fall back to atomic tools (navvi_open, navvi_find, navvi_click) if this tool explicitly asks for guidance.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| instruction | Yes | ||
| url | No | ||
| max_steps | No | ||
| persona | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |