browse_web
Automate web browsing tasks using AI to navigate websites, click buttons, fill forms, and extract information through natural language commands.
Instructions
Browse the web to complete a task using AI-powered browser automation.
The AI agent can navigate websites, click buttons, fill forms, search for information,
and interact with web pages just like a human user. This runs synchronously and returns
when the task is complete.
Args:
task: What you want to accomplish (e.g., "Find the top 3 gaming laptops on Amazon")
url: Starting webpage (defaults to Google)
Returns:
Dictionary containing:
- ok: Boolean indicating success
- data: Task completion message with results
- screenshot_dir: Path to saved screenshots
- session_id: Unique session identifier
- progress: List of actions taken during browsing
- error: Error message (if task failed)
Examples:
- "Search for Python tutorials and summarize the top result"
- "Go to example.com and click the login button"
- "Find product reviews for iPhone 15 Pro"
Note: For long-running tasks, consider using start_web_task instead.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
task | Yes | ||
url | No | https://www.google.com |
Input Schema (JSON Schema)
{
"properties": {
"task": {
"title": "Task",
"type": "string"
},
"url": {
"default": "https://www.google.com",
"title": "Url",
"type": "string"
}
},
"required": [
"task"
],
"type": "object"
}