scrapeBalanced

Extract web content efficiently with a balanced approach, including images and paginated data, while controlling parameters like scroll attempts, timeouts, and image size for precise results.

Instructions

Balanced web scraping approach with good coverage and reasonable speed

Input Schema

Name	Required	Description
`downloadImages`	No	Whether to download images locally
`maxImages`	No	Maximum number of images to extract
`maxScrolls`	No	Maximum number of scroll attempts (default: 10)
`minImageSize`	No	Minimum width/height for images in pixels
`output`	No	Output directory for downloaded images
`pages`	No	Number of pages to scrape (if pagination is present)
`scrapeImages`	No	Whether to include images in the scrape result
`scrollDelay`	No	Delay between scrolls in ms (default: 2000)
`timeout`	No	Maximum time in ms for the scrape operation (default: 30000)
`url`	Yes	URL of the webpage to scrape

Input Schema (JSON Schema)

{
  "properties": {
    "downloadImages": {
      "description": "Whether to download images locally",
      "type": "boolean"
    },
    "maxImages": {
      "description": "Maximum number of images to extract",
      "type": "number"
    },
    "maxScrolls": {
      "description": "Maximum number of scroll attempts (default: 10)",
      "type": "number"
    },
    "minImageSize": {
      "description": "Minimum width/height for images in pixels",
      "type": "number"
    },
    "output": {
      "description": "Output directory for downloaded images",
      "type": "string"
    },
    "pages": {
      "description": "Number of pages to scrape (if pagination is present)",
      "type": "number"
    },
    "scrapeImages": {
      "description": "Whether to include images in the scrape result",
      "type": "boolean"
    },
    "scrollDelay": {
      "description": "Delay between scrolls in ms (default: 2000)",
      "type": "number"
    },
    "timeout": {
      "description": "Maximum time in ms for the scrape operation (default: 30000)",
      "type": "number"
    },
    "url": {
      "description": "URL of the webpage to scrape",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}

Prysm MCP Server

scrapeBalanced

Instructions

Input Schema

Input Schema (JSON Schema)

Other Tools from Prysm MCP Server

Related Tools

MCP directory API