s-fetch-pattern

Extracts content matching a regular expression from a web page while avoiding bot detection. Supports basic, stealth, and max-stealth fetching modes, and returns metadata and match positions for targeted follow-up requests.

Instructions

Extracts content matching a regular expression from a web page, with bot-detection avoidance. For best performance, start with 'basic' mode (the fastest) and escalate to 'stealth' or 'max-stealth' only if basic mode fails.

The result is returned as 'METADATA: {json}\n\n[content]', where the metadata includes match statistics and truncation information. Each matched chunk is delimited with '॥๛॥' and prefixed with '[Position: start-end]', which gives its byte position in the original document. These positions allow targeted follow-up requests with s-fetch-page using specific start_index values.
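The response layout described above can be unpacked with a few lines of Python. This is a minimal sketch, not the server's own parser: it assumes the metadata JSON sits on a single line before the first blank line, that whitespace may follow the position prefix, and the metadata key `total_matches` in the sample is made up for illustration.

```python
import json
import re

CHUNK_DELIMITER = "॥๛॥"
# Matches the '[Position: start-end]' prefix at the start of a chunk.
POSITION_RE = re.compile(r"^\s*\[Position: (\d+)-(\d+)\]\s*")


def parse_pattern_response(raw):
    """Split a 'METADATA: {json}\\n\\n[content]' response into (metadata, chunks)."""
    header, _, body = raw.partition("\n\n")
    prefix = "METADATA: "
    if not header.startswith(prefix):
        raise ValueError("response does not start with 'METADATA: '")
    metadata = json.loads(header[len(prefix):])

    chunks = []
    for piece in body.split(CHUNK_DELIMITER):
        match = POSITION_RE.match(piece)
        if match is None:
            continue  # skip empty pieces or text without a position prefix
        chunks.append({
            "start": int(match.group(1)),
            "end": int(match.group(2)),
            "text": piece[match.end():].strip(),
        })
    return metadata, chunks


# Hypothetical response text, built only to exercise the parser.
sample = (
    'METADATA: {"total_matches": 2}\n\n'
    "[Position: 10-40]\nfirst match"
    "॥๛॥"
    "[Position: 120-150]\nsecond match"
)
metadata, chunks = parse_pattern_response(sample)
```

Each chunk's `start` value is the kind of number that could be passed as `start_index` in a follow-up s-fetch-page request.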

Input Schema

Name	Required	Description	Default
`context_chars`	No	Number of characters to include before and after each match	200
`format`	No	Output format (html or markdown)	markdown
`max_length`	No	Maximum number of characters to return.	5000
`mode`	No	Fetching mode (basic, stealth, or max-stealth)	basic
`search_pattern`	Yes	Regular expression pattern to search for in the content
`url`	Yes	URL to fetch
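The `mode` parameter, together with the advice to start with basic mode and escalate only on failure, can be sketched as a small retry loop. This is an illustration under assumptions: `call_tool` is a hypothetical caller-supplied function that sends the arguments to the tool through whatever MCP client is in use, and a failed fetch is assumed to surface as a raised exception, which the source does not specify.

```python
# Modes ordered from fastest to most evasive, as listed in the table above.
MODES = ("basic", "stealth", "max-stealth")


def fetch_with_escalation(call_tool, url, search_pattern, **options):
    """Try each mode in order and return (mode, result) for the first success.

    call_tool is a hypothetical callable: call_tool(tool_name, arguments).
    """
    last_error = None
    for mode in MODES:
        arguments = {
            **options,  # e.g. context_chars, format, max_length
            "url": url,
            "search_pattern": search_pattern,
            "mode": mode,
        }
        try:
            return mode, call_tool("s-fetch-pattern", arguments)
        except Exception as exc:  # assumption: failure is reported as an exception
            last_error = exc
    raise RuntimeError("all fetch modes failed") from last_error
```

A caller would pass its own client wrapper as `call_tool`; the loop itself only depends on the parameter names documented in the table.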

Input Schema (JSON Schema)

{
  "properties": {
    "context_chars": {
      "default": 200,
      "description": "Number of characters to include before and after each match",
      "minimum": 0,
      "title": "Context Chars",
      "type": "integer"
    },
    "format": {
      "default": "markdown",
      "description": "Output format (html or markdown)",
      "title": "Format",
      "type": "string"
    },
    "max_length": {
      "default": 5000,
      "description": "Maximum number of characters to return.",
      "exclusiveMaximum": 1000000,
      "exclusiveMinimum": 0,
      "title": "Max Length",
      "type": "integer"
    },
    "mode": {
      "default": "basic",
      "description": "Fetching mode (basic, stealth, or max-stealth)",
      "title": "Mode",
      "type": "string"
    },
    "search_pattern": {
      "description": "Regular expression pattern to search for in the content",
      "title": "Search Pattern",
      "type": "string"
    },
    "url": {
      "description": "URL to fetch",
      "title": "Url",
      "type": "string"
    }
  },
  "required": [
    "url",
    "search_pattern"
  ],
  "title": "PatternFetchRequest",
  "type": "object"
}
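The constraints in this schema can be checked on the client side before a request is sent. The sketch below hand-codes the defaults, required fields, and numeric bounds from the schema above using only the standard library; the function name `build_request` is made up for illustration and is not part of the tool. Note that `exclusiveMinimum` and `exclusiveMaximum` mean `max_length` must be strictly between 0 and 1000000.

```python
DEFAULTS = {
    "context_chars": 200,
    "format": "markdown",
    "max_length": 5000,
    "mode": "basic",
}
REQUIRED = ("url", "search_pattern")


def _is_integer(value):
    # JSON Schema "integer" does not include booleans, although bool is an int in Python.
    return isinstance(value, int) and not isinstance(value, bool)


def build_request(**arguments):
    """Apply schema defaults and validate arguments for a PatternFetchRequest."""
    missing = [name for name in REQUIRED if name not in arguments]
    if missing:
        raise ValueError(f"missing required arguments: {missing}")

    request = {**DEFAULTS, **arguments}

    for name in ("url", "search_pattern", "format", "mode"):
        if not isinstance(request[name], str):
            raise ValueError(f"{name} must be a string")

    if not _is_integer(request["context_chars"]) or request["context_chars"] < 0:
        raise ValueError("context_chars must be an integer >= 0")

    if not _is_integer(request["max_length"]) or not 0 < request["max_length"] < 1000000:
        raise ValueError("max_length must be an integer with 0 < max_length < 1000000")

    return request


request = build_request(url="https://example.com", search_pattern=r"\$\d+\.\d{2}")
```

The schema types `format` and `mode` as plain strings, so this sketch does not restrict them to the values named in their descriptions.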
