s_fetch_pattern
Extract content matching regex patterns from web pages while avoiding bot detection. Retrieve specific website data with configurable modes for different security levels.
Instructions
Extracts content matching regex patterns from web pages. Retrieves specific content from websites with bot-detection avoidance. For best performance, start with 'basic' mode (fastest), then only escalate to 'stealth' or 'max-stealth' modes if basic mode fails. Returns matched content as 'METADATA: {json}\n\n[content]' where metadata includes match statistics and truncation information. Each matched content chunk is delimited with '॥๛॥' and prefixed with '[Position: start-end]' indicating its byte position in the original document, allowing targeted follow-up requests with s-fetch-page using specific start_index values.
Args:
url: URL to fetch
search_pattern: Regular expression pattern to search for in the content
mode: Fetching mode (basic, stealth, or max-stealth)
format: Output format (html or markdown)
max_length: Maximum number of characters to return.
context_chars: Number of characters to include before and after each matchInput Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | ||
| search_pattern | Yes | ||
| mode | No | basic | |
| format | No | markdown | |
| max_length | No | ||
| context_chars | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |