extract_timestamps
Extracts a webpage's creation, modification, and publication timestamps from HTML meta tags, HTTP headers, and structured data, given the page URL. Configurable options include the request timeout, user agent, redirect handling, and heuristic detection.
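To make the meta-tag source concrete, here is a minimal sketch of how timestamps might be read from HTML meta tags. It is not the tool's actual implementation: the helper name `extract_meta_timestamps`, the `MetaTimestampParser` class, and the particular set of meta keys are assumptions chosen for illustration, and HTTP headers, structured data, and heuristics are not covered.

```python
from html.parser import HTMLParser

# Assumption: a small, illustrative set of meta keys that commonly carry
# timestamps. The tool's real list is not documented here.
TIMESTAMP_META_KEYS = {
    "article:published_time": "published",
    "article:modified_time": "modified",
    "og:updated_time": "modified",
    "date": "published",
    "last-modified": "modified",
}


class MetaTimestampParser(HTMLParser):
    """Collects the first timestamp-like <meta> value seen for each kind."""

    def __init__(self):
        super().__init__()
        self.found = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Open Graph style tags use `property`; plain meta tags use `name`.
        key = (attr.get("property") or attr.get("name") or "").lower()
        content = attr.get("content")
        kind = TIMESTAMP_META_KEYS.get(key)
        if kind and content and kind not in self.found:
            self.found[kind] = content


def extract_meta_timestamps(html: str) -> dict:
    """Return a dict such as {"published": "...", "modified": "..."}."""
    parser = MetaTimestampParser()
    parser.feed(html)
    return parser.found
```

The values are returned as raw strings; parsing them into a common date format would be a separate step.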
Instructions
Extract creation, modification, and publication timestamps from a webpage
Input Schema
Name | Required | Description | Default |
---|---|---|---|
config | No | Optional configuration for the extraction | |
url | Yes | The URL of the webpage to extract timestamps from | |
Input Schema (JSON Schema)
{
  "properties": {
    "config": {
      "description": "Optional configuration for the extraction",
      "properties": {
        "enableHeuristics": {
          "description": "Whether to enable heuristic timestamp detection (default: true)",
          "type": "boolean"
        },
        "followRedirects": {
          "description": "Whether to follow HTTP redirects (default: true)",
          "type": "boolean"
        },
        "maxRedirects": {
          "description": "Maximum number of redirects to follow (default: 5)",
          "type": "number"
        },
        "timeout": {
          "description": "Request timeout in milliseconds (default: 10000)",
          "type": "number"
        },
        "userAgent": {
          "description": "User agent string to use for requests",
          "type": "string"
        }
      },
      "type": "object"
    },
    "url": {
      "description": "The URL of the webpage to extract timestamps from",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}
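As an illustration of how a caller might assemble arguments that satisfy this schema, the sketch below merges user-supplied options over the defaults stated in the schema descriptions. The function name `build_arguments` is hypothetical, and rejecting unknown config keys is a stricter choice than the schema itself requires; `userAgent` is left unset because no default is documented.

```python
# Defaults copied from the schema descriptions above.
DOCUMENTED_DEFAULTS = {
    "enableHeuristics": True,
    "followRedirects": True,
    "maxRedirects": 5,
    "timeout": 10000,  # milliseconds
}

# Python types corresponding to the JSON Schema "type" of each option.
EXPECTED_TYPES = {
    "enableHeuristics": bool,
    "followRedirects": bool,
    "maxRedirects": (int, float),
    "timeout": (int, float),
    "userAgent": str,
}


def build_arguments(url, config=None):
    """Build an arguments object for extract_timestamps (hypothetical helper)."""
    if not isinstance(url, str):
        raise ValueError("url is required and must be a string")
    merged = dict(DOCUMENTED_DEFAULTS)
    for key, value in (config or {}).items():
        expected = EXPECTED_TYPES.get(key)
        if expected is None:
            # Assumption: treat unknown keys as errors to catch typos early.
            raise ValueError(f"unknown config option: {key}")
        # bool is a subclass of int in Python, so reject it for number fields.
        if expected is not bool and isinstance(value, bool):
            raise ValueError(f"{key} has the wrong type")
        if not isinstance(value, expected):
            raise ValueError(f"{key} has the wrong type")
        merged[key] = value
    return {"url": url, "config": merged}
```

For example, `build_arguments("https://example.com/post", {"timeout": 5000})` keeps the documented defaults for everything except `timeout`.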