scrape_generic
Extract content from diverse formats like Jupyter, HTML, OpenAPI, and Confluence to create AI-ready skills and RAG knowledge bases.
Instructions
Scrape content from the new source types: jupyter, html, openapi, asciidoc, pptx, confluence, notion, rss, manpage, chat. This tool is a generic entry point that delegates to the appropriate CLI scraper module for the given source type.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| source_type | Yes | | |
| name | Yes | | |
| path | No | | |
| url | No | | |
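
As an illustration of the schema above, the following Python sketch assembles and checks an arguments payload for `scrape_generic` before it is sent to the tool. The helper name `build_scrape_generic_args` is hypothetical, and the rule that at least one of `path` or `url` must be supplied is an assumption (the schema only marks both as optional). The list of accepted `source_type` values is taken from the Instructions text.

```python
import json

# Source types listed in the tool's Instructions text.
SUPPORTED_SOURCE_TYPES = {
    "jupyter", "html", "openapi", "asciidoc", "pptx",
    "confluence", "notion", "rss", "manpage", "chat",
}


def build_scrape_generic_args(source_type, name, path=None, url=None):
    """Hypothetical helper: return an arguments dict for scrape_generic.

    Raises ValueError when a required field is missing or invalid.
    """
    if source_type not in SUPPORTED_SOURCE_TYPES:
        raise ValueError(f"unsupported source_type: {source_type!r}")
    if not name:
        raise ValueError("name is required")
    # Assumption: the tool needs a path or a url to locate the content.
    if path is None and url is None:
        raise ValueError("provide at least one of path or url")

    args = {"source_type": source_type, "name": name}
    # Optional fields are included only when supplied.
    if path is not None:
        args["path"] = path
    if url is not None:
        args["url"] = url
    return args


if __name__ == "__main__":
    # Placeholder values for illustration only.
    example = build_scrape_generic_args(
        "jupyter", "demo-notebook", path="notebooks/intro.ipynb"
    )
    print(json.dumps(example, indent=2))
```

Validating on the client side keeps malformed requests from reaching the scraper, though the tool itself remains the authority on which combinations of fields it accepts.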