Extract clean, readable text from any webpage by removing navigation, headers, and scripts. Use start_index and max_length to paginate through lengthy content.
MIT
Guardian Open Platform: content search, articles, sections, tags. Free dev key.
GOV.UK Content + Search APIs (every gov.uk page + full search)