Skip to main content
Glama
Soflutionltd

deepcrawl-mcp

by Soflutionltd

Clone, scrape and crawl any website. Free Firecrawl alternative. No API keys, no rate limits, no subscription.

Quick Start

Cursor (one-click): Click the "Install in Cursor" button above.

Manual MCP config:

{
  "mcpServers": {
    "deepcrawl-mcp": {
      "command": "npx",
      "args": ["-y", "deepcrawl-mcp@latest"]
    }
  }
}

CLI:

npx deepcrawl-mcp@latest

Related MCP server: mcp-server-scraper

Tools

deepcrawl_scrape

Scrape a single page and return clean markdown. Extracts title, description, links, images, and metadata. Strips navigation, footer, ads, and tracking.

"Scrape https://example.com and give me the main content"

Parameter

Default

Description

url

required

Page URL to scrape

mainContentOnly

true

Extract only main content (skip nav/footer)

includeLinks

true

Include discovered links

includeImages

true

Include image URLs

deepcrawl_clone

Clone a full page with all assets: HTML, CSS, JS, images, fonts, favicons. Downloads everything into a local folder, rewrites URLs to relative paths. Open index.html in a browser and it works.

"Clone https://competitor.com into a local folder"

Parameter

Default

Description

url

required

Page URL to clone

outputDir

~/deepcrawl-clones/<domain>

Output folder

depth

0

Link depth: 0 = single page, 1+ = follow links

deepcrawl_crawl

Crawl an entire site following internal links. Returns every page as clean markdown. Great for content analysis, SEO audit, or feeding a RAG pipeline.

"Crawl https://docs.example.com and return all pages as markdown"

Parameter

Default

Description

url

required

Starting URL

maxPages

20

Max pages to crawl (max: 100)

includeImages

false

Include image URLs per page

deepcrawl_map

Discover all URLs from a site via sitemap.xml parsing and homepage link crawling. Run this before a crawl to see the site's scope.

"Map all pages on https://example.com"

Parameter

Default

Description

url

required

Site URL

maxUrls

200

Max URLs to discover

vs Firecrawl

deepcrawl

Firecrawl

Price

Free

$19+/mo

API key

None

Required

Rate limits

None

Yes

Scrape to markdown

Yes

Yes

Full site crawl

Yes

Yes

Site map

Yes

Yes

Clone with assets

Yes

No

JS rendering

Yes (via Playwright)

Yes

Anti-bot bypass

Partial (UA rotation, headers, delays)

Yes

deepcrawl handles static sites out of the box. For JS-heavy SPAs (React, Next.js, SvelteKit), install Playwright for full rendering:

npm install -g playwright
npx playwright install chromium

Once installed, deepcrawl auto-detects Playwright and enables JS rendering. Use jsRender: true on any tool to activate it. All tools also include UA rotation, realistic browser headers, and random delays to avoid basic bot detection.

Also by Soflution

  • brandcheck - Check brand name availability across 27 platforms

  • depsonar - Dependency audit, security scan, license check

License

MIT

A
license - permissive license
-
quality - not tested
D
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Soflutionltd/deepcrawl'

If you have feedback or need assistance with the MCP directory API, please join our Discord server