Crawlbase

Crawlbase MCP Connector for Claude

A+

Scrape and crawl via Crawlbase — perform HTML extraction, handle JS-rendered pages, bypass CAPTCHAs, and scrape social profiles directly from any AI agent.

10 tools · Official · Updated Jun 28, 2026 · Official Vinkius Partner

Connect your Crawlbase (formerly ProxyCrawl) account to any AI agent and take full control of your web scraping and anonymous crawling workflows through natural conversation.

What you can do

  • Standard Scraper — Extract the HTML of a page through Crawlbase datacenter proxies
  • JS Rendering — Render JavaScript-driven pages, such as single-page apps, in the headless engine and extract the content they expose
  • Structured JSON Extraction — Run auto-extraction so that raw HTTP output comes back as structured JSON
  • Screenshot Capture — Request a snapshot of a page and get back a Crawlbase screenshot URL
  • Specialized Scraping — Use dedicated scrapers for Amazon products, LinkedIn profiles, Facebook pages, and Twitter (X) profiles
  • Search Engine Discovery — Retrieve and parse Google search results pages (SERPs) through proxies that bypass CAPTCHAs
  • Custom Proxy Management — Send custom requests through the proxies with specific headers and crawling logic

How it works

  1. Subscribe to this server
  2. Enter your Crawlbase Normal Token and your optional JavaScript Token (found in your Crawlbase Dashboard)
  3. Start scraping and crawling from Claude, Cursor, or any MCP-compatible client

Who is this for?

  • Data Analysts — extract structured web data and search engine results without writing complex scraping scripts
  • Growth Hackers — monitor competitor products on Amazon or social profiles on LinkedIn and Twitter in real time
  • Developers — test and debug web extraction pipelines and JS-rendering logic through natural conversation
  • Market Researchers — perform deep web crawls and capture snapshots of target sites for offline analysis

proxy · captcha-solving · html-extraction · headless-browser · data-collection · web-crawling

10 tools expose this connector's capabilities to your AI agent.

scrape_html

Extract the HTML of a page through Crawlbase datacenter proxies

scrape_js_rendered

Render a JavaScript-driven page in the headless engine and return its content (requires the JavaScript Token)

scrape_json_format

Extract a page and return its fields as structured JSON

get_screenshot_link

Request a snapshot of a page and return a Crawlbase screenshot URL

scrape_amazon

Extract product data from an Amazon page

scrape_linkedin

Extract structured data from a LinkedIn profile

scrape_facebook

Extract structured data from a Facebook page

scrape_google_serp

Retrieve Google search results (SERP) for a query

scrape_twitter

Extract data from a Twitter (X) profile via Crawlbase

custom_scrape

Send a custom request through the proxies with specific headers and crawling logic
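
For readers curious about what an MCP client sends when the agent picks one of these tools, here is a minimal sketch of a `tools/call` request in the JSON-RPC shape the Model Context Protocol uses. The tool names come from the list above. The `url` argument name and the `build_tool_call` helper are assumptions made for illustration, since this page does not document each tool's input schema.

```python
import json
from itertools import count

# Tool names as listed on this page.
TOOLS = {
    "scrape_html", "scrape_js_rendered", "scrape_json_format",
    "get_screenshot_link", "scrape_amazon", "scrape_linkedin",
    "scrape_facebook", "scrape_google_serp", "scrape_twitter",
    "custom_scrape",
}

_request_ids = count(1)  # JSON-RPC requests need a unique id


def build_tool_call(tool: str, arguments: dict) -> str:
    """Build an MCP `tools/call` JSON-RPC request (hypothetical helper)."""
    if tool not in TOOLS:
        raise ValueError(f"unknown tool: {tool}")
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        # The argument names are an assumption; check the tool's schema.
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(request)


if __name__ == "__main__":
    print(build_tool_call("scrape_amazon", {"url": "https://example.com/item"}))
```

In practice the client (Claude, Cursor, or another MCP-compatible application) builds this message for you, so the sketch is mainly useful when debugging a connection.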

See how to talk to your AI agent using Crawlbase.

Scrape the price and features from this Amazon product: [Amazon URL]

Amazon scraping complete! I've extracted the following JSON data: Title: 'Eco-Smart Watch', Price: '$199.00', Rating: '4.5 stars', Features: ['Waterproof', 'Sleep tracking', '10-day battery'].

Get Google search results for 'best machine learning platforms 2024'

I've extracted the Google SERP results for your query. Top organic links include 'Top 10 ML Platforms (Site A)', 'The Future of AI (Site B)', and 'Enterprise ML Guide (Site C)'. Would you like the full meta descriptions for these?

Take a screenshot of https://example.com

Screenshot requested! Crawlbase is generating the snapshot. You can access the rendered image at this temporary proxy link: [Crawlbase Screenshot URL].

Use the Normal Token for fast, static HTML extraction. Switch to the JavaScript Token when the target site uses frameworks like React or Angular, where content is rendered dynamically in the browser. The 'scrape_js_rendered' tool requires the JS Token to function.
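
The token rule above can be sketched as a small selection function. This is an illustration only: the endpoint `https://api.crawlbase.com/` and the `token` and `url` query parameters are assumptions based on Crawlbase's public Crawling API, and `build_request_url` is a hypothetical helper, not part of this connector.

```python
from typing import Optional
from urllib.parse import urlencode

# Assumed endpoint; confirm against the Crawlbase documentation.
API_ENDPOINT = "https://api.crawlbase.com/"


def build_request_url(
    target_url: str,
    normal_token: str,
    js_token: Optional[str] = None,
    needs_js: bool = False,
) -> str:
    """Pick the token that matches the page type and build a request URL."""
    if needs_js:
        # JS-rendered pages (React, Angular, ...) need the JavaScript Token.
        if not js_token:
            raise ValueError("a JavaScript Token is required for JS-rendered pages")
        token = js_token
    else:
        # Static HTML is served with the Normal Token.
        token = normal_token
    return API_ENDPOINT + "?" + urlencode({"token": token, "url": target_url})


if __name__ == "__main__":
    print(build_request_url("https://example.com", "NORMAL", "JS", needs_js=True))
```

Failing early when the JavaScript Token is missing mirrors the note that `scrape_js_rendered` cannot function without it.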

Related Connectors