Import.io (Web Data Extraction)

Import.io (Web Data Extraction) MCP Connector for Claude

A+

Extract structured data from any website via Import.io — run extractors, manage bulk crawls, and monitor API usage.

10 tools Official Updated Jun 28, 2026 Official Vinkius Partner

Connect your Import.io account to any AI agent and take full control of your web data extraction and large-scale scraping through natural conversation.

What you can do

  • Precision Extraction — Trigger predefined extractors for specific URLs to retrieve clean, structured JSON data directly from your agent
  • Bulk Crawling — Start large-scale data extraction jobs across multiple pages concurrently using managed crawlers and monitor their progress in real-time
  • AI-Powered Magic API — Automatically identify and extract tabular data from any website without pre-configured extractors, perfect for rapid exploration
  • Status Monitoring — Poll ongoing extraction runs and crawl jobs to track processing states, success rates, and total pages processed
  • Data Export — Retrieve extraction results in structured JSON or CSV formats, ready for spreadsheet processing or downstream application logic
  • Usage Audit — Monitor your monthly API credit consumption and subscription limits to manage your data extraction budget effectively

How it works

  1. Subscribe to this server
  2. Enter your Import.io API Key
  3. Start extracting web data from Claude, Cursor, or any MCP-compatible client

Who is this for?

  • Data Analysts — automate the collection of market data and pricing intelligence through natural conversation
  • Market Researchers — perform large-scale competitor monitoring and web audits without manual scraping scripts
  • Product Managers — verify data extraction schemas and monitor crawler health across multiple projects efficiently
data-extractionweb-crawlingstructured-datajson-exportautomationdata-pipeline

10 tools expose this connector's capabilities to your AI agent.

get_crawl_data

Retrieve the unified JSON output of a completed import.io crawl job

get_crawl_status

start_crawl to check progress (pages processed, success rate, current state). Check the status of an ongoing import.io crawl job

download_csv

Returns the first 1000 characters and file stats. Ideal for spreadsheet processing. Download extraction data directly as CSV text

get_extractor_data

Fails if the run is still in progress. Retrieve structured JSON data from a completed import.io extraction

list_extractors

Useful for finding the correct extractor_id to run dynamically. List all extractors configured on the import.io account

run_magic_api

Ideal for quick, unstructured exploration. Run the import.io automated Magic API against a URL

run_extractor

Triggers an async extraction run and returns a run_id. Use this run_id to poll for status or data. Trigger an import.io extractor for a specific URL

start_crawl

Starts a large-scale data extraction job across multiple pages concurrently. Trigger an import.io bulk crawl job

get_extractor_status

run_extractor. Returns the current state (running, completed, failed) and metadata about the run. Check the status of an active or past import.io extraction run

account_usage

No parameters required. Check import.io account API credit usage

See how to talk to your AI agent using Import.io (Web Data Extraction).

Run extractor 'ext-123' against 'https://example.com/products'

Extraction run started. Run ID: 'run-98765'. I'll monitor the status for you. It usually takes a few seconds to process dynamic pages. Would you like me to fetch the results once it's completed?

List all extractors in my Import.io account

I've found 5 extractors in your account: 'Amazon Scraper' (ext-001), 'Competitor Pricing' (ext-002), 'Real Estate Leads' (ext-003), 'Stock Tracker' (ext-004), and 'News Monitor' (ext-005). Which one would you like to run?

Check my monthly API credit usage

You've used 12,450 credits out of your 50,000 monthly limit (24.9%). You have 37,550 credits remaining for this billing cycle. Your account health is excellent.

Yes. Use the `run_magic_api` tool. It uses Import.io's AI logic to automatically detect and extract structured or tabular data from any URL, making it ideal for quick exploration of new data sources.

Related Connectors