This skill should be used when the user asks to "research web data", "scrape websites", "extract web data", "perform market research", "analyze competitors",...
Advanced web research powered by Bright Data MCP - perform market analysis, competitive intelligence, data extraction, and comprehensive web research with anti-bot protection.
This skill provides complete workflows for automated web research using Bright Data MCP. Handle search discovery, content collection, structured data extraction, and comprehensive analysis with browser automation support.
Activate this skill when the user's request involves:
Use search_engine tool to find relevant sources:
{
"tool": "search_engine",
"parameters": {
"query": "site:etsy.com nba merchandise",
"engine": "google",
"cursor": "0"
}
}
Search strategies:
"site:etsy.com keywords""machine learning in healthcare""iphone -case -cover"Three collection modes based on research depth:
Quick Mode (3-5 URLs, serial processing):
scrape_as_markdown for each URLStandard Mode (10-20 URLs, parallel batch):
scrape_batch for up to 10 URLs concurrentlyDeep Mode (20-50 URLs, browser automation):
scraping_browser_navigate for JavaScript-rendered pagesscraping_browser_links to discover page linksscraping_browser_click for interactionsUse extract tool for AI-powered structured data extraction:
{
"tool": "extract",
"parameters": {
"url": "https://example.com/product",
"extraction_prompt": "Extract: product name, price as number, rating (0-5), number of reviews, seller name, availability status"
}
}
Common extraction schemas:
Three report formats for different use cases:
Report Format (default):
JSON Format:
Markdown Format:
Understand the research intent:
Use search_engine to find URLs:
Choose appropriate mode:
scrape_as_markdown per URLscrape_batch 10 URLs at oncescraping_browser_navigate + browser toolsHandle errors gracefully:
Apply extraction schema:
extract with custom promptsProcess and analyze:
Generate output:
scrape_batch for parallel processing (10x faster)deep mode when necessary (much slower)Query: "site:etsy.com nba merchandise"
Mode: standard
Extract: product name, price, rating, reviews, seller
Output: report
Expected: Price analysis, popular products, top sellers
Query: "iphone 15 pro max 256GB price comparison"
Mode: standard
Extract: retailer, price, availability, shipping
Output: json
Expected: Structured comparison with best deal identified
Query: "machine learning in healthcare 2024 papers"
Mode: standard
Extract: title, authors, date, key findings, methodology
Output: report
Expected: Literature review with trends and insights
Query: "competitor.com features pricing"
Mode: deep
Extract: feature name, description, pricing tier, availability
Output: report
Expected: Feature comparison, pricing analysis, recommendations
Purpose: Find relevant web pages Parameters: query (required), engine (google/bing/yandex), cursor (page number) Returns: SERP results in markdown
Purpose: Get clean, AI-ready markdown Parameters: url (required) Returns: Formatted markdown without ads/clutter
Purpose: Get raw HTML Parameters: url (required) Returns: Complete HTML document
Purpose: AI-powered structured data extraction Parameters: url (required), extraction_prompt (optional) Returns: JSON object with extracted data
Purpose: Process multiple URLs in parallel Parameters: urls (array, max 10) Returns: Array of page contents
Purpose: Navigate JavaScript-rendered pages Parameters: url (required) Returns: Page info (title, URL, status)
Purpose: Click elements on page Parameters: selector (CSS selector) Returns: Action result
Purpose: Get all links on current page Parameters: None Returns: Array of links with text, href, selector
mode=deepmax_resultsmode=standard instead of deepRead with offset and limit to read in chunksGrep to search for specific patternsGlob to find files by patternFor detailed workflows and techniques:
references/search-discovery.md - Search strategies and URL discoveryreferences/content-scraping.md - Content collection methodsreferences/data-extraction.md - Extraction schemas and validationreferences/deep-scraping.md - Browser automation techniquesreferences/analysis-report.md - Analysis and report generationComplete research examples:
examples/market-research-etsy-nba.md - E-commerce market researchexamples/competitive-analysis-pricing.md - Price comparison workflowexamples/academic-research-ml-healthcare.md - Academic literature reviewThis SKILL.md provides core workflows and quick reference (approximately 2,000 words).
For detailed implementation patterns, advanced techniques, and comprehensive examples, consult the references/ files which load as needed during research tasks.
ZIP package — ready to use