Enriching Tables With Web Data
by brightdata
Enriches Excel tables with up-to-date web data using Bright Data MCP tools. Use when the user asks to enrich, update, or add information to spreadsheet data from LinkedIn, Instagram, Amazon, e-commerce sites, social media, or any web source. Supports company profiles, product data, social media metrics, reviews, and more.
Skill Details
Repository Files
1 file in this skill directory
name: enriching-tables-with-web-data description: Enriches Excel tables with up-to-date web data using Bright Data MCP tools. Use when the user asks to enrich, update, or add information to spreadsheet data from LinkedIn, Instagram, Amazon, e-commerce sites, social media, or any web source. Supports company profiles, product data, social media metrics, reviews, and more.
Table Enrichment with Bright Data
Enrich Excel tables with current web data using Bright Data's 60+ MCP tools.
Requirements
Install openpyxl for Excel file handling:
pip install openpyxl --break-system-packages
Workflow
1. Analyze the table
Read the Excel file and identify:
- Columns that contain identifiers (URLs, usernames, product IDs, company names)
- Columns that need enrichment (empty or to be updated)
- Data type (social media, e-commerce, business, etc.)
import openpyxl
workbook = openpyxl.load_workbook("file.xlsx")
sheet = workbook.active
# Read headers and first few rows to understand structure
headers = [cell.value for cell in sheet[1]]
sample_data = [[cell.value for cell in row] for row in list(sheet.rows)[1:6]]
2. Match tools to data type
Based on identifiers in the table, select appropriate Bright Data tools:
Social Media:
BrightData:web_data_linkedin_person_profile- LinkedIn profilesBrightData:web_data_linkedin_company_profile- LinkedIn companiesBrightData:web_data_instagram_profiles- Instagram profilesBrightData:web_data_tiktok_profiles- TikTok profilesBrightData:web_data_youtube_profiles- YouTube channels
E-commerce:
BrightData:web_data_amazon_product- Amazon productsBrightData:web_data_walmart_product- Walmart productsBrightData:web_data_ebay_product- eBay productsBrightData:web_data_etsy_products- Etsy products
Business:
BrightData:web_data_crunchbase_company- Company dataBrightData:web_data_zoominfo_company_profile- ZoomInfo data
General:
BrightData:scrape_as_markdown- Any public webpage
3. Enrich data row by row
For each row, call the appropriate MCP tool and extract relevant fields:
# Example: Enriching LinkedIn profiles
for row_idx, row in enumerate(sheet.iter_rows(min_row=2), start=2):
linkedin_url = row[url_column_idx].value
if linkedin_url:
# Call BrightData:web_data_linkedin_person_profile
# Extract: name, title, company, location, connections, etc.
# Write to corresponding columns
sheet.cell(row=row_idx, column=name_col).value = profile_data['name']
sheet.cell(row=row_idx, column=title_col).value = profile_data['title']
4. Handle errors and missing data
- Skip rows with invalid/missing identifiers
- Log which rows were enriched successfully
- Note rate limits or tool failures
- Preserve original data
5. Save enriched file
Save to /mnt/user-data/outputs/:
output_path = '/mnt/user-data/outputs/enriched_table.xlsx'
workbook.save(output_path)
Provide link: [View enriched table](computer:///mnt/user-data/outputs/enriched_table.xlsx)
Tool selection logic
If table has LinkedIn URLs → Use BrightData:web_data_linkedin_person_profile or BrightData:web_data_linkedin_company_profile
If table has Instagram URLs → Use BrightData:web_data_instagram_profiles
If table has Amazon URLs (with /dp/) → Use BrightData:web_data_amazon_product
If table has product names but no URLs → Use BrightData:search_engine to find URLs first, then scrape
If table has company names → Search for LinkedIn/Crunchbase URLs, then enrich
If URLs are present but platform unknown → Use BrightData:scrape_as_markdown
Best practices
- Batch similar requests: Group rows by data type before calling tools
- Start small: Test enrichment on first 3-5 rows before processing entire table
- Preserve originals: Create new columns for enriched data
- Show progress: Update user after every 10-20 rows processed
- Handle nulls: Explicitly mark cells where data couldn't be retrieved
Example enrichment types
LinkedIn profiles → Add: job title, company, location, connections, industry
Instagram accounts → Add: followers, posts count, engagement rate, bio
Amazon products → Add: price, rating, review count, availability, seller
Company names → Add: LinkedIn URL, website, employee count, funding, industry
Generic URLs → Add: page title, description, key content, last updated
Related Skills
Xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Analyzing Financial Statements
This skill calculates key financial ratios and metrics from financial statement data for investment analysis
Data Storytelling
Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.
Kpi Dashboard Design
Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Sql Optimization Patterns
Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
Xlsx
Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.
