Example Data Processor

by fkesheh

data

Process CSV data files by cleaning, transforming, and analyzing them. Use this when users need to work with CSV files, clean data, or perform basic data analysis tasks.

Skill Details

Repository Files

4 files in this skill directory


name: example-data-processor
description: Process CSV data files by cleaning, transforming, and analyzing them. Use this when users need to work with CSV files, clean data, or perform basic data analysis tasks.

Example Data Processor

This skill demonstrates a complete skill structure with scripts, references, and proper documentation.

What This Skill Does

Processes CSV data files with these capabilities:

  • Clean and validate data
  • Transform columns
  • Generate summary statistics
  • Export results

Usage

Process a CSV file

To process a CSV file, make a plain-language request such as:

Process the data in myfile.csv

The skill will:

  1. Read the CSV file
  2. Clean the data (remove nulls, fix formats)
  3. Generate statistics
  4. Output a summary report
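The four steps above can be sketched with the standard library alone. This is a minimal illustration, not the contents of scripts/process_csv.py: the function name process_csv, the sample columns, and the rule "drop any row with an empty field" are all assumptions made for the example.

```python
import csv
import io
import statistics


def process_csv(text):
    """Read CSV text, drop incomplete rows, and summarize numeric columns."""
    # Step 1: read the CSV into a list of dictionaries keyed by header name.
    rows = list(csv.DictReader(io.StringIO(text)))

    # Step 2: clean. Strip whitespace and drop rows with any empty field.
    # Non-string values (short or over-long rows) count as empty here.
    cleaned = []
    for row in rows:
        stripped = {
            key: value.strip() if isinstance(value, str) else ""
            for key, value in row.items()
        }
        if all(stripped.values()):
            cleaned.append(stripped)

    # Step 3: statistics for each column whose values all parse as numbers.
    stats = {}
    for name in (cleaned[0] if cleaned else {}):
        try:
            values = [float(row[name]) for row in cleaned]
        except ValueError:
            continue  # non-numeric column, skip it
        stats[name] = {
            "min": min(values),
            "max": max(values),
            "mean": statistics.mean(values),
        }

    # Step 4: return a small summary report.
    return {"rows_read": len(rows), "rows_kept": len(cleaned), "stats": stats}


# Hypothetical sample data: the second row has an empty amount and is dropped.
sample = "region,amount\nnorth,10\nsouth,\nnorth,30\n"
report = process_csv(sample)
print(report)
```

A real script would likely read from a file path and write the report to disk; the in-memory string keeps the sketch self-contained.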

Custom Processing

To customize processing, name the operation in the request:

Process sales.csv and group by region
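One plausible reading of "group by region" is a per-group total. The sketch below assumes hypothetical column names (region, amount) and a hypothetical helper totals_by_group; the skill's actual grouping behavior is not documented here.

```python
import csv
import io
from collections import defaultdict


def totals_by_group(text, group_column, value_column):
    """Sum value_column for each distinct value of group_column."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(text)):
        totals[row[group_column]] += float(row[value_column])
    return dict(totals)


# Hypothetical stand-in for the contents of sales.csv.
sample = "region,amount\nnorth,10\nsouth,5\nnorth,30\n"
print(totals_by_group(sample, "region", "amount"))
```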

Scripts

scripts/process_csv.py - Main data processing script

  • Reads CSV files
  • Applies transformations
  • Generates output

scripts/fetch_data.py - API data fetcher (demonstrates uv dependencies)

  • Fetches data from APIs using requests
  • Formats terminal output with rich
  • Declares its dependencies as inline script metadata (PEP 723), which uv installs automatically
  • No manual pip install needed
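For readers unfamiliar with PEP 723, the sketch below shows what an inline metadata header looks like. It is a hypothetical script, not the actual scripts/fetch_data.py: the file name fetch_sketch.py, the helper summarize, and the assumption that the endpoint returns a JSON array of objects are all illustrative.

```python
# /// script
# requires-python = ">=3.9"
# dependencies = [
#     "requests",
#     "rich",
# ]
# ///
"""Hypothetical PEP 723 sketch; not the actual scripts/fetch_data.py."""
import sys


def summarize(records):
    """Return (record count, sorted field names) for a list of JSON objects."""
    fields = sorted({key for record in records for key in record})
    return len(records), fields


def main(url):
    # Imported inside main() so summarize() stays importable
    # even where requests and rich are not installed.
    import requests
    from rich.console import Console
    from rich.table import Table

    response = requests.get(url, timeout=10)
    response.raise_for_status()
    records = response.json()  # assumption: a JSON array of objects
    count, fields = summarize(records)

    table = Table(title=f"{count} records")
    table.add_column("Field")
    for name in fields:
        table.add_row(name)
    Console().print(table)


# Run only when exactly one URL is passed on the command line.
if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.argv[1])
```

Saved as fetch_sketch.py, this could be started with uv run fetch_sketch.py followed by a URL; uv reads the comment block at the top and provides requests and rich in a temporary environment before running the script.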

scripts/validate.py - Data validation script

  • Checks data quality
  • Reports issues
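A quality check of this kind might look like the following sketch. The function validate_csv and its two rules (required columns exist, required values are non-empty) are assumptions for illustration; the checks that scripts/validate.py actually performs are not listed in this document.

```python
import csv
import io


def validate_csv(text, required_columns):
    """Return a list of readable issues; an empty list means none were found."""
    reader = csv.DictReader(io.StringIO(text))
    header = reader.fieldnames or []
    issues = [
        f"missing column: {name}"
        for name in required_columns
        if name not in header
    ]

    for row in reader:
        for name in required_columns:
            # Only check columns that exist; missing ones are reported above.
            if name in header and not (row.get(name) or "").strip():
                issues.append(f"line {reader.line_num}: empty value in {name}")
    return issues


# Hypothetical input: no region column, and an empty amount on line 3.
sample = "id,amount\n1,10\n2,\n"
issues = validate_csv(sample, ["id", "amount", "region"])
print("\n".join(issues))
```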

Configuration

The scripts use these environment variables:

  • OUTPUT_DIR - Where to save processed files (optional)
  • MAX_ROWS - Maximum rows to process (optional)

Set them using:

Set OUTPUT_DIR to /path/to/output
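Inside a script, the two optional variables could be read as sketched below. The helper load_settings and both fallbacks (current directory for OUTPUT_DIR, no row limit when MAX_ROWS is unset) are assumptions; the scripts' real defaults are not stated here.

```python
import os
from pathlib import Path


def load_settings(environ=os.environ):
    """Read the optional OUTPUT_DIR and MAX_ROWS settings from a mapping."""
    # Assumption: fall back to the current directory when OUTPUT_DIR is unset.
    output_dir = Path(environ.get("OUTPUT_DIR", "."))

    # Assumption: an unset or empty MAX_ROWS means "no limit".
    raw_max_rows = environ.get("MAX_ROWS")
    max_rows = int(raw_max_rows) if raw_max_rows else None

    return output_dir, max_rows


# Passing a plain dict instead of os.environ keeps the example reproducible.
print(load_settings({"OUTPUT_DIR": "/path/to/output", "MAX_ROWS": "1000"}))
```

Accepting the environment as a parameter is a design choice for testability; calling load_settings() with no argument reads the real process environment.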

Reference Documentation

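For detailed information, see the skill's reference files, such as the Data Formats reference mentioned under Troubleshooting.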

Troubleshooting

"File not found" error:

  • Ensure the CSV file exists
  • Provide the full path to the file

"Invalid data" error:

  • Check the CSV format matches expected schema
  • See Data Formats for requirements

Skill Information

Category: Data
Last Updated: 10/18/2025