Data Connector

by argythana

data

Connect to and inspect data sources. Use this skill when you need to verify data access, inspect table schemas, check row counts, or understand the structure of a dataset before performing analysis.

Skill Details


name: data-connector description: Connect to and inspect data sources. Use this skill when you need to verify data access, inspect table schemas, check row counts, or understand the structure of a dataset before performing analysis.

Data Connector

Connect to data sources and retrieve basic information about datasets.

When to Use

  • Before starting any data analysis task
  • To verify data is accessible and readable
  • To inspect column names and types
  • To check dataset size (rows, columns, file size)

Available Scripts

data-connect - Inspect Data Source

Connects to a data source and returns schema and summary information.

# Basic usage (outputs to stdout)
data-connect --source <path>

# Save to file
data-connect --source <path> --output report.md

Arguments:

  • --source (required): Path to data file or connection string
  • --output: Output file path (default: stdout)
  • --type: Override source type detection (parquet, csv, json)

Output Format

The script produces a markdown report with:

  • Source path and type
  • Row count and column count
  • File size (if applicable)
  • Column listing with data types

Example Output

# Data Connection Report

- **source**: data/sales.parquet
- **type**: parquet
- **row_count**: 1,234,567
- **column_count**: 15
- **file_size**: 45.2 MB

## Columns

| Column | Type |
|--------|------|
| id | INTEGER |
| date | DATE |
| amount | DOUBLE |
| category | VARCHAR |

Supported Data Sources

The connector auto-detects source type from file extension:

  • .parquet - Apache Parquet files
  • .csv - CSV files (auto-detects delimiter)
  • .json, .jsonl - JSON files
  • .db, .duckdb - DuckDB database files

Related Skills

Xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

data

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Analyzing Financial Statements

This skill calculates key financial ratios and metrics from financial statement data for investment analysis

data

Data Storytelling

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

data

Kpi Dashboard Design

Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.

designdata

Dbt Transformation Patterns

Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.

testingdocumenttool

Sql Optimization Patterns

Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.

designdata

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Xlsx

Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.

tooldata

Skill Information

Category:Data
Last Updated:1/5/2026