Data Skills

Discover and use data skills to extend Claude's capabilities

665 Data Skills Available

All Categories (2689)

Creative (596), Data (665), Document (83), Enterprise (43), Skill (869), Technical (433)

Diagram Generator

by Anthropic

Generates architecture, database, and system diagrams using Mermaid syntax. Creates visual representations of system architecture, database schemas, component relationships, and data flows.

data

Data

Text To Sql

by Anthropic

Convert natural language queries to SQL. Use for database queries, data analysis, and reporting.

data

Data

Explore Data

Explores data in a Bauplan lakehouse safely using the Bauplan Python SDK. Use to inspect namespaces, tables, schemas, samples, and profiling queries, and to export larger result sets to files. Read-only exploration only; no writes or pipeline runs.

data

Data

Interpreting Culture Index

by Anthropic

Use when interpreting Culture Index surveys, CI profiles, behavioral assessments, or personality data. Supports individual interpretation, team composition (gas/brake/glue), burnout detection, profile comparison, hiring profiles, manager coaching, interview transcript analysis for trait prediction, candidate debrief, onboarding planning, and conflict mediation. Handles PDF vision or JSON input.

data

Data

AGPL-3.0

Acsets Hatchery

by Anthropic

Attributed C-Sets as algebraic databases. Category-theoretic data structures generalizing graphs and dataframes with Gay.jl color integration.

data

Data

Data Provenance

by Anthropic

data

Data

Matplotlib Best Practices

by Anthropic

Best practices for Matplotlib data visualization, plotting, and creating publication-quality figures in Python

data

Data

Pandas Best Practices

by Anthropic

Best practices for Pandas data manipulation, analysis, and DataFrame operations in Python

data

Data

Data Analyst

by Anthropic

Data analysis best practices with pandas, numpy, matplotlib, seaborn, and Jupyter notebooks.

data

Data

Segment Cdp

by Anthropic

Expert patterns for Segment Customer Data Platform including Analytics.js, server-side tracking, tracking plans with Protocols, identity resolution, destinations configuration, and data governance best practices. Use when any of these terms is mentioned: segment, analytics.js, customer data platform, cdp, tracking plan, event tracking, identify track page, data routing, analytics, tracking, data-pipeline, customer-data.

data

Data

Community Analytics

by Anthropic

Expert in measuring what matters in communities. Covers health metrics, engagement analytics, sentiment analysis, cohort tracking, and reporting. Knows that good data drives good decisions, and bad metrics drive bad behavior. Use when any of these terms is mentioned: community metrics, community analytics, measure community, community health, engagement metrics, community reporting.

data

Data

Ydata Eda Profiling

by Anthropic

Generate and compare ydata-profiling EDA reports with sampling, consistent random seeds, and HTML outputs; often follows duckdb-parquet-lab-workflow when data is queried from Parquet.

workflow, data

Data

Duckdb Parquet Lab Workflow

by Anthropic

Use DuckDB to query Parquet files, inspect metadata, join tables, and convert results to pandas for analysis; commonly precedes ydata-eda-profiling for EDA on extracted tables.

workflow, data

Data

Xlsx

by Anthropic

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. Use when the assistant needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc.) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modifying existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas.

data

Data

Proprietary. LICENSE.txt has complete terms

Vaex

by Anthropic

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that do not fit in memory.

data

Data

MIT license

Umap Learn

by Anthropic

UMAP dimensionality reduction. Fast nonlinear manifold learning for 2D/3D visualization, clustering preprocessing (HDBSCAN), supervised/parametric UMAP, for high-dimensional data.

data

Data

BSD-3-Clause license

Policyengine District Analysis

by Anthropic

Analyze policy impacts for congressional districts and representatives' constituents. Use when the user mentions a specific district (NY-17, CA-52), a representative's name, or asks about geographic policy impacts at district level. Provides HuggingFace district datasets.

data

Data

Exploratory Data Analysis

by Anthropic

Perform comprehensive exploratory data analysis on scientific data files across 200+ file formats. This skill should be used when analyzing any scientific data file to understand its structure, content, quality, and characteristics. Automatically detects file type and generates detailed markdown reports with format-specific analysis, quality metrics, and downstream analysis recommendations. Covers chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and general scientif

data

Data

MIT license

Excel Data Analyzer

by Anthropic

Analyze messy and unstructured Excel files to identify data quality issues, detect format inconsistencies, find missing values, and generate comprehensive analysis reports. Use when Claude needs to work with Excel files (.xlsx, .xls) for data quality assessment, structure analysis, or when users request data auditing, cleaning recommendations, or statistical summaries of spreadsheet data.

data

Data

Ds Review

by Anthropic

This skill should be used when running Phase 4 of the /ds workflow to review methodology, data quality, and statistical validity. Provides structured review checklists, confidence scoring, and issue identification for data analysis validation.

workflow, data

Data

Data Analyst

by Anthropic

Data analysis expert, proficient in data visualization, trend analysis, report generation, and predictive analysis.

data

Data

Researching Stocks

by Anthropic

Workflow for multi-step financial research requiring multiple data sources. Use for company comparisons, due diligence, comprehensive analysis, or complex financial questions.

workflow, data

Data

Diagram Generator

by Anthropic

Generates architecture, database, and system diagrams using Mermaid syntax. Creates visual representations of system architecture, database schemas, component relationships, and data flows.

data

Data

Data Analysis

by Anthropic

Skill for data analysis and visualization using Python

data

Data

Plox

by Anthropic

Plot timestamped logs as graphs. Use when user wants to visualize log data, plot numeric values over time, count events, track time deltas between events, compare multiple log files, or get statistics from logs.

data

Data

Stock Data Fetcher

by Anthropic

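Fetches China A-share stock data (via baostock) and caches it to local CSV files, so that large MCP responses do not take up context. Trigger scenarios: (1) fetching more than 100 K-line (candlestick) records, (2) querying the same stock's data multiple times, (3) analyzing the data with grep/awk, (4) the user mentions "save data" or "cache data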

data

Data

Aggregating Performance Metrics

by Anthropic

Aggregate and centralize performance metrics from applications, systems, databases, caches, and services. Use when consolidating monitoring data from multiple sources. Trigger with phrases like "aggregate metrics", "centralize monitoring", or "collect performance data".

data

Data

MIT

Managing Database Recovery

by Anthropic

data

Data

MIT

Preprocessing Data With Automated Pipelines

by Anthropic

Automate data cleaning, transformation, and validation for ML tasks.

data

Data

MIT

Detecting Data Anomalies

by Anthropic

data

Data

MIT

Chipseq Qc

by Anthropic

Performs ChIP-specific biological validation. It calculates metrics unique to protein-binding assays, such as Cross-correlation (NSC/RSC) and FRiP. Use this when you have filtered the BAM file and called peaks for ChIP-seq data. Do NOT use this skill for ATAC-seq data or general alignment statistics.

data

Data

Optimizing Queries

by Anthropic

Analyzes and optimizes SQL/NoSQL queries for performance. Use when reviewing query performance, optimizing slow queries, analyzing EXPLAIN output, suggesting indexes, identifying N+1 problems, recommending query rewrites, or improving database access patterns. Supports PostgreSQL, MySQL, SQLite, MongoDB, Redis, DynamoDB, and Elasticsearch.

data

Data

Database Administrator

by Anthropic

data

Data

Microdf

by Anthropic

Weighted pandas DataFrames for survey microdata analysis - inequality, poverty, and distributional calculations

data

Data

Synthesis Matrix

by Anthropic

Create evidence synthesis matrices for systematic reviews. Use when: (1) Organizing extracted data, (2) Comparing study characteristics, (3) Identifying patterns across studies, (4) Preparing synthesis for manuscripts.

data

Data

Hypothesis Test

by Anthropic

Guide selection and interpretation of statistical hypothesis tests. Use when: (1) Choosing appropriate test for research data, (2) Checking assumptions before analysis, (3) Interpreting test results correctly, (4) Reporting statistical findings, (5) Troubleshooting assumption violations.

data

Data

Data Visualization

by Anthropic

Create publication-quality data visualizations. Use when: (1) Presenting results, (2) Exploratory data analysis, (3) Manuscript preparation, (4) Grant proposals, (5) Presentations.

data

Data

Edn Analyzer

by Anthropic

Deep EDN template analyzer for Logseq database graphs. Analyzes template structure, counts classes/properties, finds orphaned items, checks quality, and compares variants. Use when analyzing template files, finding issues, or comparing different template versions.

template, data

Data

Data Cleaning Pipeline Generator

by Anthropic

Generates data cleaning pipelines for pandas/polars with handling for missing values, duplicates, outliers, type conversions, and data validation. Use when user asks to "clean data", "generate data pipeline", "handle missing values", or "remove duplicates from dataset".

data

Data

Database Query Optimizer

by Anthropic

Analyzes and optimizes database queries for PostgreSQL, MySQL, MongoDB with EXPLAIN plans, index suggestions, and N+1 query detection. Use when user asks to "optimize query", "analyze EXPLAIN plan", "fix slow queries", or "suggest database indexes".

data

Data
