Data Analytics Engineering

by vasilyu1983

documentdata

Analytics engineering for reliable metrics and BI readiness. Build transformation layers, dimensional models, semantic metrics, data quality tests, and documentation. Use when you need dbt or SQL transformation strategy, metrics definition, or analytics data modeling.

Skill Details

Repository Files

1 file in this skill directory


name: data-analytics-engineering description: Analytics engineering for reliable metrics and BI readiness. Build transformation layers, dimensional models, semantic metrics, data quality tests, and documentation. Use when you need dbt or SQL transformation strategy, metrics definition, or analytics data modeling.

Data Analytics Engineering

Scope

  • Define metrics, grains, and dimensional models.
  • Build transformation layers and semantic models.
  • Implement data quality tests and observability.
  • Document datasets, lineage, and ownership.
  • Align analytics outputs with BI and product needs.

Ask For Inputs

  • Business metrics and decision use cases.
  • Source systems, data freshness, and latency needs.
  • Existing warehouse, tooling, and orchestration.
  • Expected data volumes and change cadence.
  • Governance requirements and access controls.

Workflow

  1. Define metric dictionary and grains.
  2. Design staging, intermediate, and mart layers.
  3. Model dimensions and facts with clear keys.
  4. Build semantic layer and metric definitions.
  5. Add tests for freshness, nulls, ranges, and duplicates.
  6. Document lineage, owners, and SLAs.
  7. Plan rollout, backfills, and validation checks.

Outputs

  • Metric dictionary and semantic model.
  • Data model with schema and grain definitions.
  • Transformation plan and dbt or SQLMesh structure.
  • Data quality test suite and alerting plan.
  • Documentation and ownership map.

Quality Checks

  • Keep metric definitions stable and versioned.
  • Treat metrics as APIs: document changes, deprecate safely, and backfill deliberately.
  • Define data contracts for core tables (schema, freshness, keys) to control downstream breakage.
  • Avoid mixed grains in a single model.
  • Ensure tests cover critical joins and aggregates.
  • Validate against source of truth and historical baselines.

Templates

  • assets/metric-dictionary.md for metric definitions and owners.
  • assets/semantic-layer-spec.md for entities, measures, and dimensions.
  • assets/data-quality-test-plan.md for test coverage planning.

Resources

  • references/modeling-patterns.md for modeling guidance and data quality patterns.
  • references/tool-comparison-2026.md for dbt vs SQLMesh vs Coalesce decision matrix.
  • data/sources.json for curated vendor docs and trend-tracking sources (use as a WebSearch seed list).

Related Skills


Trend Awareness Protocol

IMPORTANT: When users ask recommendation questions about analytics engineering, data modeling, or BI, you MUST use WebSearch to check current trends before answering. If WebSearch is unavailable, use data/sources.json + web browsing and state what you verified vs assumed.

Trigger Conditions

  • "What's the best tool for [analytics engineering/data modeling/BI]?"
  • "What should I use for [transformation/semantic layer/metrics]?"
  • "What's the latest in analytics engineering?"
  • "Current best practices for [dbt/metrics layers/data quality]?"
  • "Is [tool/approach] still relevant in 2026?"
  • "[dbt] vs [SQLMesh] vs [other]?"
  • "Best BI tool for [use case]?"
  • "SQLMesh acquisition" or "Fivetran transformation"
  • "Agentic analytics" or "AI data workflows"
  • "Metric debt" or "metric governance"

Required Searches

  1. Search: "analytics engineering best practices 2026"
  2. Search: "[dbt/SQLMesh/semantic layer] vs alternatives 2026"
  3. Search: "analytics engineering trends January 2026"
  4. Search: "[specific tool] new releases 2026"
  5. Search: "agentic analytics AI data 2026" (for AI-related queries)

What to Report

After searching, provide:

  • Current landscape: What analytics tools/patterns are popular NOW
  • Emerging trends: New tools, patterns, or standards gaining traction
  • Deprecated/declining: Tools/approaches losing relevance or support
  • Recommendation: Based on fresh data, not just static knowledge

Example Topics (verify with fresh search)

  • Transformation tools (dbt, SQLMesh, Coalesce)
  • Semantic layers (dbt Semantic Layer, Cube, AtScale, warehouse-native)
  • Metrics stores and headless BI
  • Data quality tools (dbt tests, Elementary, dbt-expectations/Metaplane)
  • BI platforms (Metabase, Superset, Lightdash, Hex)
  • Data modeling patterns (dimensional, wide tables, activity schema)
  • Analytics engineering workflows and CI/CD
  • Agentic AI workflows for analytics
  • Data mesh and domain-owned data products

Related Skills

Xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

data

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Analyzing Financial Statements

This skill calculates key financial ratios and metrics from financial statement data for investment analysis

data

Data Storytelling

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

data

Kpi Dashboard Design

Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.

designdata

Dbt Transformation Patterns

Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.

testingdocumenttool

Sql Optimization Patterns

Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.

designdata

Clinical Decision Support

Generate professional clinical decision support (CDS) documents for pharmaceutical and clinical research settings, including patient cohort analyses (biomarker-stratified with outcomes) and treatment recommendation reports (evidence-based guidelines with decision algorithms). Supports GRADE evidence grading, statistical analysis (hazard ratios, survival curves, waterfall plots), biomarker integration, and regulatory compliance. Outputs publication-ready LaTeX/PDF format optimized for drug develo

developmentdocumentcli

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Skill Information

Category:Document
Last Updated:1/26/2026